From a0c3b540a6da26a79c43296ca059d58cdd1d6687 Mon Sep 17 00:00:00 2001 From: Yuchen Pei Date: Sat, 12 May 2018 16:55:31 +0200 Subject: minor edit on engine --- site/microblog.html | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) (limited to 'site/microblog.html') diff --git a/site/microblog.html b/site/microblog.html index 33e017f..c551725 100644 --- a/site/microblog.html +++ b/site/microblog.html @@ -22,7 +22,7 @@

2018-05-11

Some notes on RNN, FSM / FA, TM and UTM

Related to a previous micropost.

-

These slides from Toronto is a nice introduction to RNN (recurrent neural network) from a computational point of view. It states that RNN can simulate any FSM (finite state machine, a.k.a. finite automata abbr. FA) with a toy example computing the parity of a binary string.

+

These slides from Toronto are a nice introduction to RNN (recurrent neural network) from a computational point of view. It states that RNN can simulate any FSM (finite state machine, a.k.a. finite automata abbr. FA) with a toy example computing the parity of a binary string.

Goodfellow et. al.’s book (see page 372 and 374) goes one step further, stating that RNN with a hidden-to-hidden layer can simulate Turing machines, and not only that, but also the universal Turing machine abbr. UTM (the book referenced Siegelmann-Sontag), a property not shared by the weaker network where the hidden-to-hidden layer is replaced by an output-to-hidden layer (page 376).

By the way, the RNN with a hidden-to-hidden layer has the same architecture as the so-called linear dynamical system mentioned in Hinton’s video.

From what I have learned, the universality of RNN and feedforward networks are therefore due to different arguments, the former coming from Turing machines and the latter from an analytical view of approximation by step functions.

-- cgit v1.2.3