    Provably Stable Interpretable Encodings of Context Free Grammars in RNNs with a Differentiable Stack

    Full text link
    Given a collection of strings belonging to a context free grammar (CFG) and another collection of strings not belonging to the CFG, how might one infer the grammar? This is the problem of grammatical inference. Since CFGs are the languages recognized by pushdown automata (PDA), it suffices to determine the state transition rules and stack action rules of the corresponding PDA. One approach would be to train a recurrent neural network (RNN) to classify the sample data and attempt to extract these PDA rules. But neural networks are not a priori aware of the structure of a PDA and would likely require many samples to infer this structure. Furthermore, extracting the PDA rules from the RNN is nontrivial. We build an RNN specifically structured like a PDA, where weights correspond directly to the PDA rules. This requires a stack architecture that is differentiable (to enable gradient-based learning) and stable (an unstable stack will show deteriorating performance with longer strings). We propose a stack architecture that is differentiable and that provably exhibits orbital stability. Using this stack, we construct a neural network that provably approximates a PDA for strings of arbitrary length. Moreover, our model and method of proof can easily be generalized to other state machines, such as a Turing Machine. (Comment: 20 pages, 2 figures.)
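    A hedged illustration, not the authors' construction: the sketch below shows, in NumPy, one generic way to make a stack differentiable. The stack is a matrix of slot vectors, and a continuous action distribution over push, pop, and no-op blends the three hard outcomes, so gradients can flow through every step; the slot count, symbol dimension, and the soft_stack_step helper are illustrative choices only.

    ```python
    # Minimal sketch of a differentiable stack (generic, not the paper's
    # provably stable construction).  Every update is a convex combination
    # of hard outcomes, so the whole operation is differentiable.
    import numpy as np

    def soft_stack_step(stack, new_top, action):
        """stack: (depth, dim) slot matrix; new_top: (dim,) vector to push;
        action: (3,) probabilities over (push, pop, no-op)."""
        p_push, p_pop, p_noop = action
        depth, dim = stack.shape
        # Hard push: shift every slot down one position, write new_top on top.
        pushed = np.vstack([new_top, stack[:-1]])
        # Hard pop: shift every slot up one position, pad the bottom with zeros.
        popped = np.vstack([stack[1:], np.zeros((1, dim))])
        # Blend the three outcomes by their probabilities.
        return p_push * pushed + p_pop * popped + p_noop * stack

    # Example: push a one-hot "symbol" with probability 0.9.
    stack = np.zeros((4, 3))
    stack = soft_stack_step(stack, np.array([1.0, 0.0, 0.0]),
                            np.array([0.9, 0.05, 0.05]))
    print(stack[0])   # top slot now holds ~0.9 of the pushed symbol
    ```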

    State-Dependent Computation Using Coupled Recurrent Networks

    Get PDF
    Although conditional branching between possible behavioral states is a hallmark of intelligent behavior, very little is known about the neuronal mechanisms that support this processing. In a step toward solving this problem, we demonstrate by theoretical analysis and simulation how networks of richly interconnected neurons, such as those observed in the superficial layers of the neocortex, can embed reliable, robust finite state machines. We show how a multistable neuronal network containing a number of states can be created very simply by coupling two recurrent networks whose synaptic weights have been configured for soft winner-take-all (sWTA) performance. These two sWTAs have simple, homogeneous, locally recurrent connectivity except for a small fraction of recurrent cross-connections between them, which are used to embed the required states. This coupling between the maps allows the network to continue to express the current state even after the input that elicited that state is withdrawn. In addition, a small number of transition neurons implement the necessary input-driven transitions between the embedded states. We provide simple rules to systematically design and construct neuronal state machines of this kind. The significance of our finding is that it offers a method whereby the cortex could construct networks supporting a broad range of sophisticated processing by applying only small specializations to the same generic neuronal circuit.
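    As a rough illustration of the mechanism described above (not the paper's cortical circuit, and with made-up gains), the sketch below simulates a soft winner-take-all rate map in which each unit excites itself and all units share a common inhibitory signal; coupling two such maps with weak one-to-one cross-connections lets the pair keep expressing the selected state after the driving input is withdrawn.

    ```python
    # Toy simulation of two coupled sWTA maps holding a state (hypothetical
    # gains alpha, beta, gamma; not parameters from the paper).
    import numpy as np

    def swta_step(x, inp, alpha=1.5, beta=0.6, dt=0.1):
        """One Euler step of a sWTA map with clipped (saturating) rates.
        alpha: self-excitation gain, beta: strength of shared inhibition."""
        drive = alpha * x + inp - beta * x.sum()
        return np.clip(x + dt * (-x + drive), 0.0, 1.0)

    def coupled_step(xa, xb, inp_a, gamma=0.3):
        """Cross-connections of strength gamma link unit i of map A to unit i
        of map B and back; this loop is what holds the state."""
        return swta_step(xa, inp_a + gamma * xb), swta_step(xb, gamma * xa)

    xa, xb = np.zeros(3), np.zeros(3)
    inp = np.array([1.0, 0.0, 0.0])            # transient input selects state 0
    for t in range(300):
        xa, xb = coupled_step(xa, xb, inp if t < 100 else np.zeros(3))
    print(np.argmax(xa), xa.round(2))          # state 0 persists after input off
    ```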

    Stable Encoding of Finite-State Machines in Discrete-Time Recurrent Neural Nets with Sigmoid Units

    No full text
    There has been a lot of interest in the use of discrete-time recurrent neural nets (DTRNN) to learn finite-state tasks, with interesting results regarding the induction of simple finite-state machines from input–output strings. Parallel work has studied the computational power of DTRNN in connection with finite-state computation. This article describes a simple strategy to devise stable encodings of finite-state machines in computationally capable discrete-time recurrent neural architectures with sigmoid units and gives a detailed presentation on how this strategy may be applied to encode a general class of finite-state machines in a variety of commonly used first- and second-order recurrent neural networks. Unlike previous work that either imposed some restrictions to state values or used a detailed analysis based on fixed-point attractors, our approach applies to any positive, bounded, strictly growing, continuous activation function and uses simple bounding criteria based on a study of the conditions under which a proposed encoding scheme guarantees that the DTRNN is actually behaving as a finite-state machine.
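    A toy version of such an encoding (with an arbitrarily chosen gain H rather than the bounds derived in the article) can be written for a second-order sigmoid DTRNN: each programmed DFA transition receives a large positive weight and every state unit a bias of -H/2, so the sigmoid units saturate near 0 and 1 and track the automaton's state.

    ```python
    # Illustrative second-order sigmoid DTRNN encoding a small DFA
    # (parity of 1s).  The gain H = 16 is a hypothetical choice, not a
    # bound taken from the article.
    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    # delta[state][symbol] -> next state
    delta = {0: {0: 0, 1: 1},
             1: {0: 1, 1: 0}}
    n_states, n_symbols, H = 2, 2, 16.0

    # Second-order weights W[i, j, k]: next-state unit i, current-state unit j,
    # input-symbol unit k.  Programmed transitions get +H, all others 0.
    W = np.zeros((n_states, n_states, n_symbols))
    for j, row in delta.items():
        for k, i in row.items():
            W[i, j, k] = H
    bias = -H / 2.0

    def dtrnn_step(x, symbol):
        u = np.eye(n_symbols)[symbol]              # one-hot input symbol
        return sigmoid(np.einsum('ijk,j,k->i', W, x, u) + bias)

    x = np.eye(n_states)[0]                        # start in DFA state 0
    for s in [1, 1, 0, 1]:                         # three 1s -> odd parity
        x = dtrnn_step(x, s)
    print(np.argmax(x), x.round(3))                # unit 1 saturated near 1
    ```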
