Search CORE

14 research outputs found

Using Regular Languages to Explore the Representational Capacity of Recurrent Neural Architectures

Author: AS Reber
AW Smith
B Yoshua
G Jager
I Simon
J Rogers
JL Elman
M Casey
MP Marcus
N Chomsky
N Chomsky
S Hochreiter
WT Fitch
Y LeCun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

The presence of Long Distance Dependencies (LDDs) in sequential data poses significant challenges for computational models. Various recurrent neural architectures have been designed to mitigate this issue. In order to test these state-of-the-art architectures, there is growing need for rich benchmarking datasets. However, one of the drawbacks of existing datasets is the lack of experimental control with regards to the presence and/or degree of LDDs. This lack of control limits the analysis of model performance in relation to the specific challenge posed by LDDs. One way to address this is to use synthetic data having the properties of subregular languages. The degree of LDDs within the generated data can be controlled through the k parameter, length of the generated strings, and by choosing appropriate forbidden strings. In this paper, we explore the capacity of different RNN extensions to model LDDs, by evaluating these models on a sequence of SPk synthesized datasets, where each subsequent dataset exhibits a longer degree of LDD. Even though SPk are simple languages, the presence of LDDs does have significant impact on the performance of recurrent neural architectures, thus making them prime candidate in benchmarking tasks.Comment: International Conference of Artificial Neural Networks (ICANN) 201

arXiv.org e-Print Archive

Crossref

Arrow@TUDublin

Using Regular Languages to Explore the Representational Capacity of Recurrent Neural Architectures

Author: Kelleher John D.
Mahalunkar Abhijit
Publication venue: Dublin Institute of Technology
Publication date: 01/01/2018
Field of study

Arrow@TUDublin

Multi-Element Long Distance Dependencies: Using SPk Languages to Explore the Characteristics of Long-Distance Dependencies

Author: Kelleher John
Mahalunkar Abhijit
Publication venue: Dublin Institute of Technology
Publication date: 01/01/2019
Field of study

In order to successfully model Long Distance Dependencies (LDDs) it is necessary to under-stand the full-range of the characteristics of the LDDs exhibited in a target dataset. In this paper, we use Strictly k-Piecewise languages to generate datasets with various properties. We then compute the characteristics of the LDDs in these datasets using mutual information and analyze the impact of factors such as (i) k, (ii) length of LDDs, (iii) vocabulary size, (iv) forbidden strings, and (v) dataset size. This analysis reveal that the number of interacting elements in a dependency is an important characteristic of LDDs. This leads us to the challenge of modelling multi-element long-distance dependencies. Our results suggest that attention mechanisms in neural networks may aide in modeling datasets with multi-element long-distance dependencies. However, we conclude that there is a need to develop more efficient attention mechanisms to address this issue

arXiv.org e-Print Archive

Crossref

Arrow@TUDublin

Recommended from our members

Learning Interactions of Local and Non-Local Phonotactic Constraints from Positive Input

Author: Aksënova Alëna
De Santo Aniello
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/01/2021
Field of study

This paper proposes a grammatical inference algorithm to learn input-sensitive tier-based strictly local languages across multiple tiers from positive data only, when the locality of the tier-constraints and the tier-projection function is set to 2 (MITSL; De Santo and Graf, 2019). We conduct simulations showing that the algorithm succeeds in learning MITSL patterns over a set of artificial languages

ScholarWorks@UMass Amherst