Search CORE

1,786 research outputs found

Evolino for recurrent support vector machines

Author: Gagliolo Matteo
Gomez Faustino
Schmidhuber Juergen
Wierstra Daan
Publication venue
Publication date: 15/12/2005
Field of study

Traditional Support Vector Machines (SVMs) need pre-wired finite time windows to predict and classify time series. They do not have an internal state necessary to deal with sequences involving arbitrary long-term dependencies. Here we introduce a new class of recurrent, truly sequential SVM-like devices with internal adaptive states, trained by a novel method called EVOlution of systems with KErnel-based outputs (Evoke), an instance of the recent Evolino class of methods. Evoke evolves recurrent neural networks to detect and represent temporal dependencies while using quadratic programming/support vector regression to produce precise outputs. Evoke is the first SVM-based mechanism learning to classify a context-sensitive language. It also outperforms recent state-of-the-art gradient-based recurrent neural networks (RNNs) on various time series prediction tasks.Comment: 10 pages, 2 figure

arXiv.org e-Print Archive

CiteSeerX

DI-fusion

Response Characterization for Auditing Cell Dynamics in Long Short-term Memory Networks

Author: Amini Alexander
Grosu Radu
Hasani Ramin M.
Lechner Mathias
Naser Felix
Rus Daniela
Publication venue
Publication date: 11/09/2018
Field of study

In this paper, we introduce a novel method to interpret recurrent neural networks (RNNs), particularly long short-term memory networks (LSTMs) at the cellular level. We propose a systematic pipeline for interpreting individual hidden state dynamics within the network using response characterization methods. The ranked contribution of individual cells to the network's output is computed by analyzing a set of interpretable metrics of their decoupled step and sinusoidal responses. As a result, our method is able to uniquely identify neurons with insightful dynamics, quantify relationships between dynamical properties and test accuracy through ablation analysis, and interpret the impact of network capacity on a network's dynamical distribution. Finally, we demonstrate generalizability and scalability of our method by evaluating a series of different benchmark sequential datasets

arXiv.org e-Print Archive

Crossref

IST Austria: PubRep (Institute of Science and Technology)

Comparing Deep Recurrent Networks Based on the MAE Random Sampling, a First Approach

Author: G Litjens
RN Bracewell
S Albelwi
S Haykin
S Hochreiter
S Min
VK Ojha
Y Bengio
Y LeCun
Publication venue
Publication date: 26/11/2018
Field of study

Recurrent neural networks have demonstrated to be good at tackling prediction problems, however due to their high sensitivity to hyper-parameter configuration, finding an appropriate network is a tough task. Automatic hyper-parameter optimization methods have emerged to find the most suitable configuration to a given problem, but these methods are not generally adopted because of their high computational cost. Therefore, in this study we extend the MAE random sampling, a low-cost method to compare single-hidden layer architectures, to multiple-hidden-layer ones. We validate empirically our proposal and show that it is possible to predict and compare the expected performance of an hyper-parameter configuration in a low-cost way.Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tech. This research was partially funded by Ministerio de Economı́a, Industria y Competitividad, Gobierno de España, and European Regional Development Fund grant numbers TIN2016-81766-REDT (http://cirti.es) and TIN2017-88213-R (http://6city.lcc.uma.es)

Crossref

Repositorio Institucional Universidad de Málaga

Short-term Demand Forecasting for Online Car-hailing Services using Recurrent Neural Networks

Author: Bahrak Behnam
Mahini Hamid
Nejadettehad Alireza
Publication venue
Publication date: 29/01/2019
Field of study

Short-term traffic flow prediction is one of the crucial issues in intelligent transportation system, which is an important part of smart cities. Accurate predictions can enable both the drivers and the passengers to make better decisions about their travel route, departure time and travel origin selection, which can be helpful in traffic management. Multiple models and algorithms based on time series prediction and machine learning were applied to this issue and achieved acceptable results. Recently, the availability of sufficient data and computational power, motivates us to improve the prediction accuracy via deep-learning approaches. Recurrent neural networks have become one of the most popular methods for time series forecasting, however, due to the variety of these networks, the question that which type is the most appropriate one for this task remains unsolved. In this paper, we use three kinds of recurrent neural networks including simple RNN units, GRU and LSTM neural network to predict short-term traffic flow. The dataset from TAP30 Corporation is used for building the models and comparing RNNs with several well-known models, such as DEMA, LASSO and XGBoost. The results show that all three types of RNNs outperform the others, however, more simple RNNs such as simple recurrent units and GRU perform work better than LSTM in terms of accuracy and training time.Comment: arXiv admin note: text overlap with arXiv:1706.06279, arXiv:1804.04176 by other author

arXiv.org e-Print Archive

Directory of Open Access Journals