Search CORE

2,266 research outputs found

Incremental construction of LSTM recurrent neural network

Author: Alquézar Mancho René
Ribeiro Evandsa Sabrine Lopes-Lima
Publication venue
Publication date: 01/01/2002
Field of study

Long Short--Term Memory (LSTM) is a recurrent neural network that uses structures called memory blocks to allow the net remember significant events distant in the past input sequence in order to solve long time lag tasks, where other RNN approaches fail. Throughout this work we have performed experiments using LSTM networks extended with growing abilities, which we call GLSTM. Four methods of training growing LSTM has been compared. These methods include cascade and fully connected hidden layers as well as two different levels of freezing previous weights in the cascade case. GLSTM has been applied to a forecasting problem in a biomedical domain, where the input/output behavior of five controllers of the Central Nervous System control has to be modelled. We have compared growing LSTM results against other neural networks approaches, and our work applying conventional LSTM to the task at hand.Postprint (published version

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

DeepLOB: Deep Convolutional Neural Networks for Limit Order Books

Author: Roberts Stephen
Zhang Zihao
Zohren Stefan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 25/11/2018
Field of study

We develop a large-scale deep learning model to predict price movements from limit order book (LOB) data of cash equities. The architecture utilises convolutional filters to capture the spatial structure of the limit order books as well as LSTM modules to capture longer time dependencies. The proposed network outperforms all existing state-of-the-art algorithms on the benchmark LOB dataset [1]. In a more realistic setting, we test our model by using one year market quotes from the London Stock Exchange and the model delivers a remarkably stable out-of-sample prediction accuracy for a variety of instruments. Importantly, our model translates well to instruments which were not part of the training set, indicating the model's ability to extract universal features. In order to better understand these features and to go beyond a "black box" model, we perform a sensitivity analysis to understand the rationale behind the model predictions and reveal the components of LOBs that are most relevant. The ability to extract robust features which translate well to other instruments is an important property of our model which has many other applications.Comment: 12 pages, 9 figure

arXiv.org e-Print Archive

Oxford University Research Archive

Energy performance forecasting of residential buildings using fuzzy approaches

Author: Múgica Álvarez Francisco
Nebot Castells M. Àngela
Publication venue: 'MDPI AG'
Publication date: 01/01/2020
Field of study

The energy consumption used for domestic purposes in Europe is, to a considerable extent, due to heating and cooling. This energy is produced mostly by burning fossil fuels, which has a high negative environmental impact. The characteristics of a building are an important factor to determine the necessities of heating and cooling loads. Therefore, the study of the relevant characteristics of the buildings, regarding the heating and cooling needed to maintain comfortable indoor air conditions, could be very useful in order to design and construct energy-efficient buildings. In previous studies, different machine-learning approaches have been used to predict heating and cooling loads from the set of variables: relative compactness, surface area, wall area, roof area, overall height, orientation, glazing area and glazing area distribution. However, none of these methods are based on fuzzy logic. In this research, we study two fuzzy logic approaches, i.e., fuzzy inductive reasoning (FIR) and adaptive neuro fuzzy inference system (ANFIS), to deal with the same problem. Fuzzy approaches obtain very good results, outperforming all the methods described in previous studies except one. In this work, we also study the feature selection process of FIR methodology as a pre-processing tool to select the more relevant variables before the use of any predictive modelling methodology. It is proven that FIR feature selection provides interesting insights into the main building variables causally related to heating and cooling loads. This allows better decision making and design strategies, since accurate cooling and heating load estimations and correct identification of parameters that affect building energy demands are of high importance to optimize building designs and equipment specifications.Peer ReviewedPostprint (published version

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Estimating Blood Pressure from Photoplethysmogram Signal and Demographic Features using Machine Learning Techniques

Author: Chowdhury Moajjem Hossain
Chowdhury Muhammad E. H.
Khandakar Amith
Mahbub Zaid B
Reaz Mamun Bin Ibne
Shuzan Md Nazmul Islam
Uddin M. Monir
Publication venue: 'MDPI AG'
Publication date: 01/01/2020
Field of study

Hypertension is a potentially unsafe health ailment, which can be indicated directly from the Blood pressure (BP). Hypertension always leads to other health complications. Continuous monitoring of BP is very important; however, cuff-based BP measurements are discrete and uncomfortable to the user. To address this need, a cuff-less, continuous and a non-invasive BP measurement system is proposed using Photoplethysmogram (PPG) signal and demographic features using machine learning (ML) algorithms. PPG signals were acquired from 219 subjects, which undergo pre-processing and feature extraction steps. Time, frequency and time-frequency domain features were extracted from the PPG and their derivative signals. Feature selection techniques were used to reduce the computational complexity and to decrease the chance of over-fitting the ML algorithms. The features were then used to train and evaluate ML algorithms. The best regression models were selected for Systolic BP (SBP) and Diastolic BP (DBP) estimation individually. Gaussian Process Regression (GPR) along with ReliefF feature selection algorithm outperforms other algorithms in estimating SBP and DBP with a root-mean-square error (RMSE) of 6.74 and 3.59 respectively. This ML model can be implemented in hardware systems to continuously monitor BP and avoid any critical health conditions due to sudden changes.Comment: Accepted for publication in Sensor, 14 Figures, 14 Table

arXiv.org e-Print Archive

Qatar University Institutional Repository

Evaluación de diferentes métodos de modelización para la estimación de la altura total del árbol de la Región Mediterránea de Turquía

Author: Diamantopoulou M. J.
Özçelik R.
Publication venue: 'Instituto Nacional de Investigacion y Tecnologia Agraria y Alimentaria (INIA)'
Publication date: 28/11/2012
Field of study

Efficient management of timber resources and wood utilization practices require accurate and versatile information about important characteristics of forest resources for evaluating the numerous management and utilization alternatives for timber resources. Tree height is considered one of the most useful variables along with stocking and diameter at breast height, in estimating forest stand wood volumes and productivity. Six nonlinear growth functions were fitted to tree height-diameter data of three major tree species in Western Mediterranean Region’s forests of Turkey. The generalized regression neural network (GRNN) technique has been applied for tree height prediction, as well, due to its ability to fit complex nonlinear models. The performance of the models was compared and evaluated. Further, equivalence tests of the selected models were conducted. Validation showed the appropriatness of all models to predict tree height. According to the model performance criteria, the six nonlinear growth functions were able to capture the height-diameter relationships and fitted the data almost equally well, while the constructed generalized regression neural network (GRNN) models were found to be superior to all nonlinear regression models, in terms of their predictive ability. La gestión eficiente de los recursos forestales y la de utilización de la madera requieren de información precisa y versátil acerca de las características importantes de los recursos forestales para la evaluación de la gestión y de las alternativas de utilización de los recursos forestales. La altura del árbol es considerada como una de las variables más útiles, junto con la densidad, y el diámetro a la altura del pecho, en la estimación de volúmenes de madera y la productividad de masas forestales. Se ajustaron seis modelos de altura total-diámetro y se compararon con el fin de estimar con precisión la altura total del árbol de las tres principales especies de árboles en los bosques de la Región Occidental Mediterráneo de Turquía. La regresión generalizada de redes neuronales (GRNN) se presenta como una técnica alternativa de red neuronal a la técnica de regresión no lineal para estimar la altura total de los árboles debido a su capacidad para adaptarse a modelos complejos no lineales. Se compararon y evaluaron los modelos. Se llevaron a cabo otras pruebas, como la equivalencia de los modelos seleccionados. De acuerdo con los criterios del rendimiento de los modelos, las seis funciones no lineales de crecimiento fueron capaces de capturar las relaciones altura-diámetro y ajustaron los datos casi igual de bien, mientras que las construidas mediante modelos de regresión generalizados de redes neuronales (GRNN) resultaron ser superiores a todos los modelos de regresión no lineal, en términos de su capacidad predictiva

Scientific Journals of INIA (Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria)

Online Learning Algorithm for Time Series Forecasting Suitable for Low Cost Wireless Sensor Networks Nodes

Author: Botella-Rocamora Paloma
Pardo Juan
Zamora-Martinez Francisco
Publication venue: 'MDPI AG'
Publication date: 01/04/2015
Field of study

Time series forecasting is an important predictive methodology which can be applied to a wide range of problems. Particularly, forecasting the indoor temperature permits an improved utilization of the HVAC (Heating, Ventilating and Air Conditioning) systems in a home and thus a better energy efficiency. With such purpose the paper describes how to implement an Artificial Neural Network (ANN) algorithm in a low cost system-on-chip to develop an autonomous intelligent wireless sensor network. The present paper uses a Wireless Sensor Networks (WSN) to monitor and forecast the indoor temperature in a smart home, based on low resources and cost microcontroller technology as the 8051MCU. An on-line learning approach, based on Back-Propagation (BP) algorithm for ANNs, has been developed for real-time time series learning. It performs the model training with every new data that arrive to the system, without saving enormous quantities of data to create a historical database as usual, i.e., without previous knowledge. Consequently to validate the approach a simulation study through a Bayesian baseline model have been tested in order to compare with a database of a real application aiming to see the performance and accuracy. The core of the paper is a new algorithm, based on the BP one, which has been described in detail, and the challenge was how to implement a computational demanding algorithm in a simple architecture with very few hardware resources.Comment: 28 pages, Published 21 April 2015 at MDPI's journal "Sensors

arXiv.org e-Print Archive

Directory of Open Access Journals

Method and system for training dynamic nonlinear adaptive filters which have embedded memory

Author: Rabinowitz Matthew
Publication venue
Publication date: 26/02/2002
Field of study

Described herein is a method and system for training nonlinear adaptive filters (or neural networks) which have embedded memory. Such memory can arise in a multi-layer finite impulse response (FIR) architecture, or an infinite impulse response (IIR) architecture. We focus on filter architectures with separate linear dynamic components and static nonlinear components. Such filters can be structured so as to restrict their degrees of computational freedom based on a priori knowledge about the dynamic operation to be emulated. The method is detailed for an FIR architecture which consists of linear FIR filters together with nonlinear generalized single layer subnets. For the IIR case, we extend the methodology to a general nonlinear architecture which uses feedback. For these dynamic architectures, we describe how one can apply optimization techniques which make updates closer to the Newton direction than those of a steepest descent method, such as backpropagation. We detail a novel adaptive modified Gauss-Newton optimization technique, which uses an adaptive learning rate to determine both the magnitude and direction of update steps. For a wide range of adaptive filtering applications, the new training algorithm converges faster and to a smaller value of cost than both steepest-descent methods such as backpropagation-through-time, and standard quasi-Newton methods. We apply the algorithm to modeling the inverse of a nonlinear dynamic tracking system 5, as well as a nonlinear amplifier 6

NASA Technical Reports Server