Multi-Zone Unit for Recurrent Neural Networks

Authors: Liu Yang, Meng Fandong, Zhang Jinchao, Zhou Jie
Publication date: 17/11/2019

Recurrent neural networks (RNNs) have been widely used to deal with sequence learning problems. The input-dependent transition function, which folds new observations into hidden states to sequentially construct fixed-length representations of arbitrary-length sequences, plays a critical role in RNNs. Based on single space composition, transition functions in existing RNNs often have difficulty in capturing complicated long-range dependencies. In this paper, we introduce a new Multi-zone Unit (MZU) for RNNs. The key idea is to design a transition function that is capable of modeling multiple space composition. The MZU consists of three components: zone generation, zone composition, and zone aggregation. Experimental results on multiple datasets of the character-level language modeling task and the aspect-based sentiment analysis task demonstrate the superiority of the MZU.

Comment: Accepted at AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

A hypothesize-and-verify framework for Text Recognition using Deep Recurrent Neural Networks

Authors: Chaudhury Santanu, Rajeswar Sai, Ray Anupama
Publication date: 26/02/2015

Deep LSTM is an ideal candidate for text recognition. However, text recognition involves initial image processing steps, such as segmentation of lines and words, which can introduce errors into the recognition system. Without segmentation, learning very long-range context is difficult and becomes computationally intractable. Therefore, alternative soft decisions are needed at the pre-processing level. This paper proposes a hybrid text recognizer that uses a deep recurrent neural network with multiple layers of abstraction and long-range context, along with a language model to verify the performance of the deep neural network. We construct a multi-hypotheses tree architecture whose branches hold candidate segments of line sequences from different segmentation algorithms. The deep neural network is trained on perfectly segmented data and tests each of the candidate segments, generating Unicode sequences. In the verification step, these Unicode sequences are validated using a sub-string match with the language model, and best-first search is used to find the best possible combination of alternative hypotheses from the tree structure. Thus, the verification framework using language models eliminates wrong segmentation outputs and filters recognition errors.
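The verification step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy lexicon standing in for the language model, and the scoring rule (longest lexicon word found as a substring, divided by the hypothesis length) are all assumptions made for illustration. Each level of `levels` holds the alternative recognized strings for one segment, and a priority queue always expands the cheapest partial combination first.

```python
import heapq


def match_score(text, lexicon):
    """Assumed scoring rule: share of `text` covered by the longest
    lexicon word that occurs in it as a substring (0.0 if none)."""
    if not text:
        return 0.0
    longest = max((len(word) for word in lexicon if word in text), default=0)
    return longest / len(text)


def best_first_combination(levels, lexicon):
    """Pick one hypothesis per level so that the total cost is minimal.

    levels[i] lists the alternative recognized strings for segment i.
    The cost of a hypothesis is 1 - match_score, so it is never negative,
    and the first complete combination popped from the heap is optimal.
    """
    # Heap entries: (cumulative cost, depth in the tree, chosen strings).
    frontier = [(0.0, 0, ())]
    while frontier:
        cost, depth, chosen = heapq.heappop(frontier)
        if depth == len(levels):
            return list(chosen), cost
        for hypothesis in levels[depth]:
            step = 1.0 - match_score(hypothesis, lexicon)
            heapq.heappush(frontier, (cost + step, depth + 1, chosen + (hypothesis,)))
    # Reached only if some level has no hypotheses at all.
    return [], float("inf")


# Hypothetical data: two competing recognizer outputs per segment.
lexicon = {"deep", "text", "recognition"}
levels = [["deep", "dccp"], ["te xt", "text"], ["recognition", "rec0gnition"]]
result = best_first_combination(levels, lexicon)
```

Because the frontier is ordered by cumulative cost, badly matching branches are never expanded further once a fully matching combination exists, which is the pruning effect the abstract attributes to verification with a language model.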

arXiv.org e-Print Archive

Crossref

Modeling Taxi Drivers' Behaviour for the Next Destination Prediction

Authors: Barlacchi Gianni, Bianchini Monica, Lepri Bruno, Rossi Alberto
Publication date: 08/01/2019

In this paper, we study how to model taxi drivers' behaviour and geographical information for an interesting and challenging task: the next destination prediction in a taxi journey. Predicting the next location is a well-studied problem in human mobility, with several applications in real-world scenarios, from optimizing the efficiency of electronic dispatching systems to predicting and reducing traffic jams. This task is normally modeled as a multiclass classification problem, where the goal is to select the next taxi destination from a set of already known locations. We present a Recurrent Neural Network (RNN) approach that models the taxi drivers' behaviour and encodes the semantics of visited locations by using geographical information from Location-Based Social Networks (LBSNs). In particular, RNNs are trained to predict the exact coordinates of the next destination, which overcomes the limitation of producing as output only the limited set of locations seen during the training phase. The proposed approach was tested on the ECML/PKDD Discovery Challenge 2015 dataset, based on the city of Porto, obtaining better results than the competition winner whilst using less information, and on Manhattan and San Francisco datasets.

Comment: preprint version of a paper submitted to IEEE Transactions on Intelligent Transportation System
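The contrast drawn in this abstract, between selecting one of a fixed set of known locations and predicting free coordinates, can be made concrete with a small sketch. Everything here is an illustrative assumption rather than the paper's method: the haversine distance as the error measure, the Earth radius constant, the function names, and the sample coordinates.

```python
import math

EARTH_RADIUS_KM = 6371.0  # assumed mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def snap_to_known(point, known_locations):
    """Classification-style output: the closest location from a fixed, known set."""
    return min(known_locations, key=lambda loc: haversine_km(*point, *loc))


# Hypothetical values: a true destination that was never seen during training.
true_destination = (41.15, -8.61)
known_locations = [(41.16, -8.63), (40.00, -8.00)]
regressed = (41.151, -8.612)  # stand-in for a coordinate-regression output

class_error = haversine_km(*true_destination, *snap_to_known(true_destination, known_locations))
regress_error = haversine_km(*true_destination, *regressed)
```

Even the best possible classifier cannot do better than `class_error` when the true destination is outside the known set, whereas a coordinate output is not bounded in that way.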

arXiv.org e-Print Archive

Archivio della Ricerca - Università degli Studi di Siena

Archivio della ricerca - Fondazione Bruno Kessler

Unexpected Event Prediction in Wire Electrical Discharge Machining Using Deep Learning Techniques

Authors: Arriandiaga Ander, Conde Aintzane, Plaza Pascual Soraya, Sánchez Jose A., Wang Jun
Publication venue: MDPI AG
Publication date: 01/01/2018

Theoretical models of manufacturing processes provide valuable insight into physical phenomena, but their application to practical industrial situations is sometimes difficult. In the context of Industry 4.0, artificial intelligence techniques can provide efficient solutions to actual manufacturing problems when big data are available. Within the field of artificial intelligence, the use of deep learning is growing exponentially in solving many problems related to information and communication technologies (ICTs), but it remains scarce in the field of manufacturing. In this work, deep learning is used to efficiently predict unexpected events in wire electrical discharge machining (WEDM), an advanced machining process largely used for aerospace components. The occurrence of an unexpected event, namely the change of thickness of the machined part, can be effectively predicted by recognizing hidden patterns from process signals. Based on WEDM experiments, different deep learning architectures were tested. By using a combination of a convolutional layer with gated recurrent units, thickness variation in the machined component could be predicted in 97.4% of cases, at least 2 mm in advance, which is extremely fast and allows action before the process has degraded. New possibilities of deep learning for high-performance machine tools must be examined in the near future.

The authors gratefully acknowledge the funding received from the Spanish Ministry of Economy and Competitiveness and the FEDER operation program for the project "Scientific models and machine-tool advanced sensing techniques for efficient machining of precision components of Low Pressure Turbines" (DPI2017-82239-P) and UPV/EHU (UFI 11/29). The authors would also like to thank Euskampus and ONA-EDM for their support in this project.

Multidisciplinary Digital Publishing Institute

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

Archivo Digital para la Docencia y la Investigación

Coupling between gamma-band power and cerebral blood volume during recurrent acute neocortical seizures

Authors: Berwick J, Boorman L, Bruyns-Haylett M, Harris S, Kennerley A, Ma H, Overton PG, Schwartz TH, Zhao M, Zheng Y
Publication venue: Elsevier BV
Publication date: 02/04/2014

Characterization of neural and hemodynamic biomarkers of epileptic activity that can be measured using non-invasive techniques is fundamental to the accurate identification of the epileptogenic zone (EZ) in the clinical setting. Recently, oscillations at gamma-band frequencies and above (>30 Hz) have been suggested to provide valuable localizing information of the EZ and track cortical activation associated with epileptogenic processes. Although a tight coupling between gamma-band activity and hemodynamic-based signals has been consistently demonstrated in non-pathological conditions, very little is known about whether such a relationship is maintained in epilepsy and the laminar etiology of these signals. Confirmation of this relationship may elucidate the underpinnings of perfusion-based signals in epilepsy and the potential value of localizing the EZ using hemodynamic correlates of pathological rhythms. Here, we use concurrent multi-depth electrophysiology and 2-dimensional optical imaging spectroscopy to examine the coupling between multi-band neural activity and cerebral blood volume (CBV) during recurrent acute focal neocortical seizures in the urethane-anesthetized rat. We show a powerful correlation between gamma-band power (25-90 Hz) and CBV across cortical laminae, in particular layer 5, and a close association between gamma measures and multi-unit activity (MUA). Our findings provide insights into the laminar electrophysiological basis of perfusion-based imaging signals in the epileptic state and may have implications for further research using non-invasive multi-modal techniques to localize epileptogenic tissue.

Central Archive at the University of Reading

Elsevier - Publisher Connector

Crossref

PubMed Central

Spiral - Imperial College Digital Repository

White Rose Research Online