Classification of medical data using Restricted Boltzmann Machines
Restricted Boltzmann Machines are generative models commonly used for feature extraction and for pre-training deep neural networks. In this thesis, their applicability to the classification of medical data is investigated. Three different approaches are evaluated on two small medical data sets. The resulting classifiers are shown to form sensible models of the data, with performance competitive with other methods on these data sets.
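As a concrete illustration of the recipe this abstract describes (training an RBM as an unsupervised feature extractor and classifying on its hidden activations), here is a minimal sketch using scikit-learn's BernoulliRBM. The digits data set stands in for a medical data set, and the hyperparameters are placeholder assumptions, not the thesis's actual setup.

```python
# Minimal sketch: RBM as unsupervised feature extractor, followed by a
# supervised logistic-regression head. Not the thesis's exact approaches.
from sklearn.neural_network import BernoulliRBM
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split

# Stand-in data; a real medical data set would be loaded here instead.
X, y = load_digits(return_X_y=True)
X = X / 16.0                          # BernoulliRBM expects values in [0, 1]
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

clf = Pipeline([
    ("rbm", BernoulliRBM(n_components=128, learning_rate=0.05,
                         n_iter=20, random_state=0)),
    ("logreg", LogisticRegression(max_iter=1000)),
])
clf.fit(X_tr, y_tr)
print("test accuracy:", clf.score(X_te, y_te))
```

Note that the RBM itself never sees the labels; only the logistic-regression head is supervised, mirroring the usual pre-train-then-classify pipeline.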
Word Recognition with Deep Conditional Random Fields
Recognition of handwritten words continues to be an important problem in document analysis and recognition. Existing approaches extract hand-engineered features from word images, which can perform poorly on new data sets. Recently, deep learning has attracted great attention because of its ability to learn features from raw data, and it has yielded state-of-the-art results in classification tasks including character recognition and scene recognition. Word recognition, on the other hand, is a sequential problem in which the correlation between characters must be modeled. In this paper, we propose deep Conditional Random Fields (deep CRFs) for word recognition. In essence, we combine CRFs with deep learning, so that deep features are learned and sequences are labeled in a unified framework. We pre-train the deep structure with stacked restricted Boltzmann machines (RBMs) for feature learning and optimize the entire network with an online learning algorithm. The proposed model was evaluated on two datasets and was seen to perform significantly better than competitive baseline models. The source code is available at https://github.com/ganggit/deepCRFs. (Published in ICIP 2016; 5 pages.)
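The core component of this approach, a linear-chain CRF over per-frame scores produced by the deep feature stack, can be sketched in a few lines. This is a hedged illustration, not the authors' code: the `emissions` array stands in for the outputs of the RBM-pretrained network, and the pretraining and online optimization steps are omitted.

```python
# Linear-chain CRF negative log-likelihood via the forward algorithm.
import numpy as np
from scipy.special import logsumexp

def crf_neg_log_likelihood(emissions, transitions, labels):
    """emissions: (T, K) per-step label scores from the deep feature stack.
    transitions: (K, K) score of moving from label i to label j.
    labels: (T,) gold label sequence.
    Returns -log p(labels | emissions) under a linear-chain CRF."""
    T, K = emissions.shape
    # Score of the gold path: emission terms plus transition terms.
    gold = emissions[np.arange(T), labels].sum()
    gold += transitions[labels[:-1], labels[1:]].sum()
    # Forward algorithm in log space for the partition function Z.
    alpha = emissions[0].copy()                       # (K,)
    for t in range(1, T):
        # alpha[j] = logsumexp_i(alpha[i] + trans[i, j]) + emit[t, j]
        alpha = logsumexp(alpha[:, None] + transitions, axis=0) + emissions[t]
    log_Z = logsumexp(alpha)
    return log_Z - gold

# Toy usage: 4 time steps (characters), 3 labels.
rng = np.random.default_rng(0)
nll = crf_neg_log_likelihood(rng.normal(size=(4, 3)),
                             rng.normal(size=(3, 3)),
                             np.array([0, 2, 1, 1]))
print("CRF negative log-likelihood:", nll)
```

Training would minimize this negative log-likelihood with respect to both the transition matrix and the deep network producing the emissions, which is what the unified framework in the paper refers to.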
The Recurrent Temporal Discriminative Restricted Boltzmann Machines
Classification of sequence data is a topic of interest for dynamic Bayesian models and Recurrent Neural Networks (RNNs). While the former can explicitly model the temporal dependencies between class variables, the latter are capable of learning representations. Several attempts have been made to improve performance by combining the two approaches or by increasing the processing capability of the hidden units in RNNs. This often results in complex models with a large number of learning parameters. In this paper, a compact model is proposed which offers both representation learning and temporal inference of class variables by rolling Restricted Boltzmann Machines (RBMs) and class variables over time. We address the key issue of intractability in this variant of RBMs by optimising a conditional distribution instead of a joint distribution. Experiments reported in the paper on melody modelling and optical character recognition show that the proposed model can outperform the state of the art. The experimental results on optical character recognition, part-of-speech tagging and text chunking also demonstrate that our model is comparable to recurrent neural networks with complex memory gates while requiring far fewer parameters.
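The per-step conditional distribution that such a discriminative RBM optimises, p(y | x), has a known closed form obtained by summing out the hidden units exactly, which is what makes the conditional objective tractable where the joint is not. Below is a minimal sketch with random placeholder parameters; the rolling of RBMs and class variables over time is omitted.

```python
# Closed-form class posterior of a discriminative (classification) RBM.
import numpy as np

def rbm_class_posterior(x, W, U, c, d):
    """p(y | x) of a classification RBM, hidden units summed out exactly.
    x: (D,) input            W: (H, D) input-to-hidden weights
    U: (H, K) label-to-hidden weights
    c: (H,) hidden biases    d: (K,) label biases.
    Uses log p(y|x) proportional to d_y + sum_j softplus(c_j + U_jy + (Wx)_j)."""
    pre = c[:, None] + U + (W @ x)[:, None]               # (H, K)
    log_scores = d + np.logaddexp(0.0, pre).sum(axis=0)   # softplus over hidden units
    log_scores -= log_scores.max()                        # numerical stability
    p = np.exp(log_scores)
    return p / p.sum()

# Toy usage: 10 inputs, 8 hidden units, 3 classes (untrained random weights).
rng = np.random.default_rng(1)
probs = rbm_class_posterior(rng.random(10),
                            rng.normal(size=(8, 10)),
                            rng.normal(size=(8, 3)),
                            rng.normal(size=8), rng.normal(size=3))
print("p(y | x):", probs)
```

In the temporal variant, this per-step posterior would additionally be conditioned on the previous hidden state, but the exact sum over hidden units carries over in the same way.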