Search CORE

870 research outputs found

The Variant of Latent Dirichlet Allocation for Natural Scene Classification

Author: Yingjun Tang
Publication venue: Institute of Informatics, Slovak Academy of Sciences
Publication date: 26/01/2012
Field of study

The paper proposes a novel model based on classic LDA (latent Dirichlet allocation), which is used to learn and recognize natural scene category. Unlike previous work, the model performs variational Bayesian inference (VB) two times in order to get more precise prior Dirichlet parameters for each scene category. Although the scenes is represented in common topic simplex, the model has retained the diversities of each scene category based on the same topic simplex. Furthermore, two discriminations have been done to get good performance. We investigated the classification performance with classic 13 scenes image database and the experiments had demonstrated that our method can get better performance with less training time

Computing and Informatics (E-Journal - Institute of Informatics, SAS, Bratislava)

Factorized Topic Models

Author: Damianou Andreas
Ek Carl Henrik
Kjellstrom Hedvig
Zhang Cheng
Publication venue
Publication date: 01/01/2013
Field of study

In this paper we present a modification to a latent topic model, which makes the model exploit supervision to produce a factorized representation of the observed data. The structured parameterization separately encodes variance that is shared between classes from variance that is private to each class by the introduction of a new prior over the topic space. The approach allows for a more eff{}icient inference and provides an intuitive interpretation of the data in terms of an informative signal together with structured noise. The factorized representation is shown to enhance inference performance for image, text, and video classification.Comment: ICLR 201

arXiv.org e-Print Archive

Publikationer från KTH

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Explore Bristol Research

CINet: A Learning Based Approach to Incremental Context Modeling in Robots

Author: Bozcan İlker
Doğan Fethiye Irmak
Kalkan Sinan
Çelik Mehmet
Publication venue
Publication date: 29/07/2018
Field of study

There have been several attempts at modeling context in robots. However, either these attempts assume a fixed number of contexts or use a rule-based approach to determine when to increment the number of contexts. In this paper, we pose the task of when to increment as a learning problem, which we solve using a Recurrent Neural Network. We show that the network successfully (with 98\% testing accuracy) learns to predict when to increment, and demonstrate, in a scene modeling problem (where the correct number of contexts is not known), that the robot increments the number of contexts in an expected manner (i.e., the entropy of the system is reduced). We also present how the incremental model can be used for various scene reasoning tasks.Comment: The first two authors have contributed equally, 6 pages, 8 figures, International Conference on Intelligent Robots (IROS 2018

arXiv.org e-Print Archive

OpenMETU (Middle East Technical University)

A Supervised Neural Autoregressive Topic Model for Simultaneous Image Classification and Annotation

Author: Larochelle Hugo
Zhang Yu-Jin
Zheng Yin
Publication venue
Publication date: 22/05/2013
Field of study

Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to perform scene recognition and annotation. Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for document modeling. In this work, we show how to successfully apply and extend this model to the context of visual scene modeling. Specifically, we propose SupDocNADE, a supervised extension of DocNADE, that increases the discriminative power of the hidden topic features by incorporating label information into the training objective of the model. We also describe how to leverage information about the spatial position of the visual words and how to embed additional image annotations, so as to simultaneously perform image classification and annotation. We test our model on the Scene15, LabelMe and UIUC-Sports datasets and show that it compares favorably to other topic models such as the supervised variant of LDA.Comment: 13 pages, 5 figure

arXiv.org e-Print Archive

CiteSeerX

A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data

Author: Larochelle Hugo
Zhang Yu-Jin
Zheng Yin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 31/12/2015
Field of study

Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Another popular approach to model the multimodal data is through deep neural networks, such as the deep Boltzmann machine (DBM). Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for text document modeling. In this work, we show how to successfully apply and extend this model to multimodal data, such as simultaneous image classification and annotation. First, we propose SupDocNADE, a supervised extension of DocNADE, that increases the discriminative power of the learned hidden topic features and show how to employ it to learn a joint representation from image visual words, annotation words and class label information. We test our model on the LabelMe and UIUC-Sports data sets and show that it compares favorably to other topic models. Second, we propose a deep extension of our model and provide an efficient way of training the deep model. Experimental results show that our deep model outperforms its shallow version and reaches state-of-the-art performance on the Multimedia Information Retrieval (MIR) Flickr data set.Comment: 24 pages, 10 figures. A version has been accepted by TPAMI on Aug 4th, 2015. Add footnote about how to train the model in practice in Section 5.1. arXiv admin note: substantial text overlap with arXiv:1305.530

arXiv.org e-Print Archive

CiteSeerX

Weakly Supervised Learning of Objects, Attributes and Their Associations

Author: Hospedales TM
Shi Z
Xiang T
Yang Y
Publication venue
Publication date: 01/01/2014
Field of study

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-10605-2_31]”

arXiv.org e-Print Archive

CiteSeerX

Crossref

University of Surrey

Queen Mary Research Online

Surrey Research Insight

Latent Dirichlet Allocation Uncovers Spectral Characteristics of Drought Stressed Plants

Author: Ballvora Agim
Bauckhage Christian
Kersting Kristian
Leon Jens
Pinto Francisco
Ploemer Lutz
Rascher Uwe
Roemer Christoph
Wahabzada Mirwaes
Publication venue
Publication date: 01/01/2012
Field of study

Understanding the adaptation process of plants to drought stress is essential in improving management practices, breeding strategies as well as engineering viable crops for a sustainable agriculture in the coming decades. Hyper-spectral imaging provides a particularly promising approach to gain such understanding since it allows to discover non-destructively spectral characteristics of plants governed primarily by scattering and absorption characteristics of the leaf internal structure and biochemical constituents. Several drought stress indices have been derived using hyper-spectral imaging. However, they are typically based on few hyper-spectral images only, rely on interpretations of experts, and consider few wavelengths only. In this study, we present the first data-driven approach to discovering spectral drought stress indices, treating it as an unsupervised labeling problem at massive scale. To make use of short range dependencies of spectral wavelengths, we develop an online variational Bayes algorithm for latent Dirichlet allocation with convolved Dirichlet regularizer. This approach scales to massive datasets and, hence, provides a more objective complement to plant physiological practices. The spectral topics found conform to plant physiological knowledge and can be computed in a fraction of the time compared to existing LDA approaches.Comment: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012

arXiv.org e-Print Archive

Fraunhofer-ePrints

Juelich Shared Electronic Resources