Search CORE

45 research outputs found

Deep Convolutional Ranking for Multilabel Image Annotation

Author: Gong Yunchao
Ioffe Sergey
Jia Yangqing
Leung Thomas
Toshev Alexander
Publication venue
Publication date: 14/04/2014
Field of study

Multilabel image annotation is one of the most important challenges in computer vision with many real-world applications. While existing work usually use conventional visual features for multilabel annotation, features based on Deep Neural Networks have shown potential to significantly boost performance. In this work, we propose to leverage the advantage of such features and analyze key components that lead to better performances. Specifically, we show that a significant performance gain could be obtained by combining convolutional architectures with approximate top-

k

ranking objectives, as thye naturally fit the multilabel tagging problem. Our experiments on the NUS-WIDE dataset outperforms the conventional visual features by about 10%, obtaining the best reported performance in the literature

arXiv.org e-Print Archive

CiteSeerX

Matrix Coherence and the Nystrom Method

Author: Rostamizadeh Afshin
Talwalkar Ameet
Publication venue
Publication date: 01/01/2010
Field of study

The Nystrom method is an efficient technique to speed up large-scale learning applications by generating low-rank approximations. Crucial to the performance of this technique is the assumption that a matrix can be well approximated by working exclusively with a subset of its columns. In this work we relate this assumption to the concept of matrix coherence and connect matrix coherence to the performance of the Nystrom method. Making use of related work in the compressed sensing and the matrix completion literature, we derive novel coherence-based bounds for the Nystrom method in the low-rank setting. We then present empirical results that corroborate these theoretical bounds. Finally, we present more general empirical results for the full-rank setting that convincingly demonstrate the ability of matrix coherence to measure the degree to which information can be extracted from a subset of columns

arXiv.org e-Print Archive

CiteSeerX

A Simple Algorithm for Semi-supervised Learning with Improved Generalization Error Bound

Author: Han Jiawei
Ji Ming
Jin Rong
Lin Binbin
Yang Tianbao
Publication venue
Publication date: 01/01/2012
Field of study

In this work, we develop a simple algorithm for semi-supervised regression. The key idea is to use the top eigenfunctions of integral operator derived from both labeled and unlabeled examples as the basis functions and learn the prediction function by a simple linear regression. We show that under appropriate assumptions about the integral operator, this approach is able to achieve an improved regression error bound better than existing bounds of supervised learning. We also verify the effectiveness of the proposed algorithm by an empirical study.Comment: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012

arXiv.org e-Print Archive

CiteSeerX

Hierarchical Subquery Evaluation for Active Learning on a Graph

Author: Brostow Gabriel J.
Campbell Neill D. F.
Kautz Jan
Mac Aodha Oisin
Publication venue
Publication date: 25/09/2014
Field of study

To train good supervised and semi-supervised object classifiers, it is critical that we not waste the time of the human experts who are providing the training labels. Existing active learning strategies can have uneven performance, being efficient on some datasets but wasteful on others, or inconsistent just between runs on the same dataset. We propose perplexity based graph construction and a new hierarchical subquery evaluation algorithm to combat this variability, and to release the potential of Expected Error Reduction. Under some specific circumstances, Expected Error Reduction has been one of the strongest-performing informativeness criteria for active learning. Until now, it has also been prohibitively costly to compute for sizeable datasets. We demonstrate our highly practical algorithm, comparing it to other active learning measures on classification datasets that vary in sparsity, dimensionality, and size. Our algorithm is consistent over multiple runs and achieves high accuracy, while querying the human expert for labels at a frequency that matches their desired time budget.Comment: CVPR 201

arXiv.org e-Print Archive

Crossref

OPUS