Search CORE

1,441 research outputs found

Direct kernel biased discriminant analysis: a new content-based image retrieval relevance feedback algorithm

Author: Dacheng Tao
Senior Member
Senior Member
Student Member
Xiaoou Tang
Xuelong Li
Yong Rui
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2006
Field of study

In recent years, a variety of relevance feedback (RF) schemes have been developed to improve the performance of content-based image retrieval (CBIR). Given user feedback information, the key to a RF scheme is how to select a subset of image features to construct a suitable dissimilarity measure. Among various RF schemes, biased discriminant analysis (BDA) based RF is one of the most promising. It is based on the observation that all positive samples are alike, while in general each negative sample is negative in its own way. However, to use BDA, the small sample size (SSS) problem is a big challenge, as users tend to give a small number of feedback samples. To explore solutions to this issue, this paper proposes a direct kernel BDA (DKBDA), which is less sensitive to SSS. An incremental DKBDA (IDKBDA) is also developed to speed up the analysis. Experimental results are reported on a real-world image collection to demonstrate that the proposed methods outperform the traditional kernel BDA (KBDA) and the support vector machine (SVM) based RF algorithms

CiteSeerX

Crossref

OPUS - University of Technology Sydney

Birkbeck Institutional Research Online

Large Margin Image Set Representation and Classification

Author: Alzahrani Majed
Gao Xin
Wang Jim Jing-Yan
Publication venue
Publication date: 22/04/2014
Field of study

In this paper, we propose a novel image set representation and classification method by maximizing the margin of image sets. The margin of an image set is defined as the difference of the distance to its nearest image set from different classes and the distance to its nearest image set of the same class. By modeling the image sets by using both their image samples and their affine hull models, and maximizing the margins of the images sets, the image set representation parameter learning problem is formulated as an minimization problem, which is further optimized by an expectation -maximization (EM) strategy with accelerated proximal gradient (APG) optimization in an iterative algorithm. To classify a given test image set, we assign it to the class which could provide the largest margin. Experiments on two applications of video-sequence-based face recognition demonstrate that the proposed method significantly outperforms state-of-the-art image set classification methods in terms of both effectiveness and efficiency

arXiv.org e-Print Archive

Crossref

Speech Recognition Using Augmented Conditional Random Fields

Author: Hifny Yasser
Renals Steve
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 21/12/2009
Field of study

Acoustic modeling based on hidden Markov models (HMMs) is employed by state-of-the-art stochastic speech recognition systems. Although HMMs are a natural choice to warp the time axis and model the temporal phenomena in the speech signal, their conditional independence properties limit their ability to model spectral phenomena well. In this paper, a new acoustic modeling paradigm based on augmented conditional random fields (ACRFs) is investigated and developed. This paradigm addresses some limitations of HMMs while maintaining many of the aspects which have made them successful. In particular, the acoustic modeling problem is reformulated in a data driven, sparse, augmented space to increase discrimination. Acoustic context modeling is explicitly integrated to handle the sequential phenomena of the speech signal. We present an efficient framework for estimating these models that ensures scalability and generality. In the TIMIT phone recognition task, a phone error rate of 23.0\% was recorded on the full test set, a significant improvement over comparable HMM-based systems

CiteSeerX

Crossref

Edinburgh Research Archive

Faster Geometric Algorithms via Dynamic Determinant Computation

Author: Abbott
Avis
Avrachenkov
Bareiss
Bartlett
Barvinok
Basu
Berkowitz
Bird
Boehm
Boissonnat
Brönnimann
Brönnimann
Brönnimann
Bunch
Büeler
Büeler
CGAL
Chand
Clarkson
Clarkson
Clarkson
Conway
Coppersmith
Cox
Dumas
Dumas
Edelsbrunner
Emiris
Emiris
Fisikopoulos
Fukuda
Garling
Gawrilow
Gelfand
Guennebaud
Harville
Hornus
Iliopoulos
Kaltofen
Kaltofen
Kettner
Krattenthaler
Le Gall
Luis Peñaranda
Mahajan
Poole
Press
Rambau
Robinson
Rote
Sankowski
Seidel
Sherman
Urbańska
Villard
Vissarion Fisikopoulos
Yap
Ziegler
Publication venue: 'Elsevier BV'
Publication date: 12/01/2016
Field of study

The computation of determinants or their signs is the core procedure in many important geometric algorithms, such as convex hull, volume and point location. As the dimension of the computation space grows, a higher percentage of the total computation time is consumed by these computations. In this paper we study the sequences of determinants that appear in geometric algorithms. The computation of a single determinant is accelerated by using the information from the previous computations in that sequence. We propose two dynamic determinant algorithms with quadratic arithmetic complexity when employed in convex hull and volume computations, and with linear arithmetic complexity when used in point location problems. We implement the proposed algorithms and perform an extensive experimental analysis. On one hand, our analysis serves as a performance study of state-of-the-art determinant algorithms and implementations. On the other hand, we demonstrate the supremacy of our methods over state-of-the-art implementations of determinant and geometric algorithms. Our experimental results include a 20 and 78 times speed-up in volume and point location computations in dimension 6 and 11 respectively.Comment: 29 pages, 8 figures, 3 table

arXiv.org e-Print Archive

Crossref

DI-fusion

Enhancement of ELDA Tracker Based on CNN Features and Adaptive Model Update

Author: Gao Changxin
Sang Nong
Shi Huizhang
Yu Jin-Gang
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2016
Field of study

Appearance representation and the observation model are the most important components in designing a robust visual tracking algorithm for video-based sensors. Additionally, the exemplar-based linear discriminant analysis (ELDA) model has shown good performance in object tracking. Based on that, we improve the ELDA tracking algorithm by deep convolutional neural network (CNN) features and adaptive model update. Deep CNN features have been successfully used in various computer vision tasks. Extracting CNN features on all of the candidate windows is time consuming. To address this problem, a two-step CNN feature extraction method is proposed by separately computing convolutional layers and fully-connected layers. Due to the strong discriminative ability of CNN features and the exemplar-based model, we update both object and background models to improve their adaptivity and to deal with the tradeoff between discriminative ability and adaptivity. An object updating method is proposed to select the “good” models (detectors), which are quite discriminative and uncorrelated to other selected models. Meanwhile, we build the background model as a Gaussian mixture model (GMM) to adapt to complex scenes, which is initialized offline and updated online. The proposed tracker is evaluated on a benchmark dataset of 50 video sequences with various challenges. It achieves the best overall performance among the compared state-of-the-art trackers, which demonstrates the effectiveness and robustness of our tracking algorithm

DigitalCommons@University of Nebraska

Directory of Open Access Journals

PubMed Central