Search CORE

9 research outputs found

Fast and versatile algorithm for nearest neighbor search based on a lower bound tree

Author: Chen Yong-Sheng
Publication venue: 'Elsevier BV'
Publication date: 29/04/2009
Field of study

Fast and Versatile Algorithm for Nearest Neighbor Search Based on a Lower Bound Tree

Author: Chiou-shann Fuh B
Ting-fang Yen A
Yi-ping Hung B
Yong-sheng Chen A
Publication venue
Publication date
Field of study

In this paper, we present a fast and versatile algorithm which can rapidly perform a variety of nearest neighbor searches. Efficiency improvement is achieved by utilizing the distance lower bound to avoid the calculation of the distance itself if the lower bound is already larger than the global minimum distance. At the preprocessing stage, the proposed algorithm constructs a lower bound tree (LB-tree) by agglomeratively clustering all the sample points to be searched. Given a query point, the lower bound of its distance to each sample point can be calculated by using the internal node of the LB-tree. To reduce the amount of lower bounds actually calculated, the winner-update search strategy is used for traversing the tree. For further efficiency improvement, data transformation can be applied to the sample and the query points. In addition to finding the nearest neighbor, the proposed algorithm can also (i) provide the k-nearest neighbors progressively; (ii) find the nearest neighbors within a specified distance threshold; and (iii) identify neighbors whose distances to the query are sufficiently close to the minimum distance of the nearest neighbor. Our experiments have shown that the proposed algorithm can save substantial computation, particularly when the distance of the query point to its nearest neighbor is relatively small compared with its distance to most other samples (which is the case for many object recognition problems)

CiteSeerX

Fast and versatile algorithm for nearest neighbor search based on a lower bound tree

Author: Arya
Bentley
Berchtold
Berchtold
Berchtold
Brin
Chen
Chiou-Shann Fuh
Djouadi
Duda
Fagin
Faragó
Flickner
Friedman
Fukunaga
Guttman
Hastie
Hjaltason
Hsieh
Jain
Katayama
Lee
Lee
Lin
McNames
Murase
Nene
Ramasubramanian
Soleymani
Strang
Ting-Fang Yen
Tomasi
Vidal
Wactlar
White
Winston
Yi-Ping Hung
Yong-Sheng Chen
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Comparison of classification ability of hyperball algorithms to neural network and k-nearest neighbour algorithms

Author: Zibamanzar-Mofrad Tanaby
Publication venue
Publication date: 03/04/2012
Field of study

The main focus of this thesis is to evaluate and compare Hyperbalilearning algorithm (HBL) to other learning algorithms. In this work HBL is compared to feed forward artificial neural networks using back propagation learning, K-nearest neighbor and 103 algorithms. In order to evaluate the similarity of these algorithms, we carried out three experiments using nine benchmark data sets from UCI machine learning repository. The first experiment compares HBL to other algorithms when sample size of dataset is changing. The second experiment compares HBL to other algorithms when dimensionality of data changes. The last experiment compares HBL to other algorithms according to the level of agreement to data target values. Our observations in general showed, considering classification accuracy as a measure, HBL is performing as good as most ANn variants. Additionally, we also deduced that HBL.:s classification accuracy outperforms 103's and K-nearest neighbour's for the selected data sets

Brock University Digital Repository

Building well-performing classifier ensembles: model and decision level combination.

Author: Eastwood Mark
Publication venue
Publication date
Field of study

There is a continuing drive for better, more robust generalisation performance from classification systems, and prediction systems in general. Ensemble methods, or the combining of multiple classifiers, have become an accepted and successful tool for doing this, though the reasons for success are not always entirely understood. In this thesis, we review the multiple classifier literature and consider the properties an ensemble of classifiers - or collection of subsets - should have in order to be combined successfully. We find that the framework of Stochastic Discrimination provides a well-defined account of these properties, which are shown to be strongly encouraged in a number of the most popular/successful methods in the literature via differing algorithmic devices. This uncovers some interesting and basic links between these methods, and aids understanding of their success and operation in terms of a kernel induced on the training data, with form particularly well suited to classification. One property that is desirable in both the SD framework and in a regression context, the ambiguity decomposition of the error, is de-correlation of individuals. This motivates the introduction of the Negative Correlation Learning method, in which neural networks are trained in parallel in a way designed to encourage de-correlation of the individual networks. The training is controlled by a parameter λ governing the extent to which correlations are penalised. Theoretical analysis of the dynamics of training results in an exact expression for the interval in which we can choose λ while ensuring stability of the training, and a value λ∗ for which the training has some interesting optimality properties. These values depend only on the size N of the ensemble. Decision level combination methods often result in a difficult to interpret model, and NCL is no exception. However in some applications, there is a need for understandable decisions and interpretable models. In response to this, we depart from the standard decision level combination paradigm to introduce a number of model level combination methods. As decision trees are one of the most interpretable model structures used in classification, we chose to combine structure from multiple individual trees to build a single combined model. We show that extremely compact, well performing models can be built in this way. In particular, a generalisation of bottom-up pruning to a multiple-tree context produces good results in this regard. Finally, we develop a classification system for a real-world churn prediction problem, illustrating some of the concepts introduced in the thesis, and a number of more practical considerations which are of importance when developing a prediction system for a specific problem

Bournemouth University Research Online

Building well-performing classifier ensembles : model and decision level combination

Author: Eastwood Mark
Publication venue
Publication date: 01/01/2010
Field of study

OpenGrey Repository

統計的性質に基づく文字の高精度認識に関する研究

Author: 勝山裕
Publication venue
Publication date: 05/12/2014
Field of study

Tohoku University加藤寧課

Tohoku University Repository (TOUR) / 東北大学機関リポジトリ

Institutional Repositories DataBase (IRDB)