Search CORE

268 research outputs found

A Novel Hybrid Dimensionality Reduction Method using Support Vector Machines and Independent Component Analysis

Author: Moon Sangwoo
Publication venue: TRACE: Tennessee Research and Creative Exchange
Publication date: 01/08/2010
Field of study

Due to the increasing demand for high dimensional data analysis from various applications such as electrocardiogram signal analysis and gene expression analysis for cancer detection, dimensionality reduction becomes a viable process to extracts essential information from data such that the high-dimensional data can be represented in a more condensed form with much lower dimensionality to both improve classification accuracy and reduce computational complexity. Conventional dimensionality reduction methods can be categorized into stand-alone and hybrid approaches. The stand-alone method utilizes a single criterion from either supervised or unsupervised perspective. On the other hand, the hybrid method integrates both criteria. Compared with a variety of stand-alone dimensionality reduction methods, the hybrid approach is promising as it takes advantage of both the supervised criterion for better classification accuracy and the unsupervised criterion for better data representation, simultaneously. However, several issues always exist that challenge the efficiency of the hybrid approach, including (1) the difficulty in finding a subspace that seamlessly integrates both criteria in a single hybrid framework, (2) the robustness of the performance regarding noisy data, and (3) nonlinear data representation capability. This dissertation presents a new hybrid dimensionality reduction method to seek projection through optimization of both structural risk (supervised criterion) from Support Vector Machine (SVM) and data independence (unsupervised criterion) from Independent Component Analysis (ICA). The projection from SVM directly contributes to classification performance improvement in a supervised perspective whereas maximum independence among features by ICA construct projection indirectly achieving classification accuracy improvement due to better intrinsic data representation in an unsupervised perspective. For linear dimensionality reduction model, I introduce orthogonality to interrelate both projections from SVM and ICA while redundancy removal process eliminates a part of the projection vectors from SVM, leading to more effective dimensionality reduction. The orthogonality-based linear hybrid dimensionality reduction method is extended to uncorrelatedness-based algorithm with nonlinear data representation capability. In the proposed approach, SVM and ICA are integrated into a single framework by the uncorrelated subspace based on kernel implementation. Experimental results show that the proposed approaches give higher classification performance with better robustness in relatively lower dimensions than conventional methods for high-dimensional datasets

University of Tennessee, Knoxville: Trace

Optimization of distributions differences for classification

Author: Bonyadi Mohammad Reza
Reutens David C.
Tieng Quang M.
Publication venue
Publication date: 02/03/2017
Field of study

In this paper we introduce a new classification algorithm called Optimization of Distributions Differences (ODD). The algorithm aims to find a transformation from the feature space to a new space where the instances in the same class are as close as possible to one another while the gravity centers of these classes are as far as possible from one another. This aim is formulated as a multiobjective optimization problem that is solved by a hybrid of an evolutionary strategy and the Quasi-Newton method. The choice of the transformation function is flexible and could be any continuous space function. We experiment with a linear and a non-linear transformation in this paper. We show that the algorithm can outperform 6 other state-of-the-art classification methods, namely naive Bayes, support vector machines, linear discriminant analysis, multi-layer perceptrons, decision trees, and k-nearest neighbors, in 12 standard classification datasets. Our results show that the method is less sensitive to the imbalanced number of instances comparing to these methods. We also show that ODD maintains its performance better than other classification methods in these datasets, hence, offers a better generalization ability

arXiv.org e-Print Archive

University of Queensland eSpace

A projection method for multiobjective multiclass SVM

Author: Liu Ling
Martín Barragán Belén
Prieto Fernández Francisco Javier
Publication venue
Publication date: 01/05/2014
Field of study

Support Vector Machines (SVMs) have become a very popular technique in the machine learning field for classification problems. It was originally proposed for classification of two classes. Various multiclass models with a single objective have been proposed mostly based on two families of methods: an all-together approach and a one-against-all approach. However,most of these single-objective models consider neither the different costs of misclassification nor the user's preferences. To overcome these drawbacks, multiobjective models have been proposed.In this paper we rewrite the different approaches that deal with the multiclass SVM using multiobjective techniques. These multiobjective techniques can give us weakly Pareto-optimal solutions. We propose a multiobjective technique called Projected Multiobjective All-Together(PMAT), which works in a higher-dimension space than the object space. With this technique, we can theoretically characterize the Pareto-optimal solution set. For these multiobjective techniques we get approximate sets of the Pareto-optimal solutions. For these sets, we use hypervolume and epsilon indicators to evaluate different multiobjective techniques. From the experimental results, we can see that (PMAT) outperfoms the other multiobjective techniques. When facing classification problems with very large numbers of classes, we suggest combininga tree method and multiobjective technique

Universidad Carlos III de Madrid e-Archivo

A projection multi-objective SVM method for multi-class classification

Author: Liu Ling
Martín-Barragán Belén
Prieto Francisco J.
Publication venue: 'Elsevier BV'
Publication date: 01/08/2021
Field of study

Edinburgh Research Explorer

Soft Methodology for Cost-and-error Sensitive Classification

Author: Jan Te-Kang
Lin Chi-Hung
Lin Hsuan-Tien
Wang Da-Wei
Publication venue
Publication date: 25/10/2017
Field of study

Many real-world data mining applications need varying cost for different types of classification errors and thus call for cost-sensitive classification algorithms. Existing algorithms for cost-sensitive classification are successful in terms of minimizing the cost, but can result in a high error rate as the trade-off. The high error rate holds back the practical use of those algorithms. In this paper, we propose a novel cost-sensitive classification methodology that takes both the cost and the error rate into account. The methodology, called soft cost-sensitive classification, is established from a multicriteria optimization problem of the cost and the error rate, and can be viewed as regularizing cost-sensitive classification with the error rate. The simple methodology allows immediate improvements of existing cost-sensitive classification algorithms. Experiments on the benchmark and the real-world data sets show that our proposed methodology indeed achieves lower test error rates and similar (sometimes lower) test costs than existing cost-sensitive classification algorithms. We also demonstrate that the methodology can be extended for considering the weighted error rate instead of the original error rate. This extension is useful for tackling unbalanced classification problems.Comment: A shorter version appeared in KDD '1

arXiv.org e-Print Archive

CiteSeerX

Efficient Learning Machines

Author: Awad Mariette
Khanna Rahul
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Computer scienc

OAPEN Library