34 research outputs found

    Feature Selection Based on Fuzzy Distances Between Clusters: First Results on Simulated Data

    Automatic feature selection methods are important in many situations where a large set of possible features is available, from which a subset must be selected to compose suitable feature vectors. Several automatic feature selection methods rest on two main components: a selection algorithm and a criterion function. Many commonly adopted criterion functions depend on a distance between the clusters, which is extremely important to the final result. Most inter-cluster distances are better suited to convex sets and do not produce good results for concave clusters or for clusters with overlapping areas. To circumvent these problems, this paper presents a new approach to defining the decision criterion based on fuzzy distances. In our approach, each cluster is fuzzified and a fuzzy distance is applied to the resulting fuzzy sets. Experimental results illustrating the advantages of the new approach are discussed.
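    The abstract does not specify the membership functions or the fuzzy distance used, so the following is only a minimal sketch of the general idea: fuzzify each cluster with an assumed Gaussian membership around its centroid, then compute a membership-weighted average of pairwise point distances. The function names and the `min`-based weighting are illustrative assumptions, not the paper's definitions.

    ```python
    import math

    def centroid(cluster):
        # Component-wise mean of the cluster's points.
        dim = len(cluster[0])
        return [sum(p[i] for p in cluster) / len(cluster) for i in range(dim)]

    def fuzzify(cluster, center, spread=1.0):
        # Assumed Gaussian membership: points near the centroid get degrees near 1.
        return [math.exp(-sum((x - c) ** 2 for x, c in zip(p, center))
                         / (2 * spread ** 2))
                for p in cluster]

    def fuzzy_distance(cluster_a, cluster_b, spread=1.0):
        # Membership-weighted average of pairwise point distances. Because every
        # pair of points contributes (weighted by the smaller membership degree),
        # concave or overlapping clusters are handled more gracefully than with
        # a plain centroid-to-centroid distance.
        mu_a = fuzzify(cluster_a, centroid(cluster_a), spread)
        mu_b = fuzzify(cluster_b, centroid(cluster_b), spread)
        num = den = 0.0
        for p, ma in zip(cluster_a, mu_a):
            for q, mb in zip(cluster_b, mu_b):
                w = min(ma, mb)
                num += w * math.dist(p, q)
                den += w
        return num / den if den else 0.0
    ```

    With this definition, a pair of nearby overlapping clusters yields a smaller fuzzy distance than a pair of well-separated ones, which is the property a criterion function needs.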

    Statistical learning of multi-view face detection

    1 Introduction Pattern recognition problems have two essential issues: (i) feature selection, and (ii) classifier design based on the selected features. Boosting is a method that attempts to boost the accuracy of an ensemble of weak classifiers into a strong one. The AdaBoost algorithm [1] solved many of the practical difficulties of earlier boosting algorithms. Each weak classifier is trained stage-wise to minimize the empirical error on a distribution re-weighted according to the classification errors of the previously trained classifiers. It has been shown that AdaBoost is a sequential forward search procedure using a greedy selection strategy to minimize a certain margin on the training set [4].
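    The stage-wise training and re-weighting described above can be sketched as follows, using decision stumps on a 1-D feature as the weak classifiers. This is a generic AdaBoost sketch, not the paper's multi-view face detector; the stump parameterization (threshold, polarity) is an assumption for illustration.

    ```python
    import math

    def train_adaboost(xs, ys, rounds=5):
        # xs: 1-D feature values; ys: labels in {-1, +1}.
        n = len(xs)
        w = [1.0 / n] * n                  # initial uniform distribution
        ensemble = []                      # list of (alpha, threshold, polarity)
        for _ in range(rounds):
            # Greedy stage: pick the stump minimizing the weighted empirical error.
            best = None
            for thr in sorted(set(xs)):
                for pol in (+1, -1):
                    err = sum(wi for xi, yi, wi in zip(xs, ys, w)
                              if pol * (1 if xi >= thr else -1) != yi)
                    if best is None or err < best[0]:
                        best = (err, thr, pol)
            err, thr, pol = best
            err = max(err, 1e-10)          # avoid division by zero
            alpha = 0.5 * math.log((1 - err) / err)
            ensemble.append((alpha, thr, pol))
            # Re-weight: misclassified instances gain weight for the next stage.
            w = [wi * math.exp(-alpha * yi * pol * (1 if xi >= thr else -1))
                 for xi, yi, wi in zip(xs, ys, w)]
            z = sum(w)
            w = [wi / z for wi in w]
        return ensemble

    def predict(ensemble, x):
        # Strong classifier: sign of the alpha-weighted vote of the stumps.
        s = sum(a * p * (1 if x >= t else -1) for a, t, p in ensemble)
        return 1 if s >= 0 else -1
    ```

    Each round is one step of the sequential forward search: the stump added is the one that looks best under the current re-weighted distribution.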

    Feature Selection Based on Run Covering

    This paper proposes a new feature selection algorithm. First, the data at every attribute are sorted, and contiguous instances with the same class label are grouped into runs. Runs whose length exceeds a given threshold are selected as “valid” runs, which enclose instances separable from the other classes. Second, we count how many runs cover each instance and check how the covering number changes once a feature is eliminated. We then delete the feature that has the least impact on the covering counts over all instances. We compare our method with ReliefF and a method based on mutual information. Evaluation was performed on three image databases, and the experimental results show that the proposed method outperformed the other two.
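    A minimal sketch of the run-covering idea, under assumptions the abstract leaves open (how ties in attribute values are broken, and that elimination impact is the total drop in covering counts): sort each attribute, group same-label neighbors into runs, keep runs above a length threshold, and score each feature by how much removing it reduces the counts.

    ```python
    def valid_runs(values, labels, min_len=2):
        # Sort instances by this attribute's value, then group consecutive
        # instances sharing a class label into runs; keep only "valid" runs
        # that meet the length threshold.
        order = sorted(range(len(values)), key=lambda i: values[i])
        runs, current = [], [order[0]]
        for i in order[1:]:
            if labels[i] == labels[current[-1]]:
                current.append(i)
            else:
                runs.append(current)
                current = [i]
        runs.append(current)
        return [r for r in runs if len(r) >= min_len]

    def covering_counts(data, labels, min_len=2):
        # data: list of feature columns. Count, per instance, how many valid
        # runs across all features cover it.
        counts = [0] * len(labels)
        for col in data:
            for run in valid_runs(col, labels, min_len):
                for i in run:
                    counts[i] += 1
        return counts

    def least_important_feature(data, labels, min_len=2):
        # The feature whose elimination changes the covering counts the least.
        base = covering_counts(data, labels, min_len)
        def impact(j):
            reduced = covering_counts(data[:j] + data[j + 1:], labels, min_len)
            return sum(b - r for b, r in zip(base, reduced))
        return min(range(len(data)), key=impact)
    ```

    Repeatedly deleting `least_important_feature` gives a backward-elimination loop that keeps the features whose runs do the most covering work.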

    Seamless Heterogeneous 3D Tessellation via DWT Domain Smoothing and Mosaicking

    With today's geobrowsers, tessellations are far from smooth for a variety of reasons, the principal ones being lighting differences and resolution heterogeneity. While the former has been dealt with extensively in the literature through classic mosaicking techniques, the latter has received little attention. We focus on this latter aspect and present two DWT domain methods to seamlessly stitch tiles of heterogeneous resolutions. The first method is local: each tile that constitutes the view is subjected to one of three context-based smoothing functions proposed for horizontal, vertical, and radial smoothing, depending on its location in the tessellation. These functions are applied at the DWT subband level and followed by an inverse DWT to give a smoothed tile. The second method assumes the same tessellation scenario, but the view field is treated as a sliding window that may contain parts of tiles from the heterogeneous tessellation. The window is refined in the DWT domain through mosaicking and smoothing, followed by a global inverse DWT. Unlike its traditional sense, the mosaicking employed here targets the heterogeneous resolution. Perceptually, the second method has shown better results than the first. Both methods have been successfully applied to practical examples of both texture and its corresponding DEM for seamless 3D terrain visualization.
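    To illustrate subband-level smoothing followed by an inverse DWT, here is a deliberately reduced sketch: a one-level 1-D Haar transform of a pixel row crossing a tile seam, with the detail (high-frequency) coefficients attenuated around the seam. The paper's actual smoothing functions operate on 2-D subbands with directional (horizontal/vertical/radial) variants; the Haar filter, the window `width`, and the attenuation factor here are all assumptions.

    ```python
    def haar_dwt(signal):
        # One-level Haar DWT: pairwise averages (approximation) and
        # pairwise half-differences (detail).
        a = [(signal[2 * i] + signal[2 * i + 1]) / 2 for i in range(len(signal) // 2)]
        d = [(signal[2 * i] - signal[2 * i + 1]) / 2 for i in range(len(signal) // 2)]
        return a, d

    def haar_idwt(a, d):
        # Exact inverse of haar_dwt above: x = a + d, y = a - d.
        out = []
        for ai, di in zip(a, d):
            out += [ai + di, ai - di]
        return out

    def smooth_seam(row, seam, width=4, atten=0.25):
        # Attenuate detail coefficients in a window around the seam between
        # two tiles, then invert the DWT; the approximation band is untouched,
        # so only the high-frequency discontinuity is softened.
        a, d = haar_dwt(row)
        lo, hi = max(0, seam // 2 - width), min(len(d), seam // 2 + width)
        d = [di * atten if lo <= j < hi else di for j, di in enumerate(d)]
        return haar_idwt(a, d)
    ```

    On a row with an abrupt step at the seam, the maximum sample-to-sample jump after smoothing is strictly smaller than the original step, which is the perceptual effect the subband smoothing aims for.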

    Stability of feature selection methods: a study of metrics across different gene expression datasets

    Analysis of gene-expression data often requires that a subset of genes (features) be selected, and many feature selection (FS) methods have been devised. However, FS methods often generate different lists of features for the same dataset, and users then have to choose which list to use. One approach to supporting this choice is to apply stability metrics to the generated lists and select a list on that basis. The aim of this study is to investigate the behavior of stability metrics applied to feature subsets generated by FS methods. The experiments in this work explore a range of gene expression datasets, FS methods, and expected numbers of features to compare several stability metrics. The stability metrics were used to compare five feature selection methods (SVM, SAM, ReliefF, RFE + RF and LIMMA) on gene expression datasets from the EBI repository. Results show that the studied stability metrics display a high degree of variability; the reason for this is not yet clear and is being investigated further. The final objective of the research, namely to define how to select an FS method, is ongoing work whose partial findings are reported here.
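    The abstract does not name the specific stability metrics compared, so as one representative example, here is a common similarity-based stability metric: the average pairwise Jaccard similarity between the feature subsets an FS method selects across runs or folds. The choice of Jaccard is an assumption for illustration only.

    ```python
    from itertools import combinations

    def jaccard(a, b):
        # Overlap of two feature subsets: |A ∩ B| / |A ∪ B|.
        a, b = set(a), set(b)
        return len(a & b) / len(a | b)

    def stability(feature_lists):
        # Average pairwise Jaccard similarity between the feature lists
        # selected on different runs/folds: 1.0 means identical lists,
        # 0.0 means completely disjoint lists.
        pairs = list(combinations(feature_lists, 2))
        return sum(jaccard(a, b) for a, b in pairs) / len(pairs)
    ```

    A method that returns the same gene list on every fold scores 1.0; one that returns disjoint lists scores 0.0, giving a single number on which lists (or FS methods) can be ranked.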