Search CORE

37 research outputs found

Adjusted Measures for Feature Selection Stability for Data Sets with Similar Features

Author: A Bommert
A Kalousis
A Statnikov
J Vanschoren
JE Hopcroft
L Lausser
L Yu
M Lang
M Zhang
M Zucknick
MS Rahman
P Jaccard
Z He
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/09/2020

For data sets with similar features, for example highly correlated features, most existing stability measures behave in an undesired way: they consider features that are almost identical but have different identifiers as different features. Existing adjusted stability measures, that is, stability measures that take into account the similarities between features, have major theoretical drawbacks. We introduce new adjusted stability measures that overcome these drawbacks. We compare them to each other and to existing stability measures based on both artificial and real sets of selected features. Based on the results, we suggest using one new stability measure that considers highly similar features as exchangeable.
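The behaviour described in this abstract can be illustrated with a small sketch. It uses the mean pairwise Jaccard index as a stand-in for an unadjusted stability measure; this is an assumption made for illustration, not one of the measures proposed in the paper. The feature names and the `groups` mapping are hypothetical, and merging similar features into one representative is only a toy way to show what treating them as exchangeable could mean.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard index of two feature sets (1.0 if both are empty)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def mean_pairwise_jaccard(selections):
    """Average Jaccard index over all pairs of selected feature sets."""
    pairs = list(combinations(selections, 2))
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical example: "x1" and "x1_copy" are almost identical features
# that carry different identifiers.
runs = [{"x1", "x7"}, {"x1_copy", "x7"}]

# Unadjusted: the two runs share only "x7", so stability looks low (1/3).
unadjusted = mean_pairwise_jaccard(runs)

# Toy adjustment (an assumption, not the paper's measure): map each feature
# to a representative of its similarity group before comparing.
groups = {"x1_copy": "x1"}
merged = [{groups.get(f, f) for f in run} for run in runs]
adjusted_toy = mean_pairwise_jaccard(merged)

print(unadjusted, adjusted_toy)
```

After merging, the two runs select the same set, so the toy adjusted value is 1.0 while the unadjusted value stays at one third.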

arXiv.org e-Print Archive

Crossref

Predicting disease progression in behavioral variant frontotemporal dementia

Author: Anderl‐Straub S.
Danek A.
Diehl‐Schmid J.
Fassbender K.
Fliessbach K.
Huppertz H.
Jahn H.
Kassubek J.
Kestler H.
Kornhuber J.
Landwehrmeyer B.
Lauer M.
Lausser L.
Lombardi J.
Ludolph A.
Obrig H.
Otto M.
Prudlo J.
Schneider A.
Schroeter M.
Semler E.
Synofzik M.
Uttner I.
Volk A.
Wiltfang J.
Publication venue: 'Wiley'
Publication date: 31/12/2021

Introduction: The behavioral variant of frontotemporal dementia (bvFTD) is a rare neurodegenerative disease. Reliable predictors of disease progression have not been sufficiently identified. We investigated multivariate magnetic resonance imaging (MRI) biomarker profiles for their value in predicting individual decline. Methods: One hundred five bvFTD patients were recruited from the German frontotemporal lobar degeneration (FTLD) consortium study. After defining two groups ("fast progressors" vs. "slow progressors"), we investigated the predictive value of MR brain volumes for disease progression rates by performing exhaustive screenings with multivariate classification models. Results: We identified areas that predict disease progression rate within 1 year. Prediction measures revealed an overall accuracy of 80% across our 50 top classification models. In particular, the pallidum, middle temporal gyrus, inferior frontal gyrus, cingulate gyrus, middle orbitofrontal gyrus, and insula occurred in these models. Discussion: Based on the revealed marker combinations, an individual prognosis seems feasible. This might be used in clinical studies on an individualized progression model.

PubMed Central

MPG.PuRe

Ensemble of a subset of kNN classifiers

Author: A Karatzoglou
Aris Perperoglou
Asma Gul
Berthold Lausen
C Müssel
D Mease
DF Nettleton
E Bauer
EW Steyerberg
J Hernández-Orallo
J Kruppa
L Breiman
L Lausser
Miftahuddin Miftahuddin
O Mahmoud
Osama Mahmoud
P Hall
P Melville
R Barandela
R Maclin
RJ Samworth
S Li
T Cover
T Hothorn
T Khoshgoftaar
Werner Adler
Z Liu
Zardad Khan
ZH Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Combining multiple classifiers, known as ensemble methods, can give substantial improvement in the prediction performance of learning algorithms, especially in the presence of non-informative features in the data sets. We propose an ensemble of a subset of kNN classifiers, ESkNN, for classification tasks, built in two steps. First, we choose classifiers based on their individual performance using the out-of-sample accuracy. The selected classifiers are then combined sequentially, starting from the best model, and assessed for collective performance on a validation data set. We use benchmark data sets with their original and some added non-informative features for the evaluation of our method. The results are compared with usual kNN, bagged kNN, random kNN, the multiple feature subset method, random forest and support vector machines. Our experimental comparisons on benchmark classification problems and simulated data sets reveal that the proposed ensemble gives better classification performance than the usual kNN and its ensembles, and performs comparably to random forest and support vector machines.
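The two steps named in this abstract can be sketched roughly as follows. This is a minimal illustration under several assumptions that the abstract does not confirm: candidate kNN models differ by random feature subset, "out-of-sample accuracy" is taken on a separate hold-out split, and a model joins the ensemble only if validation accuracy does not drop. All function names, parameters and the synthetic data are hypothetical, and the sketch is not the authors' ESkNN implementation.

```python
import random
from collections import Counter


def knn_predict(train_X, train_y, x, features, k=3):
    """Majority vote among the k nearest training points, using only `features`."""
    dists = sorted(
        (sum((row[f] - x[f]) ** 2 for f in features), label)
        for row, label in zip(train_X, train_y)
    )
    return Counter(label for _, label in dists[:k]).most_common(1)[0][0]


def ensemble_predict(members, x):
    """Majority vote over the predictions of all ensemble members."""
    return Counter(m(x) for m in members).most_common(1)[0][0]


def accuracy(predict, X, y):
    return sum(predict(x) == t for x, t in zip(X, y)) / len(y)


def build_ensemble(train, holdout, valid, n_models=20, n_feats=2, keep=10, seed=0):
    rng = random.Random(seed)
    (Xtr, ytr), (Xho, yho), (Xva, yva) = train, holdout, valid
    p = len(Xtr[0])
    # Step 1: rank candidate kNN models by individual hold-out accuracy.
    candidates = []
    for _ in range(n_models):
        feats = tuple(rng.sample(range(p), n_feats))  # assumed: random feature subsets
        model = lambda x, feats=feats: knn_predict(Xtr, ytr, x, feats)
        candidates.append((accuracy(model, Xho, yho), model))
    candidates.sort(key=lambda c: c[0], reverse=True)
    ranked = [m for _, m in candidates[:keep]]
    # Step 2: add models sequentially, best first, judged on the validation set.
    members = [ranked[0]]
    best = accuracy(lambda x: ensemble_predict(members, x), Xva, yva)
    for m in ranked[1:]:
        trial = members + [m]
        acc = accuracy(lambda x: ensemble_predict(trial, x), Xva, yva)
        if acc >= best:  # assumed acceptance rule
            members, best = trial, acc
    return members, best


def make_data(n, rng):
    """Synthetic data: feature 0 is informative, features 1-4 are pure noise."""
    X, y = [], []
    for _ in range(n):
        label = rng.randint(0, 1)
        X.append([label + rng.gauss(0, 0.3)] + [rng.gauss(0, 1) for _ in range(4)])
        y.append(label)
    return X, y


data_rng = random.Random(1)
train, holdout, valid = make_data(60, data_rng), make_data(30, data_rng), make_data(30, data_rng)
members, valid_acc = build_ensemble(train, holdout, valid)
print(len(members), valid_acc)
```

Because the first ranked model is always kept and later models are accepted only when validation accuracy does not decrease, the ensemble's validation accuracy in this sketch is never below that of its first member.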

University of Essex Research Repository

Crossref

Springer - Publisher Connector

Explore Bristol Research

A feature selection method for classification within functional genomics experiments based on the proportional overlapping score

Author: A Kikuchi
A Statnikov
A Ultsch
Andrew Harrison
Aris Perperoglou
Asma Gul
B Lausen
Berthold Lausen
C Cortes
C Ding
C Ma
C Müssel
C Zou
D Apiletti
DA Notterman
DeAndresSA Díaz‐Uriarte R
DG Altman
E Baralis
GJ Gordon
H Peng
H‐C Liu
J Fan
J Lu
K‐H Chen
L Breiman
L Lausser
M Dramiński
M Marczyk
Metodi V Metodiev
N De Jay
Osama Mahmoud
P Alhopuro
P Laiho
RN Jorissen
RS Croner
S Chiaretti
S Michiels
T Cover
T Jirapech‐Umpai
TR Golub
VG Tusher
W Talloen
Y Saeys
Y Su
Zardad Khan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Background: Microarray technology, as well as other functional genomics experiments, allows simultaneous measurements of thousands of genes within each sample. Both the prediction accuracy and interpretability of a classifier could be enhanced by performing the classification based only on selected discriminative genes. We propose a statistical method for selecting genes based on overlapping analysis of expression data across classes. This method results in a novel measure, called proportional overlapping score (POS), of a feature's relevance to a classification task. Results: We apply POS, along with four widely used gene selection methods, to several benchmark gene expression datasets. The experimental results of classification error rates computed using the Random Forest, k Nearest Neighbor and Support Vector Machine classifiers show that POS achieves a better performance. Conclusions: A novel gene selection method, POS, is proposed. POS analyzes the expression overlap across classes, taking into account the proportions of overlapping samples. It robustly defines a mask for each gene that allows it to minimize the effect of expression outliers. The constructed masks, along with a novel gene score, are exploited to produce the selected subset of genes.

University of Essex Research Repository

Crossref

Springer - Publisher Connector

PubMed Central

Explore Bristol Research

sAPPβ and sAPPα increase structural complexity and E/I input ratio in primary hippocampal neurons and alter Ca²⁺ homeostasis and CREB1-signaling

Author: Boeckers Tobias
Bott Patricia
Föhr Karl Josef
Hesse Raphael
Jackson Rosemary J
Kestler Hans A
Kroker Katja S
Lausser Ludwig
Proepper Christian
Rosenbrock Holger
Schwanzar Daniel
Spires-Jones Tara L
von Arnim Christine A F
von Einem Bjoern
Wagner Franziska
Walther Paul
Publication venue: 'Elsevier BV'
Publication date: 01/06/2018

Edinburgh Research Explorer

Expansins: roles in plant growth and potential applications in crop improvement

Author: A Azeez
A Lausser
A Lovisetto
A Voegele
A Yan
AK Boron
Anming Ding
AX Li
BP Downes
BR Kuluev
C Gaete-Eastman
C Zörb
CH Park
CM Geilfus
D Brummell
D Carroll
D Choi
DJ Cosgrove
DK Lee
DL Rayle
EJ Belfield
EP Harrison
ER Valdivia
F Carvajal
F Chen
F Dai
F Kerff
F Li
FA Hoekstra
H Hayama
H Kende
H Wang
HJ Lee
HT Cho
HW Lee
J Gustavsson
J Huang
J Ma
J Sampedro
J Zhou
J-S Xiong
JC Mollet
JK Rose
JKC Rose
JM Bae
K Morris
K Weitbrecht
L Bashline
L Jones
L Li
M Asif
M Farhad
M Kwasniewski
M Olarte-Lozano
MA Bauerfeind
MJ Devi
MJ Holdsworth
MM Chaves
MR Zhao
N Georgelis
N Ithal
NUS Kapu
OE Tovar-Herrera
P Lü
P Wei
P-C Wei
PB Green
PK Trivedi
PM Civello
Prince Marowa
Q Xu
RAM Vreeburg
RC O’Malley
S Abuqamar
S Dai
S Dal Santo
S Fudali
S McQueen-Mason
S Pien
S Won
S Zhou
SC Xing
SE Harmer
T Kawata
TE Gookin
VP Klink
W Guo
W Zhang
WE Finch-Savage
X Ding
X Li
Y Han
Y Lee
Y Li
Y Palapol
Y Shimizu
Yingzhen Kong
YR Kwon
Z Yu
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Chained correlations for feature selection

Author: A Burkovski
C Müssel
D François
I Guyon
J Jones
J Kraus
K Deb
L Breiman
L Lausser
M Kearns
M Sheffer
MW Kimpel
N Japkowicz
NC Berchtold
O Chapelle
P Bühlmann
R Bellman
R Caruana
RM Gobble
S Taudien
S Yu
SJ Pan
T Haferlach
TD Pfister
TM Cover
VN Vapnik
Y Chevaleyre
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Statistical Quality Measures and ROC-Optimization by Learning Vector Quantization Classifiers

Author: Biehl Michael
Kaden M
Kestler H.A.
Krauss J.M.
Lausser L.
Schmid M.
Villmann T.
Publication venue: UULM (Ulm University)
Publication date: 01/07/2014

Differentiation of multiple types of pancreatico-biliary tumors by molecular analysis of clinical specimens

Author: Buchholz M
Fiedler L
Giese N
Gress TM
Kestler HA
Lausser L
Michalski CW
Scarpa A
Sipos B
Werner J
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

Timely and accurate diagnosis of pancreatic ductal adenocarcinoma (PDAC) is critical in order to provide adequate treatment to patients. However, the clinical signs and symptoms of PDAC are shared by several types of malignant or benign tumors which may be difficult to differentiate from PDAC with conventional diagnostic procedures. Among others, these include ampullary cancers, solid pseudopapillary tumors, and adenocarcinomas of the distant bile duct, as well as inflammatory masses developing in chronic pancreatitis. Here, we report an approach to accurately differentiate between these different types of pancreatic masses based on molecular analysis of biopsy material. A total of 156 bulk tissue and fine needle aspiration biopsy samples were analyzed using a dedicated diagnostic cDNA array and a composite classification algorithm developed based on linear support vector machines. All five histological subtypes of pancreatic masses were clearly separable with 100% accuracy when using all 156 individual samples for classification. Generalized performance of the classification system was tested by 10x10-fold cross validation (100 test runs). Correct classification into the five diagnostic groups was demonstrated for 81.5% of 1,560 test set predictions. Performance increased to 85.3% accuracy when PDAC and distant bile duct carcinomas were combined in a single diagnostic class. Importantly, overall sensitivity of detection of malignant disease was 92.2%. The molecular diagnostic approach presented here is suitable to significantly aid in the differential diagnosis of undetermined pancreatic masses. To our knowledge, this is the first study reporting accurate differentiation between several types of pancreatico-biliary tumors in a single molecular analytical procedure.
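The counts quoted in this abstract (100 test runs, 1,560 test set predictions) follow from the evaluation design: 10 repeats of 10-fold cross-validation give 10 × 10 = 100 runs, and each of the 156 samples is tested once per repeat, giving 156 × 10 = 1,560 predictions. The sketch below only reproduces this arithmetic; the splitting scheme, function name and seed are assumptions, not the study's actual procedure.

```python
import random


def repeated_kfold(n_samples, n_folds=10, n_repeats=10, seed=0):
    """Yield (train_indices, test_indices) for each fold of each repeat."""
    rng = random.Random(seed)
    for _ in range(n_repeats):
        idx = list(range(n_samples))
        rng.shuffle(idx)  # new random partition for every repeat
        for f in range(n_folds):
            test = idx[f::n_folds]  # every n_folds-th sample forms one test fold
            test_set = set(test)
            train = [i for i in idx if i not in test_set]
            yield train, test


runs = list(repeated_kfold(156))
n_runs = len(runs)
n_test_predictions = sum(len(test) for _, test in runs)
print(n_runs, n_test_predictions)
```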

Catalogo dei prodotti della ricerca

Constraining classifiers in molecular analysis: invariance and robustness

Author: Anthony A
Attila Klimmek
Bishop C
Breiman L
Burkovski A
Casella G
Florian Schmid
Guo Y
Guyon I
Hans A. Kestler
Lausser L
Ludwig Lausser
Minsky M
Niyogi P
Robin Szekely
Singh D
Vapnik V
Vilar E
Publication venue: 'The Royal Society'

Crossref