Search CORE

42 research outputs found

Detecting meaningful compounds in complex class labels

Author: A Doan
A Pease
A Radford
E Hovy
H Leopold
H Schütze
I Mierswa
J Carletta
J Euzenat
J Mendling
JL Fleiss
JT Fernandez-Breis
M Ashburner
M d’Aquin
M Quesada-Martínez
M Sabou
P Shvaiko
PD Turney
SP Ponzetto
T Baldwin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Crossref

MAnnheim DOCument Server

Evolutionary multi-objective training set selection of data instances and augmentations for vocal detection

Author: A Kumar
CAC Coello
D Stoller
E Zitzler
E Zitzler
ER Miranda
F Pachet
G Acampora
I Mierswa
J Lemley
JR Cano
L Breiman
L Rabiner
N Beume
T Bäck
V Rao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/04/2019
Field of study

© Springer Nature Switzerland AG 2019. The size of publicly available music data sets has grown significantly in recent years, which allows training better classification models. However, training on large data sets is time-intensive and cumbersome, and some training instances might be unrepresentative and thus hurt classification performance regardless of the used model. On the other hand, it is often beneficial to extend the original training data with augmentations, but only if they are carefully chosen. Therefore, identifying a “smart” selection of training instances should improve performance. In this paper, we introduce a novel, multi-objective framework for training set selection with the target to simultaneously minimise the number of training instances and the classification error. Experimentally, we apply our method to vocal activity detection on a multi-track database extended with various audio augmentations for accompaniment and vocals. Results show that our approach is very effective at reducing classification error on a separate validation set, and that the resulting training set selections either reduce classification error or require only a small fraction of training instances for comparable performance

Crossref

Queen Mary Research Online

Feature engineering and a proposed decision-support system for systematic reviewers of medical evidence

Author: A Jimeno-Yepes
AM Cohen
AM Cohen
AM Cohen
BC Wallace
BR Luce
Christian Lovis
Dina Demner-Fushman
DM Blei
Eugene Tseytlin
F Boudin
G Del Fiol
H Bastian
H Kilicoglu
I Mierswa
J Chandler
J Yaffe
KA McKibbon
Kevin J. Mitchell
M Steyvers
MF Porter
NL Wilczynski
NL Wilczynski
NL Wilczynski
O Frunza
Q Zou
QT Zeng
R Klinkenberg
S Matwin
SR Dalal
SS Keerthi
T Bekhuis
T Bekhuis
T Bekhuis
T Hofmann
Tanja Bekhuis
TL Griffiths
X Huang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 27/01/2014
Field of study

Objectives: Evidence-based medicine depends on the timely synthesis of research findings. An important source of synthesized evidence resides in systematic reviews. However, a bottleneck in review production involves dual screening of citations with titles and abstracts to find eligible studies. For this research, we tested the effect of various kinds of textual information (features) on performance of a machine learning classifier. Based on our findings, we propose an automated system to reduce screeing burden, as well as offer quality assurance. Methods: We built a database of citations from 5 systematic reviews that varied with respect to domain, topic, and sponsor. Consensus judgments regarding eligibility were inferred from published reports. We extracted 5 feature sets from citations: alphabetic, alphanumeric +, indexing, features mapped to concepts in systematic reviews, and topic models. To simulate a two-person team, we divided the data into random halves. We optimized the parameters of a Bayesian classifier, then trained and tested models on alternate data halves. Overall, we conducted 50 independent tests. Results: All tests of summary performance (mean F3) surpassed the corresponding baseline, P<0.0001. The ranks for mean F3, precision, and classification error were statistically different across feature sets averaged over reviews; P-values for Friedman's test were .045, .002, and .002, respectively. Differences in ranks for mean recall were not statistically significant. Alphanumeric+ features were associated with best performance; mean reduction in screening burden for this feature type ranged from 88% to 98% for the second pass through citations and from 38% to 48% overall. Conclusions: A computer-assisted, decision support system based on our methods could substantially reduce the burden of screening citations for systematic review teams and solo reviewers. Additionally, such a system could deliver quality assurance both by confirming concordant decisions and by naming studies associated with discordant decisions for further consideration. © 2014 Bekhuis et al

Crossref

Directory of Open Access Journals

PubMed Central

D-Scholarship@Pitt

FigShare

Support Vector Machines and Kernels for Computational Biology

ISSN:1553-734XISSN:1553-735

Repository for Publications and Research Data

Crossref

Fraunhofer-ePrints

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Pattern Recognition Software and Techniques for Biological Image Analysis

Author: A Ben-Hur
A Kapelner
AE Carpenter
AL Tarca
AN Basavanhally
AR Gooding
B Misselwitz
BTM Roerdink
C Ding
CJ Fuller
D. Mark Eckley
DA Schiffmann
E Meijering
F Long
Fran Lewitter
G Cong
G Holmes
G Li
H Peng
H Peng
H Peng
H Peng
H Peng
H Peng
I Mierswa
IG Goldberg
Ilya G. Goldberg
J Johnston
J Vromen
J Zhou
JB Tenenbaum
JH Friedman
John D. Delaney
JP Carson
JR Swedlow
JR Swedlow
K Huang
K Kvilekval
L Shamir
L Shamir
L Shamir
L Vincent
LH Loo
Lior Shamir
LP Coelho
M Platani
M Pool
MJ Gardner
MR Lamprecht
MV Boland
MV Boland
MV Boland
N Orlov
Nikita Orlov
PJ Phillips
PK Sahoo
Q Wang
RE Fan
S Knudsen
ST Roweis
T Joachims
T Joachims
T Lindeberg
T Yoo
TJ Macura
TR Jones
TR Jones
TR Jones
V Ljosa
VN Vapnik
W Schroeder
Publication venue: Public Library of Science
Publication date: 01/11/2010
Field of study

The increasing prevalence of automated image acquisition systems is enabling new types of microscopy experiments that generate large image datasets. However, there is a perceived lack of robust image analysis systems required to process these diverse datasets. Most automated image analysis systems are tailored for specific types of microscopy, contrast methods, probes, and even cell types. This imposes significant constraints on experimental design, limiting their application to the narrow set of imaging methods for which they were designed. One of the approaches to address these limitations is pattern recognition, which was originally developed for remote sensing, and is increasingly being applied to the biology domain. This approach relies on training a computer to recognize patterns in images rather than developing algorithms or tuning parameters for specific image processing tasks. The generality of this approach promises to enable data mining in extensive image repositories, and provide objective and quantitative imaging assays for routine use. Here, we provide a brief overview of the technologies behind pattern recognition and its use in computer vision for biological and biomedical imaging. We list available software tools that can be used by biologists and suggest practical experimental considerations to make the best use of pattern recognition techniques for imaging assays

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Identification of Novel Functional Inhibitors of Acid Sphingomyelinase

Author: A Goede
A Guerra
A Haimovitz-Friedman
A Rebillard
A Sakata
A Schwarz
A Tropsha
AH Futerman
Astrid Friedl
BM Altura
C Andres
C de Duve
C De Simone
C Malaplate-Armand
C Williams
CA Lipinski
Christiane Mühle
D Amaratunga
D Canals
DA Konovalov
DG Altman
DL Streiner
DR Ragland
E Alpaydin
E Gulbins
E Gulbins
E Gulbins
E Schwarz
EB Roecker
EH Schuchman
EL Smith
Erich Gulbins
F Paris
FD Testai
G Dawson
G Gerebtzoff
G Pantaleo
GD Purvis 3rd
Gudrun M. Spitzer
H Fischer
H Grassmé
H Grassmé
H von der Voet
Howard Riezman
HP Deigner
HS Chung
I Guyon
I Kononenko
I Mierswa
IH Witten
J Kelder
J Kornhuber
J Kornhuber
J Kornhuber
J Kornhuber
J Kornhuber
J Riethmüller
JA Platts
Johannes Kornhuber
JP Hobson
K Kira
K Lanevskij
K Rose
K Thevissen
KA Becker
Klaus R. Liedl
L Breiman
L Zhang
LB Akella
Lothar Terfloth
M Feher
M Kölzer
M Kölzer
M Meloun
M Muehlbacher
M Pascual
M Zerara
MA Mikati
Markus Muehlbacher
Martin Reichel
MC Raff
MH Abraham
MK Cox
MN Ndengele
MW Bradbury
N Sakuragawa
O Cuvillier
P Garg
P Santana
P Willett
Philipp Tripal
PI Darroch
PP Van Veldhoven
R Bisiani
R Cecchelli
R Fluss
R Göggel
R Hurwitz
R Kohavi
R Kolesnick
R Liu
R Narayanan
R Todeschini
RE Toman
RJ Mintzer
RO Brady
RW Jenkins
S Albouz
S Cuzzocrea
S Elojeimy
S Jin
S Kirschnek
S Pandey
S Spiegel
S Spiegel
S Spiegel
S Trapp
S Vilar
S Wold
SR Mente
SS Castillo
Stefan Trapp
Stefanie Pechmann
T Ewing
Teja W. Groemer
U Norinder
V Svetnik
V Teichgräber
VA Levin
WJ Youden
WS Bush
X Han
X He
X-C Fu
Y Benjamini
Y Saeys
Y Wang
Y Yoshida
YA Hannun
ZF Yu
ZH Zhou
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

We describe a hitherto unknown feature for 27 small drug-like molecules, namely functional inhibition of acid sphingomyelinase (ASM). These entities named FIASMAs (Functional Inhibitors of Acid SphingoMyelinAse), therefore, can be potentially used to treat diseases associated with enhanced activity of ASM, such as Alzheimer's disease, major depression, radiation- and chemotherapy-induced apoptosis and endotoxic shock syndrome. Residual activity of ASM measured in the presence of 10 µM drug concentration shows a bimodal distribution; thus the tested drugs can be classified into two groups with lower and higher inhibitory activity. All FIASMAs share distinct physicochemical properties in showing lipophilic and weakly basic properties. Hierarchical clustering of Tanimoto coefficients revealed that FIASMAs occur among drugs of various chemical scaffolds. Moreover, FIASMAs more frequently violate Lipinski's Rule-of-Five than compounds without effect on ASM. Inhibition of ASM appears to be associated with good permeability across the blood-brain barrier. In the present investigation, we developed a novel structure-property-activity relationship by using a random forest-based binary classification learner. Virtual screening revealed that only six out of 768 (0.78%) compounds of natural products functionally inhibit ASM, whereas this inhibitory activity occurs in 135 out of 2028 (6.66%) drugs licensed for medical use in humans

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Online Research Database In Technology

Similarity Clustering of Music Files According to User Preference

Author: A.L. Uitdenbogerd
E. Unal
I. Mierswa
I. Mierswa
T. Kohonen
T. Kohonen
Z. Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

Segment and combine approach for non-parametric time-series classification

Author: H. Shimodaira
I. Mierswa
J. Alonso González
M. Kudo
M.W. Kadous
P. Geurts
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

peer reviewedThis paper presents a novel, generic, scalable, autonomous, and flexible supervised learning algorithm for the classification of multivariate and variable length time series. The essential ingredients of the algorithm are randomization, segmentation of time-series, decision tree ensemble based learning of subseries classifiers, combination of subseries classification by voting, and cross-validation based temporal resolution adaptation. Experiments are carried out with this method on 10 synthetic and real-world datasets. They highlight the good behavior of the algorithm on a large diversity of problems. Our results are also highly competitive with existing approaches from the literature

CiteSeerX

Crossref

Open Repository and Bibliography - Liège