
105 research outputs found

Unifying generative and discriminative learning principles

Author: A Bernal
A Culotta
A Feelders
A Mccallum
AE Kel
AY Ng
BP Lewis
C Burge
CM Bishop
D Cai
D Grossman
E Redhead
E Segal
E Wingender
F Pernkopf
G Bouchard
G Stormo
G Yeo
H Wallach
H Wettig
HE Peckham
I Ben-Gal
Ivo Grosse
J Aldrich
J Cerquides
J Grau
J Keilwagen
JA Lasserre
Jan Grau
Jens Keilwagen
JH Xue
M Maragkakis
M Tompa
M Zhang
Marc Strickert
O Yakhnenko
P Grünwald
R Greiner
R Raina
R Staden
RA Fisher
S Sonnenburg
SL Salzberg
Stefan Posch
T Abeel
T Hastie
TH Kim
Y Barash
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: The recognition of functional binding sites in genomic DNA remains one of the fundamental challenges of genome research. During the last decades, a plethora of different and well-adapted models has been developed, but little attention has been paid to the development of different and similarly well-adapted learning principles. Only recently was it noticed that discriminative learning principles can also be superior to generative ones in diverse bioinformatics applications. Results: Here, we propose a generalization of generative and discriminative learning principles containing the maximum likelihood, maximum a posteriori, maximum conditional likelihood, maximum supervised posterior, generative-discriminative trade-off, and penalized generative-discriminative trade-off learning principles as special cases, and we illustrate its efficacy for the recognition of vertebrate transcription factor binding sites. Conclusions: We find that the proposed learning principle helps to improve the recognition of transcription factor binding sites, enabling better computational approaches for extracting as much information as possible from valuable wet-lab data. We make all implementations available in the open-source library Jstacs so that this learning principle can be easily applied to other classification problems in the field of genome and epigenome analysis.
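As a rough illustration of how several learning principles can appear as special cases of a single objective, the sketch below scores a toy two-class model with a weighted sum of a conditional log-likelihood, a joint log-likelihood, and a log-prior. The toy model, the Beta prior, the function names, and the specific weight triples are assumptions made for illustration; they are not taken from the paper or from the Jstacs API.

```python
import math

def log_joint(data, pi, q):
    """Generative term: sum of log P(c, x) over (class, binary feature) pairs."""
    total = 0.0
    for c, x in data:
        px = q[c] if x == 1 else 1.0 - q[c]
        total += math.log(pi[c]) + math.log(px)
    return total

def log_conditional(data, pi, q):
    """Discriminative term: sum of log P(c | x) over the same pairs."""
    total = 0.0
    for c, x in data:
        joint = [pi[k] * (q[k] if x == 1 else 1.0 - q[k]) for k in range(len(pi))]
        total += math.log(joint[c] / sum(joint))
    return total

def log_prior(q, a=2.0, b=2.0):
    """Unnormalized Beta(a, b) log-density on each q[k] (hypothetical prior)."""
    return sum((a - 1.0) * math.log(v) + (b - 1.0) * math.log(1.0 - v) for v in q)

def unified_objective(data, pi, q, beta):
    """Weighted sum; beta = (weight_conditional, weight_joint, weight_prior)."""
    b_cond, b_joint, b_prior = beta
    return (b_cond * log_conditional(data, pi, q)
            + b_joint * log_joint(data, pi, q)
            + b_prior * log_prior(q))

# Assumed weight triples that reduce the objective to familiar principles.
WEIGHTS = {
    "ML":  (0.0, 1.0, 0.0),  # maximum likelihood
    "MAP": (0.0, 0.5, 0.5),  # maximum a posteriori
    "MCL": (1.0, 0.0, 0.0),  # maximum conditional likelihood
    "MSP": (0.5, 0.0, 0.5),  # maximum supervised posterior
}

# Toy data: (class label, binary feature).
data = [(0, 1), (0, 1), (0, 0), (1, 0), (1, 0), (1, 1)]
pi = [0.5, 0.5]   # class probabilities
q = [0.7, 0.3]    # P(x = 1 | class)

scores = {name: unified_objective(data, pi, q, w) for name, w in WEIGHTS.items()}
for name, value in scores.items():
    print(f"{name}: {value:.4f}")
```

In this sketch, setting the prior weight to zero and varying the other two weights gives an unpenalized trade-off between the generative and discriminative terms; a nonzero prior weight gives a penalized one.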

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Apples and oranges: avoiding different priors in Bayesian DNA sequence analysis

Author: A Bernal
A Culotta
A Feelders
AE Kel
AL Berger
AY Ng
C Burge
CM Bishop
D Cai
D Grossman
D Heckerman
D Klein
E Redhead
E Segal
F Pernkopf
G Yeo
GD Stormo
H Wallach
H Wettig
HE Peckham
I Ben-Gal
Ivo Grosse
J Cerquides
J Davis
J Goodman
J Grau
J Keilwagen
Jan Grau
Jens Keilwagen
L Narlikar
M Arita
M Meila-Predoviciu
M Tompa
M Zhang
MI Jordan
NK Kim
O Schulte
O Yakhnenko
P Grünwald
R Castelo
R Greiner
R Staden
S Chen
S Sonnenburg
SL Salzberg
Stefan Posch
T Fawcett
TH Kim
TM Chen
WL Buntine
Y Barash
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: One of the challenges of bioinformatics remains the recognition of short signal sequences in genomic DNA such as donor or acceptor splice sites, splicing enhancers or silencers, translation initiation sites, transcription start sites, transcription factor binding sites, nucleosome binding sites, miRNA binding sites, or insulator binding sites. During the last decade, a wealth of algorithms for the recognition of such DNA sequences has been developed and compared with the goal of improving their performance and deepening our understanding of the underlying cellular processes. Most of these algorithms are based on statistical models belonging to the family of Markov random fields, such as position weight matrix models, weight array matrix models, Markov models of higher order, or moral Bayesian networks. While many comparative studies have compared different learning principles or different statistical models, the influence of choosing different prior distributions for the model parameters when using different learning principles has been overlooked, possibly leading to questionable conclusions. Results: With the goal of allowing direct comparisons of different learning principles for models from the family of Markov random fields based on the same a priori information, we derive a generalization of the commonly used product-Dirichlet prior. We find that the derived prior behaves like a Gaussian prior close to the maximum and like a Laplace prior in the far tails. In two case studies, we illustrate the utility of the derived prior for a direct comparison of different learning principles with different models for the recognition of binding sites of the transcription factor Sp1 and human donor splice sites. Conclusions: We find that comparisons of different learning principles using the same a priori information can lead to conclusions different from those of previous studies in which the effect resulting from different priors has been neglected. The derived prior is implemented in the open-source library Jstacs to enable an easy application to comparative studies of different learning principles in the field of sequence analysis.
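The Gaussian-near-the-maximum, Laplace-in-the-tails behaviour can be checked numerically in a simplified setting. The sketch below assumes a two-symbol alphabet with a Beta(a1, a2) prior rewritten in a logistic parameterization, theta = 1 / (1 + exp(-lam)); this is an illustrative simplification with hypothetical names and hyperparameters, not the prior derived in the paper. Including the Jacobian of the transformation, the log-density in lam is a1 * lam - (a1 + a2) * log(1 + exp(lam)) up to a constant.

```python
import math

def softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))

def log_density(lam, a1, a2):
    """Unnormalized log-density of the transformed Beta(a1, a2) prior in lam."""
    return a1 * lam - (a1 + a2) * softplus(lam)

def slope(f, z, h=1e-4):
    """Central finite-difference estimate of the first derivative."""
    return (f(z + h) - f(z - h)) / (2.0 * h)

a1, a2 = 3.0, 2.0  # hypothetical hyperparameters
f = lambda z: log_density(z, a1, a2)

# The mode satisfies sigmoid(lam) = a1 / (a1 + a2), i.e. lam = log(a1 / a2).
mode = math.log(a1 / a2)

# Far tails: the log-density is close to linear, as for a Laplace density.
right_tail_slope = slope(f, 40.0)    # approaches -a2
left_tail_slope = slope(f, -40.0)    # approaches +a1

# Near the mode: the log-density is close to quadratic, as for a Gaussian.
h = 1e-3
curvature = (f(mode + h) - 2.0 * f(mode) + f(mode - h)) / (h * h)

print("slope at mode:", slope(f, mode))
print("right tail slope:", right_tail_slope)
print("left tail slope:", left_tail_slope)
print("curvature at mode:", curvature)
```

Under these assumptions the curvature at the mode is -(a1 + a2) * s * (1 - s) with s = a1 / (a1 + a2), so the local Gaussian approximation has variance (a1 + a2) / (a1 * a2).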

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

L'APPORT DE LA SOCIOLOGIE PRAGMATIQUE FRANÇAISE AUX ÉTUDES CRITIQUES EN MANAGEMENT

Crossref

Prosody-Based Recognition Of Spoken German Varieties

Author: F. Pernkopf
G. Kubin
M. Hagmüller
Micha Baum
V. Dizdarevic

An approach to the recognition of regional language varieties is presented. The algorithm is tested on utterances of 3 to 6 seconds' duration taken from large speech databases (SpeechDat) of Austrian and German German. The features are based only on the prosody of the speech and include parameters derived from the Fujisaki model and statistics of the fundamental frequency. Classification is performed using a multilayer perceptron and yielded a rate of 64% correct identification of the regional variety. Those result

CiteSeerX

Die Therapie der Schluckstörungen bei Spondylosis cervicalis ventralis

Author: A. Herrmann
C. Buetti-Bäuml
D. H. Collins
E. Pernkopf
F. J. Lang
F. Kuhlmann
G. Exner
G. Parade
M. Aufdermauer
W. Lesoine
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/1968

Crossref

Observations concerning the vascularization of the dura mater cerebri

Author: C. Elze
E. Pernkopf
F. Hochstetter
G. Schaltenbrand
G. Töndury
H. Ferner
H. Lauber
J. Lang
J. P. Schaeffer
J. W. Rolen
L. H. Weed
P. Glees
T. B. Johnston
W. G. Penfield
W. H. Hollinshead
W. Thiel
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/1973

Crossref

Efficient and Robust Machine Learning for Real-World Systems

Author: Froening H
Ghahramani Z
Mattina M
Peharz R
Pernkopf F
Pfeifenberger L
Roth W
Schindler G
Tschiatschek S
Zoehrer M

While machine learning is traditionally a resource-intensive task, embedded systems, autonomous navigation, and the vision of the Internet of Things fuel interest in resource-efficient approaches. These approaches require a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. On top of this, it is crucial to treat uncertainty in a consistent manner in all but the simplest applications of machine learning systems. In particular, a desideratum for any real-world system is to be robust in the presence of outliers and corrupted data, as well as being "aware" of its limits, i.e., the system should maintain and provide an uncertainty estimate over its own predictions. These complex demands are among the major challenges in current machine learning research and key to ensuring a smooth transition of machine learning technology into everyday applications. In this article, we provide an overview of the current state of the art of machine learning techniques facilitating these real-world requirements. First, we provide a comprehensive review of resource efficiency in deep neural networks, with a focus on techniques for model size reduction, compression, and reduced precision. These techniques can be applied during training or as post-processing and are widely used to reduce both computational complexity and memory footprint. As most (practical) neural networks are limited in the ways they can treat uncertainty, we contrast them with probabilistic graphical models, which readily serve these desiderata by means of probabilistic inference. In this way, we provide an extensive overview of the current state of the art of robust and efficient machine learning for real-world systems.

CUED - Cambridge University Engineering Department