
    Robustness and Generalization

    We derive generalization bounds for learning algorithms based on their robustness: the property that if a testing sample is "similar" to a training sample, then the testing error is close to the training error. This provides a novel approach, different from complexity or stability arguments, to studying the generalization of learning algorithms. We further show that a weak notion of robustness is both sufficient and necessary for generalizability, which implies that robustness is a fundamental property for learning algorithms to work.
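
    For intuition, the following is a minimal sketch of the form such a robustness-based bound typically takes; the (K, ε)-robustness notation and the exact constants are assumptions drawn from the standard formulation, not quoted from the paper.

        % Suppose the sample space can be partitioned into K disjoint cells such
        % that whenever a test point falls in the same cell as a training point,
        % their losses differ by at most \epsilon(s). If the loss is bounded by
        % M, then with probability at least 1 - \delta over an n-sample s:
        \[
        \left| L(\mathcal{A}_s) - L_{\mathrm{emp}}(\mathcal{A}_s) \right|
          \le \epsilon(s) + M \sqrt{\frac{2K \ln 2 + 2 \ln(1/\delta)}{n}},
        \]
        % where L and L_{emp} are the expected and empirical losses of the
        % hypothesis \mathcal{A}_s learned from the sample s.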

    Protection against avian necrotic enteritis after immunisation with NetB genetic or formaldehyde toxoids

    NetB (necrotic enteritis toxin B) is a recently identified β-pore-forming toxin produced by Clostridium perfringens. This toxin has been shown to play a major role in avian necrotic enteritis. In recent years, a dramatic increase in necrotic enteritis has been observed, especially in countries where the use of antimicrobial growth promoters in animal feedstuffs has been banned. The aim of this work was to determine whether immunisation with a NetB toxoid would provide protection against necrotic enteritis. The immunisation of poultry with a formaldehyde NetB toxoid or with a NetB genetic toxoid (W262A) resulted in the induction of antibody responses against NetB and provided partial protection against disease.

    Predicting gene function using hierarchical multi-label decision tree ensembles

    Background: S. cerevisiae, A. thaliana and M. musculus are well-studied organisms in biology, and the sequencing of their genomes was completed many years ago. It is still a challenge, however, to develop methods that automatically assign biological functions to the ORFs in these genomes. Different machine learning methods have been proposed to this end, but it remains unclear which method is to be preferred in terms of predictive performance, efficiency and usability.
    Results: We study the use of decision tree based models for predicting the multiple functions of ORFs. First, we describe an algorithm for learning hierarchical multi-label decision trees. These can simultaneously predict all the functions of an ORF, while respecting a given hierarchy of gene functions (such as FunCat or GO). We present new results obtained with this algorithm, showing that the trees it finds exhibit clearly better predictive performance than trees found by previously described methods. Nevertheless, the predictive performance of individual trees is lower than that of some recently proposed statistical learning methods. We show that ensembles of such trees are more accurate than single trees and are competitive with state-of-the-art statistical learning and functional linkage methods. Moreover, the ensemble method is computationally efficient and easy to use.
    Conclusions: Our results suggest that decision tree based methods are a state-of-the-art, efficient and easy-to-use approach to ORF function prediction.
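
    As a concrete illustration of the hierarchical multi-label setup, here is a minimal sketch using a multi-output tree ensemble with a parent-child consistency rule; the toy hierarchy, helper names, and the capping rule are illustrative assumptions, not the authors' actual implementation.

        # Hedged sketch: hierarchical multi-label prediction with a tree ensemble.
        import numpy as np
        from sklearn.ensemble import RandomForestClassifier

        # Toy function hierarchy (assumed): child term -> parent term.
        TERMS = ["metabolism", "metabolism.glycolysis", "transport", "transport.ion"]
        PARENT = {"metabolism.glycolysis": "metabolism", "transport.ion": "transport"}

        def fit_hmc_ensemble(X, Y, n_trees=100, seed=0):
            """Fit one multi-output forest over all hierarchy terms at once.
            Y is an (n_samples, n_terms) binary annotation matrix."""
            return RandomForestClassifier(n_estimators=n_trees, random_state=seed).fit(X, Y)

        def predict_consistent(model, X):
            """Per-term probabilities, with each child capped by its parent so
            predictions respect the hierarchy (a common consistency rule)."""
            # predict_proba returns one (n_samples, 2) array per term
            # (assumes both classes occur for every term in the training data)
            probs = np.column_stack([p[:, 1] for p in model.predict_proba(X)])
            for j, term in enumerate(TERMS):
                if term in PARENT:
                    probs[:, j] = np.minimum(probs[:, j], probs[:, TERMS.index(PARENT[term])])
            return probs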

    Gene ontology based transfer learning for protein subcellular localization

    Background: Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of the data may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources to exploit multi-aspect protein feature information. Gene ontology (hereinafter GO) uses a controlled vocabulary to describe biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenate the GO terms into a flat binary vector or apply majority-vote ensemble learning for protein subcellular localization, neither of which can estimate the individual discriminative abilities of the three aspects of gene ontology.
    Results: In this paper, we propose a Gene Ontology Based Transfer Learning Model (GO-TLM) for large-scale protein subcellular localization. The model transfers the signature-based homologous GO terms to the target proteins, and further constructs a reliable learning system to reduce the adverse effect of the potential false GO terms that result from evolutionary divergence. We derive three GO kernels from the three aspects of gene ontology to measure the GO similarity of two proteins, and two further spectrum kernels to measure the similarity of two protein sequences. We use simple non-parametric cross validation to explicitly weigh the discriminative abilities of the five kernels, so that the time and space complexity is greatly reduced compared to the complicated semi-definite programming and semi-indefinite linear programming. The five kernels are then linearly merged into a single kernel for protein subcellular localization. We evaluate GO-TLM against three baseline models, MultiLoc, MultiLoc-GO and Euk-mPLoc, on the benchmark datasets those models adopted. Five-fold cross validation shows that GO-TLM substantially improves accuracy over the baselines: 80.38% versus 67.40% for Euk-mPLoc, a 12.98% increase; 96.65% and 96.27% versus 89.60% and 89.60% for MultiLoc-GO on the MultiLoc plant and MultiLoc animal datasets, increases of 7.05% and 6.67%, respectively; and 97.14%, 95.90% and 96.85% versus 83.70%, 90.10% and 85.70% for MultiLoc-GO on the BaCelLoc plant, BaCelLoc fungi and BaCelLoc animal datasets, increases of 13.44%, 5.80% and 11.15%, respectively. On the BaCelLoc independent sets, GO-TLM achieves 81.25%, 80.45% and 79.46% on the BaCelLoc plant, fungi and animal holdout sets, respectively, compared with 76.00%, 60.00% and 73.00% for MultiLoc-GO, increases of 5.25%, 20.45% and 6.46%, respectively.
    Conclusions: Since direct homology-based GO term transfer may introduce noise and outliers to the target protein, we design an explicitly weighted kernel learning system (GO-TLM) that transfers to the target protein the known knowledge about related homologous proteins, which reduces the risk of outliers, shares knowledge between homologous proteins, and thus achieves better predictive performance for protein subcellular localization. Cross validation and independent-test results show that the homology-based GO term transfer and explicit weighting of the GO kernels substantially improve prediction performance.
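
    The kernel-weighting step lends itself to a short sketch: score each kernel by cross-validated accuracy, normalize the scores into weights, and merge. The scoring rule and helper names below are illustrative assumptions, not the authors' exact GO-TLM code.

        # Hedged sketch: explicit kernel weighting by cross validation.
        import numpy as np
        from sklearn.model_selection import cross_val_score
        from sklearn.svm import SVC

        def merge_kernels(kernels, y, folds=5):
            """kernels: list of precomputed (n, n) Gram matrices, e.g. three
            GO-aspect kernels plus two sequence spectrum kernels; y: labels."""
            scores = []
            for K in kernels:
                clf = SVC(kernel="precomputed")
                # CV accuracy of each kernel alone scores its discriminative ability
                scores.append(cross_val_score(clf, K, y, cv=folds).mean())
            w = np.array(scores) / np.sum(scores)  # normalize scores into weights
            K_merged = sum(wi * Ki for wi, Ki in zip(w, kernels))
            return K_merged, w

        # Usage: train SVC(kernel="precomputed") on the merged Gram matrix.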

    Machine learning for regulatory analysis and transcription factor target prediction in yeast

    High throughput technologies, including array-based chromatin immunoprecipitation, have rapidly increased our knowledge of transcriptional maps: the identity and location of regulatory binding sites within genomes. Still, the full identification of sites, even in lower eukaryotes, remains largely incomplete. In this paper we develop a supervised learning approach to site identification using support vector machines (SVMs) to combine 26 different data types. A comparison with the standard approach to site identification using position specific scoring matrices (PSSMs) for a set of 104 Saccharomyces cerevisiae regulators indicates that our SVM-based target classification is more sensitive (73% vs. 20%) when specificity and positive predictive value are held equal. We have applied our SVM classifier for each transcriptional regulator to all promoters in the yeast genome to obtain thousands of new targets, which are currently being analyzed and refined to limit the risk of classifier over-fitting. For the purpose of illustration we discuss several results, including biochemical pathway predictions for Gcn4 and Rap1. For both transcription factors the SVM predictions match well with the known biology of control mechanisms, and possible new roles for these factors are suggested, such as a function for Rap1 in regulating fermentative growth. We also examine the promoter melting temperature curves for the targets of YJR060W, and show that targets of this TF have potentially unique physical properties that distinguish them from other genes. The SVM output automatically provides the means to rank dataset features and thus to identify important biological elements. We use this property to rank classifying k-mers, thereby reconstructing known binding sites for several TFs, and to rank expression experiments, determining the conditions under which Fhl1, the factor responsible for expression of ribosomal protein genes, is active. Targets of Fhl1 are differentially expressed in the chosen conditions compared with the expression of average and negative-set genes. SVM-based classifiers provide a robust framework for the analysis of regulatory networks. Processing of classifier outputs can provide high quality predictions and biological insight into the functions of particular transcription factors. Future work on this method will focus on increasing the accuracy and quality of predictions using feature reduction and clustering strategies. Since predictions have been made for only 104 TFs in yeast, new classifiers will be built for the remaining 100 factors that have available binding data.
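
    The feature-ranking step has a simple linear-SVM reading: rank features by the magnitude of their learned weights. The sketch below assumes a linear kernel and illustrative feature names; it is not the paper's actual 26-data-type pipeline.

        # Hedged sketch: SVM target classification with weight-based feature ranking.
        import numpy as np
        from sklearn.svm import LinearSVC

        def rank_features(X, y, feature_names):
            """X: heterogeneous features (k-mer counts, ChIP signals, expression
            values, ...); y: 1 = known target of the regulator, 0 = non-target."""
            clf = LinearSVC(C=1.0, max_iter=10000).fit(X, y)
            order = np.argsort(-np.abs(clf.coef_[0]))  # largest |weight| first
            return [(feature_names[i], float(clf.coef_[0][i])) for i in order]

        # Top-ranked k-mer features should reconstruct the binding motif; top-ranked
        # expression experiments suggest conditions under which the TF is active.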

    A computational procedure for functional characterization of potential marker genes from molecular data: Alzheimer's as a case study

    Background: A molecular characterization of Alzheimer's Disease (AD) is the key to the identification of altered gene sets that lead to AD progression. We rely on the assumption that candidate marker genes for a given disease belong to specific pathogenic pathways, and we aim at unveiling those pathways that are stable across tissues, treatments and measurement systems. In this context, we analyzed three heterogeneous datasets, two microarray gene expression sets and one protein abundance set, applying a recently proposed feature selection method based on regularization.
    Results: For each dataset we identified a signature that was subsequently evaluated from both the computational and the functional characterization viewpoints, estimating the classification error and retrieving the most relevant biological knowledge from different repositories. Each signature includes genes already known to be related to AD and genes that are likely to be involved in the pathogenesis or in the disease progression. The integrated analysis revealed a meaningful overlap at the functional level.
    Conclusions: The identification of three gene signatures showing a relevant overlap of pathways and ontologies increases the likelihood of finding potential marker genes for AD.
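
    The regularization-based selection step can be sketched briefly; the use of an l1-penalized classifier below is an assumption (the paper's exact regularizer and parameters may differ), with the signature read off the nonzero coefficients.

        # Hedged sketch: regularized feature selection for candidate marker genes.
        import numpy as np
        from sklearn.linear_model import LogisticRegression

        def select_signature(X, y, gene_names, C=0.1):
            """Fit an l1-penalized classifier (AD vs. control) and keep the
            genes with nonzero coefficients as this dataset's signature."""
            clf = LogisticRegression(penalty="l1", solver="liblinear", C=C).fit(X, y)
            return [gene_names[i] for i in np.flatnonzero(clf.coef_[0])]

        # Run once per dataset (two expression sets, one protein-abundance set),
        # then compare the three signatures at the pathway/ontology level.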

    Biomedical Discovery Acceleration, with Applications to Craniofacial Development

    The profusion of high-throughput instruments and the explosion of new results in the scientific literature, particularly in molecular biomedicine, are both a blessing and a curse to the bench researcher. Even knowledgeable and experienced scientists can benefit from computational tools that help navigate this vast and rapidly evolving terrain. In this paper, we describe a novel computational approach to this challenge, a knowledge-based system that combines reading, reasoning, and reporting methods to facilitate analysis of experimental data. Reading methods extract information from external resources, either by parsing structured data or by using biomedical language processing to extract information from unstructured data, and track knowledge provenance. Reasoning methods enrich the knowledge that results from reading by, for example, noting that two genes are annotated to the same ontology term or database entry. Reasoning is also used to combine all sources into a knowledge network that represents the integration of all sorts of relationships between a pair of genes, and to calculate a combined reliability score. Reporting methods combine the knowledge network with a congruent network constructed from experimental data and visualize the combined network in a tool that facilitates the knowledge-based analysis of those data. An implementation of this approach, called the Hanalyzer, is demonstrated on a large-scale gene expression array dataset relevant to craniofacial development. The tool was critical in the creation of hypotheses regarding the roles of four genes never previously characterized as involved in craniofacial development; each of these hypotheses was validated by further experimental work.
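
    For the reliability-combination step, one common way to merge independent evidence sources into a single edge score is a noisy-OR rule; this is an illustrative assumption, not necessarily the Hanalyzer's actual scoring function.

        # Hedged sketch: combining per-source edge reliabilities (noisy-OR).
        from functools import reduce

        def combined_reliability(source_scores):
            """source_scores: reliabilities in [0, 1] for one gene-pair edge, one
            per evidence source (ontology co-annotation, database entry, text
            mining, ...). Noisy-OR: the edge fails only if every source fails."""
            return 1.0 - reduce(lambda acc, r: acc * (1.0 - r), source_scores, 1.0)

        # Example: three sources supporting the same gene pair
        print(combined_reliability([0.6, 0.3, 0.5]))  # -> 0.86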

    Application of the bacteriophage Mu-driven system for the integration/amplification of target genes in the chromosomes of engineered Gram-negative bacteria—mini review

    The advantages of phage Mu transposition-based systems for the chromosomal editing of plasmid-less strains are reviewed. The cis and trans requirements for Mu phage-mediated transposition, which include the L/R ends of the Mu DNA, the transposition factors MuA and MuB, and the cis/trans functioning of the E element as an enhancer, are presented. Mini-Mu(LR)/(LER) units are Mu derivatives that lack most of the Mu genes but contain the L/R ends or a properly arranged E element in cis to the L/R ends. The dual-component system, which consists of an integrative plasmid with a mini-Mu and an easily eliminated helper plasmid encoding inducible transposition factors, is described in detail as a tool for the integration/amplification of recombinant DNAs. This chromosomal editing method is based on replicative transposition through the formation of a cointegrate that can be resolved in a recombination-dependent manner. (E-plus)- or (E-minus)-helpers that differ in the presence of the trans-acting E element are used to achieve the proper mini-Mu transposition intensity. The systems that have been developed for the construction of stably maintained mini-Mu multi-integrant strains of Escherichia coli and Methylophilus methylotrophus are described. A novel integration/amplification/fixation strategy is proposed for consecutive independent replicative transpositions of different mini-Mu(LER) units with “excisable” E elements in methylotrophic cells.

    Towards Comprehensive Foundations of Computational Intelligence

    Although computational intelligence (CI) covers a vast variety of different methods, it still lacks an integrative theory. Several proposals for CI foundations are discussed: computing and cognition as compression, meta-learning as search in the space of data models, (dis)similarity based methods providing a framework for such meta-learning, and a more general approach based on chains of transformations. Many useful transformations that extract information from features are discussed. Heterogeneous adaptive systems are presented as a particular example of transformation-based systems, and the goal of learning is redefined to facilitate the creation of simpler data models. The need to understand data structures leads to techniques for logical and prototype-based rule extraction and to the generation of multiple alternative models, while the need to increase the predictive power of adaptive models leads to committees of competent models. Learning from partial observations is a natural extension towards reasoning based on perceptions, and an approach to intuitive solving of such problems is presented. Throughout the paper neurocognitive inspirations are frequently used and are especially important in modeling of the higher cognitive functions. Promising directions such as liquid and laminar computing are identified and many open problems are presented.