Search CORE

42 research outputs found

Application of regulatory sequence analysis and metabolic network analysis to the interpretation of gene expression data

Author: A. Brazma
A.J. Enright
D. Gilbert
D. Thomas
E. Wingender
E.M. Marcotte
E.M. Marcotte
G. Reinert
H. Salgado
J. Helden van
J. Helden van
J. Helden van
J. Helden van
J. Helden van
J.H. Graber
J.L. DeRisi
M. Kanehisa
M. Pellegrini
M.B. Eisen
M.B. Eisen
P. Tamayo
P.D. Karp
P.O. Brown
P.T. Spellman
Publication venue: JOBIM
Publication date: 01/01/2000
Field of study

We present two complementary approaches for the interpretation of clusters of co-regulated genes, such as those obtained from DNA chips and related methods. Starting from a cluster of genes with similar expression profiles, two basic questions can be asked: 1. Which mechanism is responsible for the coordinated transcriptional response of the genes? This question is approached by extracting motifs that are shared between the upstream sequences of these genes. The motifs extracted are putative cis-acting regulatory elements. 2. What is the physiological meaning for the cell to express together these genes? One way to answer the question is to search for potential metabolic pathways that could be catalyzed by the products of the genes. This can be done by selecting the genes from the cluster that code for enzymes, and trying to assemble the catalyzed reactions to form metabolic pathways. We present tools to answer these two questions, and we illustrate their use with selected examples in the yeast Saccharomyces cerevisiae. The tools are available on the web (http://ucmb.ulb.ac.be/bioinformatics/rsa-tools/; http://www.ebi.ac.uk/research/pfbp/; http://www.soi.city.ac.uk/~msch/)

CiteSeerX

Crossref

DI-fusion

Brunel University Research Archive

Semi-supervised prediction of protein interaction sentences exploiting semantically encoded metrics

Author: D.D. Lewis
E.M. Marcotte
J.D. Kim
K. Lund
L. Azzopardi
M. Girolami
M.N. Jones
M.N. Jones
R. Bunescu
S. Padó
S. Pyysalo
S. Rogers
T. Joachims
T.K. Landauer
Z. Minier
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Protein-protein interaction (PPI) identification is an integral component of many biomedical research and database curation tools. Automation of this task through classification is one of the key goals of text mining (TM). However, labelled PPI corpora required to train classifiers are generally small. In order to overcome this sparsity in the training data, we propose a novel method of integrating corpora that do not contain relevance judgements. Our approach uses a semantic language model to gather word similarity from a large unlabelled corpus. This additional information is integrated into the sentence classification process using kernel transformations and has a re-weighting effect on the training features that leads to an 8% improvement in F-score over the baseline results. Furthermore, we discover that some words which are generally considered indicative of interactions are actually neutralised by this process

Infinite-Order Percolation and Giant Fluctuations in a Protein Interaction Network

Author: A. Goffeau
A. Wagner
A.J. Enright
B. Kahng
D.S. Callaway
E.M. Marcotte
F. Slanina
H. Jeong
J. Kim
J.-C. Rain
P. L. Krapivsky
P.L. Krapivsky
P.L. Krapivsky
P.L. Krapivsky
P.L. Uetz
R. Albert
R.V. Solé
S. Redner
S.H. Strogatz
S.N. Dorogovtsev
S.N. Dorogovtsev
T. Ito
T. Ito
Publication venue: 'American Physical Society (APS)'
Publication date: 27/09/2002
Field of study

We investigate a model protein interaction network whose links represent interactions between individual proteins. This network evolves by the functional duplication of proteins, supplemented by random link addition to account for mutations. When link addition is dominant, an infinite-order percolation transition arises as a function of the addition rate. In the opposite limit of high duplication rate, the network exhibits giant structural fluctuations in different realizations. For biologically-relevant growth rates, the node degree distribution has an algebraic tail with a peculiar rate dependence for the associated exponent.Comment: 4 pages, 2 figures, 2 column revtex format, to be submitted to PRL 1; reference added and minor rewording of the first paragraph; Title change and major reorganization (but no result changes) in response to referee comments; to be published in PR

arXiv.org e-Print Archive

Crossref

Classification of protein interaction sentences via gaussian processes

Author: A. Aizerman
A.M. Cohen
C.D. Manning
C.D. Manning
C.E. Rasmussen
C.H. Ding
D.D. Lewis
E.M. Marcotte
H. Chen
J. Huang
J.C. Platt
J.D. Kim
J.H. Albert
K. Crammer
K. Sugiyama
K.M.A. Chai
M. Girolami
M. Girolami
N. Lama
N. Lawrence
R. Bunescu
S. Rogers
S.S. Keerthi
Silva
T. Joachims
V. Vapnik
W. Chu
W. Chu
Y. Hao
Y. Lee
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

The increase in the availability of protein interaction studies in textual format coupled with the demand for easier access to the key results has lead to a need for text mining solutions. In the text processing pipeline, classification is a key step for extraction of small sections of relevant text. Consequently, for the task of locating protein-protein interaction sentences, we examine the use of a classifier which has rarely been applied to text, the Gaussian processes (GPs). GPs are a non-parametric probabilistic analogue to the more popular support vector machines (SVMs). We find that GPs outperform the SVM and na\"ive Bayes classifiers on binary sentence data, whilst showing equivalent performance on abstract and multiclass sentence corpora. In addition, the lack of the margin parameter, which requires costly tuning, along with the principled multiclass extensions enabled by the probabilistic framework make GPs an appealing alternative worth of further adoption

Mitochondrial and chloroplast localization of FtsH-like proteins in sugarcane based on their phylogenetic profile

Author: Adam Z.
Akiyama Y.
Akiyama Y.
Alexandre S. Guedes Coelho
Arlt H.
Chen M.
Emanuelsson O.
Hannenhalli S.S.
Hugueney P.
Itoh R.
Juhola M. K.
Karata K.
Kunau W-H.
Langer T.
Leonhard K.
Leonhard K.
Lindahl M.
Lindahl M.
Marcio C. Silva-Filho
Marcotte E.M.
Margulis L.
Osterseter O.
Page R.D.M.
Patel S.
Paul M.F.
Phellippe A. Santos Marbach
Saitou N.
Seo S.
Shah Z.H.
Shotland Y.
Swaffield J.C.
Thompson J.D.
Thorsness P.E.
Tomoyasu T.
Walker J.E.
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Crossref

But what does that gene do?

Author: Eleanor Lawrence
Pellegrini M Marcotte,E.M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Ranking for Medical Annotation: Investigating Performance, Local Search and Homonymy Recognition

Author: B. Boeckmann
E.M. Marcotte
L. Hirschman
P.B. Dobrokhotov
R. Duda
R.C. Holte
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

Crossref

A role for central spindle proteins in cilia structure and function

Author: Basten S.G.
Giles R.H.
Kieserman E.K.
Marcotte E.M.
Smith K.R.
Wallingford J.B.
Wang P.I.
Publication venue
Publication date: 01/02/2011
Field of study

Cytokinesis and ciliogenesis are fundamental cellular processes that require strict coordination of microtubule organization and directed membrane trafficking. These processes have been intensely studied, but there has been little indication that regulatory machinery might be extensively shared between them. Here, we show that several central spindle/midbody proteins (PRC1, MKLP-1, INCENP, centriolin) also localize in specific patterns at the basal body complex in vertebrate ciliated epithelial cells. Moreover, bioinformatic comparisons of midbody and cilia proteomes reveal a highly significant degree of overlap. Finally, we used temperature-sensitive alleles of PRC1/spd-1 and MKLP-1/zen-4 in C. elegans to assess ciliary functions while bypassing these proteins' early role in cell division. These mutants displayed defects in both cilia function and cilia morphology. Together, these data suggest the conserved reuse of a surprisingly large number of proteins in the cytokinetic apparatus and in cilia

Utrecht University Repository

Finding all common intervals of k permutations

Author: B. Snel
D. Fulkerson
E.M. Marcotte
H. Mühlenbein
K.S. Booth
M.C. Golumbic
R. Overbeek
R.M. Brady
T. Uno
Publication venue: Springer Verlag
Publication date: 01/01/2001
Field of study

1 Introduction Let \Pi = (ss1; : : : ; ssk) be a family of k permutations of N = f1; 2; : : : ; ng. A k-tuple of intervals of these permutations consisting of the same set of elements is called a common interval

CiteSeerX

Crossref

Gene fusion in Helicobacter pylori: Making the ends meet

Author: A. Marchler-Bauer
A. Sali
C. Mering von
D.J. Lipman
E. Berthonneau
E.M. Marcotte
E.M. Marcotte
E.M. Marcotte
F.M. Katzen
I. Yanai
I.B. Rogozin
I.G. Boneca
I.K. Jordan
J.C. Mellor
J.D. Yourno
J.F. Tomb
K. Isono
K. Suhre
K.R. Sakharkar
Kishore R. Sakharkar
M. Long
M.K. Sakharkar
M.K. Sakharkar
M.Y. Galperin
M.Y. Galperin
Meena K. Sakharkar
N.R. Salama
R. Yelin
R.A. Alm
R.L. Tatusov
R.L. Tatusov
S. Tsoka
Vincent T. K. Chow
Y. Fukuda
Z.L. Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

10.1007/s10482-005-9021-2Antonie van Leeuwenhoek, International Journal of General and Molecular Microbiology891169-18

Crossref

ScholarBank@NUS