Search CORE

Evaluating Ortholog Prediction Algorithms in a Yeast Model Clade

Author: A Alexeyenko
A Goffeau
A Kuzniar
A Rokas
AM Altenhoff
Antonis Rokas
B Dujon
BN Kent
C Dessimoz
C Vogel
Cecile Fairhead
CEV Storm
CEV Storm
CM Zmasek
CP Kurtzman
DA Fitzpatrick
DP Mindell
DP Wall
DP Wall
DR Scannell
EV Koonin
EV Koonin
EW Sayers
F Chen
F Chen
F Lemoine
FS Dietrich
I Wapinski
I Wapinski
J Ehrlich
JC Chiu
JL Gordon
K Liolios
KH Wolfe
KP Byrne
KP O'Brien
L Li
L Salichos
LA Mirny
LB Koski
Leonidas Salichos
M Kellis
M Remm
MP Cummings
O Akerborg
P Bork
P Bork
P Cliften
R Overbeek
RL Tatusov
RL Tatusov
RR Sokal
S Grossetete
SF Altschul
SF Altschul
T Hulsen
TF DeLuca
V van Noort
WM Fitch
Publication venue: Public Library of Science
Publication date: 13/04/2011
Field of study

RSD, respectively, so that they can predict orthologs across multiple taxa) against a set of 2,723 groups of high-quality curated orthologs from 6 Saccharomycete yeasts in the Yeast Gene Order Browser. of all algorithms dramatically increased in these traps.) for evolutionary and functional genomics studies where the objective is the accurate inference of single-copy orthologs (e.g., molecular phylogenetics), but that all algorithms fail to accurately predict orthologs when paralogy is rampant

Discovering local patterns of co - evolution: computational aspects and biological examples

Author: A Tanay
B Dujon
B Snel
C Goh
D Barker
D Barker
D Chamovitz
D Juan
D Ober
D Scannell
DM Krylov
DP Wall
E Oron
F Pazos
F Pazos
I Wapinski
J Wu
JB MacQueen
K Wolfe
LM o Rami'rez
M Benton
Martin Kupiec
O Man
P Jaccard
PM Bowers
R Chenna
R Singh
RL Tatusov
S Grossmann
S Ohno
T Przytycka
T Pupko
T Tuller
T Tuller
Tamir Tuller
TD Bie
Y Chena
Y Cheng
Yifat Felder
Z Yang
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Co-evolution is the process in which two (or more) sets of orthologs exhibit a similar or correlative pattern of evolution. Co-evolution is a powerful way to learn about the functional interdependencies between sets of genes and cellular functions and to predict physical interactions. More generally, it can be used for answering fundamental questions about the evolution of biological systems. Orthologs that exhibit a strong signal of co-evolution in a certain part of the evolutionary tree may show a mild signal of co-evolution in other branches of the tree. The major reasons for this phenomenon are noise in the biological input, genes that gain or lose functions, and the fact that some measures of co-evolution relate to rare events such as positive selection. Previous publications in the field dealt with the problem of finding sets of genes that co-evolved along an entire underlying phylogenetic tree, without considering the fact that often co-evolution is local. Results In this work, we describe a new set of biological problems that are related to finding patterns of <it>local </it>co-evolution. We discuss their computational complexity and design algorithms for solving them. These algorithms outperform other bi-clustering methods as they are designed specifically for solving the set of problems mentioned above. We use our approach to trace the co-evolution of fungal, eukaryotic, and mammalian genes at high resolution across the different parts of the corresponding phylogenetic trees. Specifically, we discover regions in the fungi tree that are enriched with positive evolution. We show that metabolic genes exhibit a remarkable level of co-evolution and different patterns of co-evolution in various biological datasets. In addition, we find that protein complexes that are related to gene expression exhibit non-homogenous levels of co-evolution across different parts of the <it>fungi </it>evolutionary line. In the case of mammalian evolution, signaling pathways that are related to <it>neurotransmission </it>exhibit a relatively higher level of co-evolution along the <it>primate </it>subtree. Conclusions We show that finding local patterns of co-evolution is a computationally challenging task and we offer novel algorithms that allow us to solve this problem, thus opening a new approach for analyzing the evolution of biological systems.</p

Springer - Publisher Connector

The Aspergillus Genome Database, a curated comparative genomics resource for gene, protein and sequence information for the Aspergillus research community

Author: Adil Lotia
Arnaud
Ashburner
Aslett
Boyle
Bult
Consortium
Costanzo
Crabtree
Diane O. Inglis
Gail Binkley
Galagan
Gavin Sherlock
Hong
Jennifer R. Wortman
Jonathan Crabtree
Joshua Orvis
Mabey Gilsenan
Machida
Marcus C. Chibucos
Marek S. Skrzypek
Maria C. Costanzo
Martha B. Arnaud
Nierman
Prachi Shah
Remm
Rhee
Sprague
Stein
Stuart R. Miyasato
Tweedie
Twigger
Wapinski
Wortman
Publication venue: Oxford University Press
Publication date
Field of study

The Aspergillus Genome Database (AspGD) is an online genomics resource for researchers studying the genetics and molecular biology of the Aspergilli. AspGD combines high-quality manual curation of the experimental scientific literature examining the genetics and molecular biology of Aspergilli, cutting-edge comparative genomics approaches to iteratively refine and improve structural gene annotations across multiple Aspergillus species, and web-based research tools for accessing and exploring the data. All of these data are freely available at http://www.aspgd.org. We welcome feedback from users and the research community at [email protected]

An Atlas of the Speed of Copy Number Changes in Animal Gene Families and Its Implications

Author: AV Furano
C Vogel
D Pan
DA Petrov
DA Petrov
Deng Pan
DM Krylov
E Birney
EG Danchin
F Raible
H Frohlich
H Roest Crollius
I Wapinski
J Felsenstein
J Jiang
JA Graves
JK Killian
JP Demuth
JZ Zhang
L Aravind
Liqing Zhang
M Karuppasamy
M Lynch
MJ Wakefield
MW Hahn
N Lopez-Bigas
N Saitou
O Lespinet
P Pavlidis
Pawel Michalak
PM Harrison
RD Kortschak
SB Hedges
T Blomme
TJ Sargeant
Y Bai
Publication venue: Public Library of Science
Publication date: 23/10/2009
Field of study

The notion that gene duplications generating new genes and functions is commonly accepted in evolutionary biology. However, this assumption is more speculative from theory rather than well proven in genome-wide studies. Here, we generated an atlas of the rate of copy number changes (CNCs) in all the gene families of ten animal genomes. We grouped the gene families with similar CNC dynamics into rate pattern groups (RPGs) and annotated their function using a novel bottom-up approach. By comparing CNC rate patterns, we showed that most of the species-specific CNC rates groups are formed by gene duplication rather than gene loss, and most of the changes in rates of CNCs may be the result of adaptive evolution. We also found that the functions of many RPGs match their biological significance well. Our work confirmed the role of gene duplication in generating novel phenotypes, and the results can serve as a guide for researchers to connect the phenotypic features to certain gene duplications

ProteinHistorian: Tools for the Comparative Analysis of Eukaryote Protein Origin

Author: A Alexeyenko
A Vishnoi
Alexander G. Williams
Andreas Prlic
CW Dunn
D Bhattacharya
D Binns
E Boyle
E Jones
EW Sayers
I Wapinski
J Sukumaran
JA Capra
JD Hunter
JJ Cai
JJ Cai
JJ Cai
John A. Capra
JS Ferris
Katherine S. Pollard
L Li
M Ashburner
M Csürös
M Levine
M Mar Albà
M Pellegini
M Punta
M Warnefors
MD Hirschey
O Cohen
PD Thomas
R Blekhman
S Heinicke
SB Hedges
T Domazet-Loso
T Domazet-Loso
T Domazet-Lošo
T Hulsen
WK Kim
X Xiong
YI Wolf
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The evolutionary history of a protein reflects the functional history of its ancestors. Recent phylogenetic studies identified distinct evolutionary signatures that characterize proteins involved in cancer, Mendelian disease, and different ontogenic stages. Despite the potential to yield insight into the cellular functions and interactions of proteins, such comparative phylogenetic analyses are rarely performed, because they require custom algorithms. We developed ProteinHistorian to make tools for performing analyses of protein origins widely available. Given a list of proteins of interest, ProteinHistorian estimates the phylogenetic age of each protein, quantifies enrichment for proteins of specific ages, and compares variation in protein age with other protein attributes. ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships. We illustrate the use of ProteinHistorian with three example analyses. First, we demonstrate that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression. Next, we show that human proteins with annotated regulatory functions are significantly younger than proteins with catalytic functions. Finally, we compare protein length and age in many eukaryotic species and, as expected from previous studies, find a positive, though often weak, correlation between protein age and length. ProteinHistorian is available through a web server with an intuitive interface and as a set of command line tools; this allows biologists and bioinformaticians alike to integrate these approaches into their analysis pipelines. ProteinHistorian's modular, extensible design facilitates the integration of new datasets and algorithms. The ProteinHistorian web server, source code, and pre-computed ages for 32 eukaryotic genomes are freely available under the GNU public license at http://lighthouse.ucsf.edu/ProteinHistorian/

eScholarship - University of California

Patterns and Mechanisms of Ancestral Histone Protein Inheritance in Budding Yeast

Author: A Corpet
A Goren
A Groth
A Groth
A Jamai
A Lengronne
A Rufiange
A Verreault
A Weiner
A. D McConnell
A. T Annunziato
A. V Probst
Assaf Weiner
C Bonne-Andrea
C Gruss
C Hodges
C Jiang
C Thiriet
C Yu
C. L Liu
D. A Koster
D. E Gottschling
D. J Stillman
D. K Pokholok
E. F Glynn
E. K Hoffmann
F Frederiks
F. C Holstege
Fred van Leeuwen
G. C Yuan
I Gat-Viks
I Tirosh
I Wapinski
I. B Dodd
J Lopes da Rosa
J. A Sharp
J. M Schulze
J. M Sogo
K. F Verzijlbergen
K. L Huisinga
Kitty F. Verzijlbergen
L Pillus
L Ringrose
L. N Rusche
M Durand-Dubief
M Ptashne
M Radman-Livaja
M Radman-Livaja
M Sedighi
M. E Fernandez-Beros
M. F Dion
M. K Raghuraman
Marta Radman-Livaja
Nir Friedman
O Matangkasombut
O. I Kulaeva
O. I Kulaeva
O. I Kulaeva
O. J Rando
Oliver J. Rando
P. B Talbert
P. D Kaufman
Peter B. Becker
R Gasser
R. B Deal
S Smith
S. K Randall
T Goto
T Kaplan
T Kouzarides
T Krude
T Tsubota
T. S Kim
Tibor van Welsem
U. J Schermer
V Jackson
V Jackson
V Jackson
V. M Studitsky
V. M Studitsky
W. C Au
Y Mito
Y Pommier
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Tracking of ancestral histone proteins over multiple generations of genome replication in yeast reveals that old histones move along genes from 3′ toward 5′ over time, and that maternal histones move up to around 400 bp during genomic replication

Up-regulation of long non-coding RNA PANDAR is associated with poor prognosis and promotes tumorigenesis in bladder cancer

Author: A Stenzl
Anbang He
C Wang
C Zhuang
Chengle Zhuang
CL Zhuang
F Suriano
GN Marta
Guoping Zhao
J Chen
J Lin
J Zhou
JA Witjes
JD Lewis
Junhao Lin
KC Wang
L Liu
L Yang
Li Liu
M Burger
M Guttman
M Racioppi
MC Tsai
Mingwei Chen
ND James
O Wapinski
P Ma
PK Puvvula
PP Amaral
Qiaoxia Zhang
R Siegel
R Siegel
T Gutschner
Weiren Huang
Wen Xu
Xiaojuan Sun
Xiaoying Chen
Y Liu
Y Zhan
Yonghao Zhan
Yuchen Liu
Zhicong Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Digital Repository @ Iowa State University (ISU)

Detection of gene orthology from gene co-expression and protein interaction networks

Background Ortholog detection methods present a powerful approach for finding genes that participate in similar biological processes across different organisms, extending our understanding of interactions between genes across different pathways, and understanding the evolution of gene families. Results We exploit features derived from the alignment of protein-protein interaction networks and gene-coexpression networks to reconstruct KEGG orthologs for Drosophila melanogaster, Saccharomyces cerevisiae, Mus musculus and Homo sapiens protein-protein interaction networks extracted from the DIP repository and Mus musculus and Homo sapiens and Sus scrofa gene coexpression networks extracted from NCBI\u27s Gene Expression Omnibus using the decision tree, Naive-Bayes and Support Vector Machine classification algorithms. Conclusions The performance of our classifiers in reconstructing KEGG orthologs is compared against a basic reciprocal BLAST hit approach. We provide implementations of the resulting algorithms as part of BiNA, an open source biomolecular network alignment toolkit

Springer - Publisher Connector

Gene duplication and phenotypic changes in the evolution of Mammalian metabolic networks

Author: A Clarke
A Wagner
B Papp
BE Stranger
CR White
E Chautard
E Gonzalez
EJ Masoro
FA Kondrashov
G Basler
Gavin C. Conant
GB West
GC Conant
GC Conant
GH Perry
GP Wagner
H Kitano
H Ma
I Bratic
I Wapinski
I. King Jordan
J Ihmels
JF Gillooly
JJ McElwee
JP de Magalhaes
JP de Magalhaes
JR Speakman
JR Speakman
KE Jones
KM Wooden
KR Clarke
L Fontana
L Guarente
L Kuepfer
M Bekaert
M Blüher
M Bonafe
M Hall
M Huss
M Messer
M Pigliucci
MI McCarthy
Michaël Bekaert
N Le Novere
NC Duarte
NJ Isaac
O Ebenhoh
P Flicek
P Langer
P Monaghan
PJ Rousseeuw
PS Agutter
RH Houtkooper
S Nair
S Selvarasu
SD Hursting
T Hao
T Nakamura
TBL Kirkwood
TF Mackay
U Sauer
W Fontana
WJ Murphy
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

Metabolic networks attempt to describe the complete suite of biochemical reactions available to an organism. One notable feature of these networks in mammals is the large number of distinct proteins that catalyze the same reaction. While the existence of these isoenzymes has long been known, their evolutionary significance is still unclear. Using a phylogenetically-aware comparative genomics approach, we infer enzyme orthology networks for sixteen mammals as well as for their common ancestors. We find that the pattern of isoenzymes copy-number alterations (CNAs) in these networks is suggestive of natural selection acting on the retention of certain gene duplications. When further analyzing these data with a machine-learning approach, we found that that the pattern of CNAs is also predictive of several important phenotypic traits, including milk composition and geographic range. Integrating tools from network analyses, phylogenetics and comparative genomics both allows the prediction of phenotypes from genetic data and represents a means of unifying distinct biological disciplines

Stirling Online Research Repository (RIOXX)