Search CORE

5,925 research outputs found

The inference of gene trees with species trees

Author: Bastien Boussau
Eric Tannier
Gergely J. Szöllősi
Montbonnot France
Vincent Daubin
Publication venue
Publication date: 04/11/2013
Field of study

Molecular phylogeny has focused mainly on improving models for the reconstruction of gene trees based on sequence alignments. Yet, most phylogeneticists seek to reveal the history of species. Although the histories of genes and species are tightly linked, they are seldom identical, because genes duplicate, are lost or horizontally transferred, and because alleles can co-exist in populations for periods that may span several speciation events. Building models describing the relationship between gene and species trees can thus improve the reconstruction of gene trees when a species tree is known, and vice-versa. Several approaches have been proposed to solve the problem in one direction or the other, but in general neither gene trees nor species trees are known. Only a few studies have attempted to jointly infer gene trees and species trees. In this article we review the various models that have been used to describe the relationship between gene trees and species trees. These models account for gene duplication and loss, transfer or incomplete lineage sorting. Some of them consider several types of events together, but none exists currently that considers the full repertoire of processes that generate gene trees along the species tree. Simulations as well as empirical studies on genomic data show that combining gene tree-species tree models with models of sequence evolution improves gene tree reconstruction. In turn, these better gene trees provide a better basis for studying genome evolution or reconstructing ancestral chromosomes and ancestral gene sequences. We predict that gene tree-species tree methods that can deal with genomic data sets will be instrumental to advancing our understanding of genomic evolution.Comment: Review article in relation to the "Mathematical and Computational Evolutionary Biology" conference, Montpellier, 201

arXiv.org e-Print Archive

CiteSeerX

INRIA a CCSD electronic archive server

PubMed Central

HAL

Repository of the Academy's Library

ELTE Digital Institutional Repository (EDIT)

Hal-Diderot

Unified Alignment of Protein-Protein Interaction Networks

Author: Ban K
Malod-Dognin N
Przulj N
Publication venue: NATURE PUBLISHING GROUP
Publication date: 11/04/2017
Field of study

Paralleling the increasing availability of protein-protein interaction (PPI) network data, several network alignment methods have been proposed. Network alignments have been used to uncover functionally conserved network parts and to transfer annotations. However, due to the computational intractability of the network alignment problem, aligners are heuristics providing divergent solutions and no consensus exists on a gold standard, or which scoring scheme should be used to evaluate them. We comprehensively evaluate the alignment scoring schemes and global network aligners on large scale PPI data and observe that three methods, HUBALIGN, L-GRAAL and NATALIE, regularly produce the most topologically and biologically coherent alignments. We study the collective behaviour of network aligners and observe that PPI networks are almost entirely aligned with a handful of aligners that we unify into a new tool, Ulign. Ulign enables complete alignment of two networks, which traditional global and local aligners fail to do. Also, multiple mappings of Ulign define biologically relevant soft clusterings of proteins in PPI networks, which may be used for refining the transfer of annotations across networks. Hence, PPI networks are already well investigated by current aligners, so to gain additional biological insights, a paradigm shift is needed. We propose such a shift come from aligning all available data types collectively rather than any particular data type in isolation from others

UCL Discovery

Coevolved mutations reveal distinct architectures for two core proteins in the bacterial flagellar motor

Author: A Pandini
A Pandini
AC Lowenthal
Alessandro Pandini
AM Waterhouse
Anna Roujeinikova
AS Vartanian
B Ruhnau
BJ Grant
BJ Lowder
CJ Tsai
CM Dyer
D de Juan
D Stock
DL Guzman
DR Livesay
DR Thomas
DR Thomas
DS Bischoff
DT Jones
F Pazos
F Pazos
H Ashkenazy
H Ashkenazy
H Shimodaira
H Sockett
H Szurmant
HC Berg
J Friedman
J Yuan
Jens Kleinjung
JP Armitage
JP Armitage
JS Parkinson
K Paul
K Paul
K Paul
KA Reynolds
KH Lam
KH Lam
L Cavallo
LK Lee
M Punta
MK Sarkar
MN Price
NA Rosenberg
NJ Delalez
P Cluzel
PN Brown
PN Brown
Q Ma
R Saito
RC Edgar
RD Finn
RW Branch
S Chen
S Pronk
SA Lloyd
SD Dunn
Shafqat Rasool
Shahid Khan
SM Van Way
SY Park
SY Park
T Minamino
T Pilizota
TA Duke
VM Irikura
WR Taylor
WR Taylor
X Zhao
Y Tu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2015
Field of study

Switching of bacterial flagellar rotation is caused by large domain movements of the FliG protein triggered by binding of the signal protein CheY to FliM. FliG and FliM form adjacent multi-subunit arrays within the basal body C-ring. The movements alter the interaction of the FliG C-terminal (FliGC) "torque" helix with the stator complexes. Atomic models based on the Salmonella entrovar C-ring electron microscopy reconstruction have implications for switching, but lack consensus on the relative locations of the FliG armadillo (ARM) domains (amino-terminal (FliGN), middle (FliGM) and FliGC) as well as changes during chemotaxis. The generality of the Salmonella model is challenged by the variation in motor morphology and response between species. We studied coevolved residue mutations to determine the unifying elements of switch architecture. Residue interactions, measured by their coevolution, were formalized as a network, guided by structural data. Our measurements reveal a common design with dedicated switch and motor modules. The FliM middle domain (FliMM) has extensive connectivity most simply explained by conserved intra and inter-subunit contacts. In contrast, FliG has patchy, complex architecture. Conserved structural motifs form interacting nodes in the coevolution network that wire FliMM to the FliGC C-terminal, four-helix motor module (C3-6). FliG C3-6 coevolution is organized around the torque helix, differently from other ARM domains. The nodes form separated, surface-proximal patches that are targeted by deleterious mutations as in other allosteric systems. The dominant node is formed by the EHPQ motif at the FliMMFliGM contact interface and adjacent helix residues at a central location within FliGM. The node interacts with nodes in the N-terminal FliGc α-helix triad (ARM-C) and FliGN. ARM-C, separated from C3-6 by the MFVF motif, has poor intra-network connectivity consistent with its variable orientation revealed by structural data. ARM-C could be the convertor element that provides mechanistic and species diversity.JK was supported by Medical Research Council grant U117581331. SK was supported by seed funds from Lahore University of Managment Sciences (LUMS) and the Molecular Biology Consortium

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Brunel University Research Archive

FigShare

Network Archaeology: Uncovering Ancient Networks from Present-day Interactions

Author: A Ahmed
A Kreimer
A Mithani
A Vazquez
A Vázquez
A Wagner
AC Gavin
AL Barabási
B Manna
BP Kelley
C Tantipathananandh
C Wiuf
Carl Kingsford
DJ de Solla Price
DJ Watts
DS Callaway
E Sprinzak
ED Levy
F Guo
F Hormozdiari
G Palla
H Ebel
H Huang
HA Simon
HB Fraser
I Bezáková
I Ispolatov
I Ispolatov
J Bar-Ilan
J Dutkowski
J Felsenstein
J Flannick
J Golbeck
J Hopcroft
J Leskovec
J Leskovec
J Leskovec
J Leskovec
J Leskovec
JB Pereira-Leal
JB Pereira-Leal
Joel S. Bader
JW Pinney
JW Thornton
L Hakes
LA Goodman
M Middendorf
P Shannon
R Kumar
R Milo
R Singh
RL Tatusov
S Hanneke
S Kerrien
S Li
S Navlakha
S Redner
Saket Navlakha
T Makino
TA Gibson
U Güldener
WK Kim
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 30/08/2010
Field of study

Often questions arise about old or extinct networks. What proteins interacted in a long-extinct ancestor species of yeast? Who were the central players in the Last.fm social network 3 years ago? Our ability to answer such questions has been limited by the unavailability of past versions of networks. To overcome these limitations, we propose several algorithms for reconstructing a network's history of growth given only the network as it exists today and a generative model by which the network is believed to have evolved. Our likelihood-based method finds a probable previous state of the network by reversing the forward growth model. This approach retains node identities so that the history of individual nodes can be tracked. We apply these algorithms to uncover older, non-extant biological and social networks believed to have grown via several models, including duplication-mutation with complementarity, forest fire, and preferential attachment. Through experiments on both synthetic and real-world data, we find that our algorithms can estimate node arrival times, identify anchor nodes from which new nodes copy links, and can reveal significant features of networks that have long since disappeared.Comment: 16 pages, 10 figure

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Directory of Open Access Journals

PubMed Central

Methods and tools to improve performance of plant genome analysis

Author: Ferrell Drew
Publication venue: Scholars Junction
Publication date: 09/08/2022
Field of study

Multi -omics data analysis and integration facilitates hypothesis building toward an understanding of genes and pathway responses driven by environments. Methods designed to estimate and analyze gene expression, with regard to treatments or conditions, can be leveraged to understand gene-level responses in the cell. However, genes often interact and signal within larger structures such as pathways and networks. Complex studies guided toward describing dynamic genetic pathways and networks require algorithms or methods designed for inference based on gene interactions and related topologies. Classes of algorithms and methods may be integrated into generalized workflows for comparative genomics studies, as multi -omics data can be standardized between contact points in various software applications. Further, network inference or network comparison algorithmic designs may involve interchangeable operations given the structure of their implementations. Network comparison and inference methods can also guide transfer-of-knowledge between model organisms and those with less knowledge base

Scholars Junction - Mississippi State University Institutional Repository