Search CORE

613 research outputs found

Topological network alignment uncovers biological function and phylogeny

Author: Cook S.
Flannick J.
Kuchaiev O.
Kuchaiev O.
Memišević V.
Nataša Pržulj
Oleksii Kuchaiev
Pržulj N.
Singh R.
Singh R.
Snijders T. A.
Tijana Milenković
Vesna Memišević
Wayne Hayes
Wentz-Hunter K.
Zhang Y.
Publication venue
Publication date: 07/10/2009
Field of study

Sequence comparison and alignment has had an enormous impact on our understanding of evolution, biology, and disease. Comparison and alignment of biological networks will likely have a similar impact. Existing network alignments use information external to the networks, such as sequence, because no good algorithm for purely topological alignment has yet been devised. In this paper, we present a novel algorithm based solely on network topology, that can be used to align any two networks. We apply it to biological networks to produce by far the most complete topological alignments of biological networks to date. We demonstrate that both species phylogeny and detailed biological function of individual proteins can be extracted from our alignments. Topology-based alignments have the potential to provide a completely new, independent source of phylogenetic information. Our alignment of the protein-protein interaction networks of two very different species--yeast and human--indicate that even distant species share a surprising amount of network topology with each other, suggesting broad similarities in internal cellular wiring across all life on Earth.Comment: Algorithm explained in more details. Additional analysis adde

arXiv.org e-Print Archive

Crossref

PubMed Central

UCL Discovery

Efficiency and Power as a Function of Sequence Coverage, SNP Array Density, and Imputation

Author: Citation Flannick
David Altshuler
David Altshuler
Eric Banks
Eric Banks
George B. Grant
George B. Grant
Jason Flannick
Joshua M. Korn
Joshua M. Korn
Mark A. Depristo
Mark A. Depristo
Pierre Fontanillas
Pierre Fontanillas
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

High coverage whole genome sequencing provides near complete information about genetic variation. However, other technologies can be more efficient in some settings by (a) reducing redundant coverage within samples and (b) exploiting patterns of genetic variation across samples. To characterize as many samples as possible, many genetic studies therefore employ lower coverage sequencing or SNP array genotyping coupled to statistical imputation. To compare these approaches individually and in conjunction, we developed a statistical framework to estimate genotypes jointly from sequence reads, array intensities, and imputation. In European samples, we find similar sensitivity (89%) and specificity (99.6%) from imputation with either 1× sequencing or 1 M SNP arrays. Sensitivity is increased, particularly for low-frequency polymorphisms (MAF <5%), when low coverage sequence reads are added to dense genome-wide SNP arrays — the converse, however, is not true. At sites where sequence reads and array intensities produce different sample genotypes, joint analysis reduces genotype errors and identifies novel error modes. Our joint framework informs the use of next-generation sequencing in genome wide association studies and supports development of improved methods for genotype calling

CiteSeerX

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

Using multiple alignments to improve seeded local alignment algorithms

Author: Batzoglou Serafim
Flannick Jason
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

Multiple alignments among genomes are becoming increasingly prevalent. This trend motivates the development of tools for efficient homology search between a query sequence and a database of multiple alignments. In this paper, we present an algorithm that uses the information implicit in a multiple alignment to dynamically build an index that is weighted most heavily towards the promising regions of the multiple alignment. We have implemented Typhon, a local alignment tool that incorporates our indexing algorithm, which our test results show to be more sensitive than algorithms that index only a sequence. This suggests that when applied on a whole-genome scale, Typhon should provide improved homology searches in time comparable to existing algorithms

CiteSeerX

Crossref

PubMed Central

Network Archaeology: Uncovering Ancient Networks from Present-day Interactions

Author: A Ahmed
A Kreimer
A Mithani
A Vazquez
A Vázquez
A Wagner
AC Gavin
AL Barabási
B Manna
BP Kelley
C Tantipathananandh
C Wiuf
Carl Kingsford
DJ de Solla Price
DJ Watts
DS Callaway
E Sprinzak
ED Levy
F Guo
F Hormozdiari
G Palla
H Ebel
H Huang
HA Simon
HB Fraser
I Bezáková
I Ispolatov
I Ispolatov
J Bar-Ilan
J Dutkowski
J Felsenstein
J Flannick
J Golbeck
J Hopcroft
J Leskovec
J Leskovec
J Leskovec
J Leskovec
J Leskovec
JB Pereira-Leal
JB Pereira-Leal
Joel S. Bader
JW Pinney
JW Thornton
L Hakes
LA Goodman
M Middendorf
P Shannon
R Kumar
R Milo
R Singh
RL Tatusov
S Hanneke
S Kerrien
S Li
S Navlakha
S Redner
Saket Navlakha
T Makino
TA Gibson
U Güldener
WK Kim
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 30/08/2010
Field of study

Often questions arise about old or extinct networks. What proteins interacted in a long-extinct ancestor species of yeast? Who were the central players in the Last.fm social network 3 years ago? Our ability to answer such questions has been limited by the unavailability of past versions of networks. To overcome these limitations, we propose several algorithms for reconstructing a network's history of growth given only the network as it exists today and a generative model by which the network is believed to have evolved. Our likelihood-based method finds a probable previous state of the network by reversing the forward growth model. This approach retains node identities so that the history of individual nodes can be tracked. We apply these algorithms to uncover older, non-extant biological and social networks believed to have grown via several models, including duplication-mutation with complementarity, forest fire, and preferential attachment. Through experiments on both synthetic and real-world data, we find that our algorithms can estimate node arrival times, identify anchor nodes from which new nodes copy links, and can reveal significant features of networks that have long since disappeared.Comment: 16 pages, 10 figure

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Directory of Open Access Journals

PubMed Central

Recommended from our members

Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes.

Author: Afaq Saima
Afzal Shoaib
Ahlqvist Emma
Almgren Peter
Amin Najaf
An Ping
Bang Lia B
Bertoni Alain G
Bielak Lawrence F
Bombieri Cristina
Bork-Jensen Jette
Brandslund Ivan
Brody Jennifer A
Burtt Noël P
Canouil Mickaël
Chen Yii-Der Ida
Cho Yoon Shin
Christensen Cramer
Chu Audrey Y
Cook James P
de Haan Hugoline G
Demirkan Ayse
Eastwood Sophie V
Eckardt Kai-Uwe
ExomeBP Consortium
Fischer Krista
Flannick Jason
Gambaro Giovanni
Gan Wei
GIANT Consortium
Giedraitis Vilmantas
Graff Marielisa
Grarup Niels
Grove Megan L
Guo Xiuqing
Gustafsson Stefan
Hackinger Sophie
Hai Yang
Han Sohee
Highland Heather M
Hivert Marie-France
Hu Yao
Huo Shaofeng
Isomaa Bo
Jensen Richard A
Justice Anne E
Jäger Susanne
Jørgensen Marit E
Jørgensen Torben
Kim Bong-Jo
Kim Sung Soo
Kim Young Jin
Kitajima Hidetoshi
Koistinen Heikki A
Kovacs Peter
Kravic Jasmina
Kriebel Jennifer
Kronenberg Florian
Käräjämäki Annemari
Lange Leslie A
Lecoeur Cécile
Lee Jung-Jin
Lehne Benjamin
Li Huaixing
Li Jin
Li Man
Li-Gao Ruifang
Ligthart Symen
Lin Keng-Hung
Liu Dajiang J
Lohman Kurt K
Lu Yingchang
Läll Kristi
MAGIC Consortium
Mahajan Anubha
Malerba Giovanni
Marouli Eirini
Marten Jonathan
Meidtner Karina
Müller-Nurasyid Martina
Peloso Gina Marie
Preuss Michael
Prins Bram Peter
Rayner N William
Robertson Neil R
Rybin Denis V
Smith Albert Vernon
Steinthorsdottir Valgerdur
Tajes Juan Fernandez
Taliun Daniel
Trubetskoy Vassily Vladimirovich
Tybjærg-Hansen Anne
Varga Tibor V
Warren Helen R
Wessel Jennifer
Willems Sara M
Wuttke Matthias
Yaghootkar Hanieh
Zhang Weihua
Zhao Wei
Publication venue: eScholarship, University of California
Publication date: 01/04/2018
Field of study

We aggregated coding variant data for 81,412 type 2 diabetes cases and 370,832 controls of diverse ancestry, identifying 40 coding variant association signals (P < 2.2 × 10-7); of these, 16 map outside known risk-associated loci. We make two important observations. First, only five of these signals are driven by low-frequency variants: even for these, effect sizes are modest (odds ratio ≤1.29). Second, when we used large-scale genome-wide association data to fine-map the associated variants in their regional context, accounting for the global enrichment of complex trait associations in coding sequence, compelling evidence for coding variant causality was obtained for only 16 signals. At 13 others, the associated coding variants clearly represent 'false leads' with potential to generate erroneous mechanistic inference. Coding variant associations offer a direct route to biological insight for complex diseases and identification of validated therapeutic targets; however, appropriate mechanistic inference requires careful specification of their causal contribution to disease predisposition

eScholarship - University of California

Optimizing a global alignment of protein interaction networks

Author: Aebersold
Aladağ
Bader
Barabási
Berg
Bonnie Berger
Breitkreutz
Cheng-Yu Ma
Chindelevitch
Chung-Shou Liao
Croes
Csardi
Dutkowski
Flannick
Formont-Racine
Galil
Gavin
Guo
Hagberg
Han
Higham
Ho
Hubbard
Ito
Johnson
Kalaev
Kelley
Kelley
Keshava Prasad
Komili
Koyutürk
Kuchaiev
Kuchaiev
Kuhn
Lawler
Leonid Chindelevitch
Liao
Lindqvist
Ma
Mano
Memišević
Park
Patro
Przulj
Sahni
Salwinski
Sharan
Singh
Srinivasan
Tan
Uetz
Zaslavskiy
Zhang
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/07/2013
Field of study

Motivation: The global alignment of protein interaction networks is a widely studied problem. It is an important first step in understanding the relationship between the proteins in different species and identifying functional orthologs. Furthermore, it can provide useful insights into the species’ evolution. Results: We propose a novel algorithm, PISwap, for optimizing global pairwise alignments of protein interaction networks, based on a local optimization heuristic that has previously demonstrated its effectiveness for a variety of other intractable problems. PISwap can begin with different types of network alignment approaches and then iteratively adjust the initial alignments by incorporating network topology information, trading it off for sequence information. In practice, our algorithm efficiently refines other well-studied alignment techniques with almost no additional time cost. We also show the robustness of the algorithm to noise in protein interaction data. In addition, the flexible nature of this algorithm makes it suitable for different applications of network alignment. This algorithm can yield interesting insights into the evolutionary dynamics of related species. Availability: Our software is freely available for non-commercial purposes from our Web site, http://piswap.csail.mit.edu/.National Institutes of Health (U.S.) (Grant GM081871

DSpace@MIT

Crossref

PubMed Central

Spiral - Imperial College Digital Repository