
MINE: Module Identification in Networks

Author: A Ceol
AJ Enright
B Adamcsek
B Aranda
C Stark
DJ Watts
GD Bader
H Hu
HW Mewes
IX Leung
JD Han
Kahn Rhrissorrakrai
Kristin C Gunsalus
M Ashburner
M Boxem
M Remm
ME Newman
N Simonis
X Yan
X Yan
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Graphical models of network associations are useful for both visualizing and integrating multiple types of association data. Identifying modules, or groups of functionally related gene products, is an important challenge in analyzing biological networks. However, existing tools to identify modules are insufficient when applied to dense networks of experimentally derived interaction data. To address this problem, we have developed an agglomerative clustering method that is able to identify highly modular sets of gene products within highly interconnected molecular interaction networks. Results: MINE outperforms MCODE, CFinder, NEMO, SPICi, and MCL in identifying non-exclusive, high modularity clusters when applied to the C. elegans protein-protein interaction network. The algorithm generally achieves superior geometric accuracy and modularity for annotated functional categories. In comparison with the most closely related algorithm, MCODE, the top clusters identified by MINE are consistently of higher density and MINE is less likely to designate overlapping modules as a single unit. MINE offers a high level of granularity with a small number of adjustable parameters, enabling users to fine-tune cluster results for input networks with differing topological properties. Conclusions: MINE was created in response to the challenge of discovering high quality modules of gene products within highly interconnected biological networks. The algorithm allows a high degree of flexibility and user-customisation of results with few adjustable parameters. MINE outperforms several popular clustering algorithms in identifying modules with high modularity and obtains good overall recall and precision of functional annotations in protein-protein interaction networks from both S. cerevisiae and C. elegans.

arXiv.org e-Print Archive

Formation of regulatory modules by local sequence duplication

Author: A Stark
A Tanay
AL Halpern
AM Moses
AM Moses
AM Moses
Amos Tanay
Armita Nourmohammad
B Ondek
BP Berman
CM Bergman
CM Bergman
CT Harbison
D Gruen
D Stanojevic
DN Arnosti
DS Fields
E Segal
EE Hare
EH Davidson
EH Davidson
G Badis
G Benson
G Leung
GD Stormo
I Abnizova
J Berg
J Berg
J Monod
JM Hancock
K Thornton
L Li
M Kimura
M Kimura
M Levine
M Lynch
M Lynch
M Lässig
M Markstein
M Pachkov
M Ptashne
MC King
MD Vinces
Michael Lässig
MM Kulkarni
MS Halfon
MS Halfon
MV Katti
MZ Ludwig
MZ Ludwig
MZ Ludwig
MZ Ludwig
N Rajewsky
NE Buchler
O Berg
PW Messer
R Durbin
RJ Britten
RW Lusk
S Kullback
S Mukherjee
S Sinha
S Sinha
S Sinha
S Small
SJ Maerkl
SM Gallo
SW Doniger
V Boeva
V Mustonen
V Mustonen
V Mustonen
Z Wunderlich
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011

Turnover of regulatory sequence and function is an important part of molecular evolution. But what are the modes of sequence evolution leading to rapid formation and loss of regulatory sites? Here, we show that a large fraction of neighboring transcription factor binding sites in the fly genome have formed from a common sequence origin by local duplications. This mode of evolution is found to produce regulatory information: duplications can seed new sites in the neighborhood of existing sites. Duplicate seeds evolve subsequently by point mutations, often towards binding a different factor than their ancestral neighbor sites. These results are based on a statistical analysis of 346 cis-regulatory modules in the Drosophila melanogaster genome, and a comparison set of intergenic regulatory sequence in Saccharomyces cerevisiae. In fly regulatory modules, pairs of binding sites show significantly enhanced sequence similarity up to distances of about 50 bp. We analyze these data in terms of an evolutionary model with two distinct modes of site formation: (i) evolution from independent sequence origin and (ii) divergent evolution following duplication of a common ancestor sequence. Our results suggest that pervasive formation of binding sites by local sequence duplications distinguishes the complex regulatory architecture of higher eukaryotes from the simpler architecture of unicellular organisms.

Public Library of Science (PLOS)

Kölner UniversitätsPublikationsServer

iRefR: an R package to manipulate the iRefIndex consolidated protein interaction database

Author: A Ceol
A Clauset
A Ruepp
A Stojmirovic
AL Turinsky
Antonio Mora
B Aranda
B Turner
C Alfarano
C Stark
G Csardi
GD Bader
I Xenarios
Ian M Donaldson
J Yu
KR Brown
P Braun
P Pagel
RM Ewing
S Kerrien
S Razick
TS Keshava Prasad
U Guldener
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: The iRefIndex addresses the need to consolidate protein interaction data into a single uniform data resource. iRefR provides the user with access to this data source from an R environment. Results: The iRefR package includes tools for selecting specific subsets of interest from the iRefIndex by criteria such as organism, source database, experimental method, protein accessions and publication identifier. Data may be converted between three representations (MITAB, edgeList and graph) for use with other R packages such as igraph, graph and RBGL. The user may choose between different methods for resolving redundancies in interaction data and how n-ary data is represented. In addition, we describe a function to identify binary interaction records that possibly represent protein complexes. We show that the user choice of data selection, redundancy resolution and n-ary data representation all have an impact on graphical analysis. Conclusions: The package allows the user to control how these issues are dealt with and communicate them via an R-script written using the iRefR package; this will facilitate communication of methods, reproducibility of network analyses and further modification and comparison of methods by researchers.

NORA - Norwegian Open Research Archives

Carotid transient ischemic attacks presenting as limb-shaking syndrome: report of two cases

Author: André R. Troiano
Baquis GD
Baumgartner RW
Bogousslavsky J
Camac A
Célio Teixeira Mendonça
Firlik AD
Fisher CM
Gálvez-Jimenez
Hélio A.G. Teive
Kimber TE
Lee MS
Leira EC
Lineu C. Werneck
Merchut MP
Pedro A. Kowacs
Stark SR
Tatemichi TK
Yanagihara T
Zaidat OO
Publication venue: 'FapUNIFESP (SciELO)'

University of Toronto Research Repository

Markov clustering versus affinity propagation for the partitioning of protein interaction graphs

Author: AC Gavin
AC Gavin
AK Jain
B Alberts
BJ Frey
BJ Frey
C Stark
E Pieroni
GD Bader
H Chipman
H Yu
J MacQueen
J Vlasblom
James Vlasblom
M Blatt
ME Cusick
MJ Brusco
N Johnsson
NJ Krogan
P Shannon
R Sharan
S Bader
S Brohee
S Charbonnier
S Fields
S Lloyd
S Pu
S Pu
S van Dongen
SH Yook
Shoshana J Wodak
SR Collins
T Formosa
T Hastie
TE Ideker
Publication venue: BioMed Central
Publication date: 01/01/2009

Abstract. Background: Genome scale data on protein interactions are generally represented as large networks, or graphs, where hundreds or thousands of proteins are linked to one another. Since proteins tend to function in groups, or complexes, an important goal has been to reliably identify protein complexes from these graphs. This task is commonly executed using clustering procedures, which aim at detecting densely connected regions within the interaction graphs. There exists a wealth of clustering algorithms, some of which have been applied to this problem. One of the most successful clustering procedures in this context has been the Markov Cluster algorithm (MCL), which was recently shown to outperform a number of other procedures, some of which were specifically designed for partitioning protein interaction graphs. A novel promising clustering procedure termed Affinity Propagation (AP) was recently shown to be particularly effective, and much faster than other methods for a variety of problems, but has not yet been applied to partition protein interaction graphs. Results: In this work we compare the performance of the Affinity Propagation (AP) and Markov Clustering (MCL) procedures. To this end we derive an unweighted network of protein-protein interactions from a set of 408 protein complexes from S. cerevisiae hand-curated in-house, and evaluate the performance of the two clustering algorithms in recalling the annotated complexes. In doing so the parameter space of each algorithm is sampled in order to select optimal values for these parameters, and the robustness of the algorithms is assessed by quantifying the level of complex recall as interactions are randomly added to or removed from the network to simulate noise. To evaluate the performance on a weighted protein interaction graph, we also apply the two algorithms to the consolidated protein interaction network of S. cerevisiae, derived from genome scale purification experiments, and to versions of this network in which varying proportions of the links have been randomly shuffled. Conclusion: Our analysis shows that the MCL procedure is significantly more tolerant to noise and behaves more robustly than the AP algorithm. The advantage of MCL over AP is dramatic for unweighted protein interaction graphs, as AP displays severe convergence problems on the majority of the unweighted graph versions that we tested, whereas MCL continues to identify meaningful clusters, albeit fewer of them, as the level of noise in the graph increases. MCL thus remains the method of choice for identifying protein complexes from binary interaction networks.

Which clustering algorithm is better for predicting protein complexes?

Abstract. Background: Protein-protein interactions (PPI) play a key role in determining the outcome of most cellular processes. The correct identification and characterization of protein interactions and the networks that they comprise is critical for understanding the molecular mechanisms within the cell. Large-scale techniques such as pull down assays and tandem affinity purification are used in order to detect protein interactions in an organism. Today, relatively new high-throughput methods like yeast two hybrid, mass spectrometry, microarrays, and phage display are also used to reveal protein interaction networks. Results: In this paper we evaluated four different clustering algorithms using six different interaction datasets. We parameterized the MCL, Spectral, RNSC and Affinity Propagation algorithms and applied them to six PPI datasets produced experimentally by Yeast 2 Hybrid (Y2H) and Tandem Affinity Purification (TAP) methods. The predicted clusters, so-called protein complexes, were then compared and benchmarked with already known complexes stored in published databases. Conclusions: While results may differ upon parameterization, the MCL and RNSC algorithms seem to be more promising and more accurate at predicting PPI complexes. Moreover, they predict more complexes than other reviewed algorithms in absolute numbers. On the other hand, the spectral clustering algorithm achieves the highest valid prediction rate in our experiments. However, it is nearly always outperformed by both RNSC and MCL in terms of the geometrical accuracy, while it generates fewer valid clusters than any other reviewed algorithm. This article demonstrates various metrics to evaluate the accuracy of such predictions as they are presented in the text below. Supplementary material can be found at: http://www.bioacademy.gr/bioinformatics/projects/ppireview.htm

Open Repository and Bibliography - Luxembourg

EUR Research Repository

University of Thessaly Institutional Repository

Resolving the structure of interactomes with hierarchical agglomerative clustering

Author: A Clauset
A Clauset
A Clauset
AM Cuervo
B Ravikumar
C He
C Stark
CG Rivera
DS Goldberg
E Airoldi
G Palla
GD Bader
H Huang
H Yu
H Zhang
J Qiu
JDJ Han
JM Hofman
Joel S Bader
K Heller
K Henderson
L Royer
M Costanzo
MEJ Newman
MEJ Newman
N Mizushima
P Ye
R Kelley
RE Kass
S Fortunato
SR Pfeffer
UV Luxburg
V Spirin
WW Zachary
X Pan
Y Park
Y Qi
Yongjin Park
Publication venue: BioMed Central
Publication date: 01/01/2011

University of Liverpool Repository

Measurement of the top quark mass using the matrix element technique in dilepton final states

Author: Abazov VM
Abbott B
Acharya BS
Adams M
Adams T
Agnew JP
Alexeev GD
Alkhazov G
Alton A
Askew A
Atkins S
Augsten K
Aushev V
Aushev Y
Avila C
Badaud F
Bagby L
Baldin B
Bandurin DV
Banerjee S
Barberis E
Baringer P
Bartlett JF
Bassler U
Bazterra V
Bean A
Begalli M
Bellantoni L
Beri SB
Bernardi G
Bernhard R
Bertram I
Besancon M
Beuselinck R
Bhat PC
Bhatia S
Bhatnagar V
Blazey G
Blessing S
Bloom K
Boehnlein A
Boline D
Boos EE
Borissov G
Borysova M
Brandt A
Brandt O
Brochmann M
Brock R
Bross A
Brown D
Bu XB
Buehler M
Buescher V
Bunichev V
Burdin S
Buszello CP
Camacho-Perez E
Casey BCK
Castilla-Valdez H
Caughron S
Chakrabarti S
Chan KM
Chandra A
Chapon E
Chen G
Cho SW
Choi S
Choudhary B
Cihangir S
Claes D
Clutter J
Collaboration D0
Cooke M
Cooper WE
Corcoran M
Couderc F
Cousinou M-C
Cuth J
Cutts D
Das A
Davies G
de Jong SJ
De La Cruz-Burelo E
de Sa R Lopes
Deliot F
Demina R
Denisov D
Denisov SP
Desai S
Deterre C
DeVaughan K
Diehl HT
Diesburg M
Ding PF
Dominguez A
Dubey A
Dudko LV
Duperrin A
Dutt S
Eads M
Edmunds D
Ellison J
Elvira VD
Enari Y
Evans H
Evdokimov A
Evdokimov VN
Faure A
Feng L
Ferbel T
Fiedler F
Filthaut F
Fisher W
Fisk HE
Fortner M
Fox H
Franc J
Fuess S
Garbincius PH
Garcia-Bellido A
Garcia-Gonzalez JA
Gavrilov V
Geng W
Gerber CE
Gershtein Y
Ginther G
Gogota O
Golovanov G
Grannis PD
Greder S
Greenlee H
Grenier G
Gris Ph
Grivaz J-F
Grohsjean A
Grunendahl S
Grunewald MW
Guillemin T
Gutierrez G
Gutierrez P
Haley J
Han L
Harder K
Harel A
Hauptman JM
Hays J
Head T
Hebbeker T
Hedin D
Hegab H
Heinson AP
Heintz U
Hensel C
Heredia-De La Cruz I
Herner K
Hesketh G
Hildreth MD
Hirosky R
Hoang T
Hobbs JD
Hoeneisen B
Hogan J
Hohlfeld M
Holzbauer JL
Howley I
Hubacek Z
Hynek V
Iashvili I
Ilchenko Y
Illingworth R
Ito AS
Jabeen S
Jaffre M
Jayasinghe A
Jeong MS
Jesik R
Jiang P
Johns K
Johnson E
Johnson M
Jonckheere A
Jonsson P
Joshi J
Jung AW
Juste A
Kajfasz E
Karmanov D
Katsanos I
Kaur M
Kehoe R
Kermiche S
Khalatyan N
Khanov A
Kharchilava A
Kharzheev YN
Kiselevich I
Kohli JM
Kozelov AV
Kraus J
Kumar A
Kupco A
Kurca T
Kuzmin VA
Lammers S
Lebrun P
Lee HS
Lee SW
Lee WM
Lei X
Lellouch J
Li D
Li H
Li L
Li QZ
Lim JK
Lincoln D
Linnemann J
Lipaev VV
Lipton R
Liu H
Liu Y
Lobodenko A
Lokajicek M
Luna-Garcia R
Lyon AL
Maciel AKA
Madar R
Magana-Villalba R
Malik S
Malyshev VL
Mansour J
Martinez-Ortega J
McCarthy R
McGivern CL
Meijer MM
Melnitchouk A
Menezes D
Mercadante PG
Merkin M
Meyer A
Meyer J
Miconi F
Mondal NK
Mulhearn M
Nagy E
Narain M
Nayyar R
Neal HA
Negret JP
Neustroev P
Nguyen HT
Nunnemann T
Orduna J
Osman N
Pal A
Parashar N
Parihar V
Park SK
Partridge R
Parua N
Patwa A
Penning B
Perfilov M
Peters Y
Petridis K
Petrillo G
Petroff P
Pleier M-A
Podstavkov VM
Popov AV
Prewitt M
Price D
Prokopenko N
Qian J
Quadt A
Quinn B
Ratoff PN
Razumov I
Ripp-Baudot I
Rizatdinova F
Rominsky M
Ross A
Royon C
Rubinov P
Ruchti R
Sajot G
Sanchez-Hernandez A
Sanders MP
Santos AS
Savage G
Savitskyi M
Sawyer L
Scanlon T
Schamberger RD
Scheglov Y
Schellman H
Schott M
Schwanenberger C
Schwienhorst R
Sekaric J
Severini H
Shabalina E
Shary V
Shaw S
Shchukin AA
Simak V
Skubic P
Slattery P
Snow GR
Snow J
Snyder S
Soldner-Rembold S
Sonnenschein L
Soustruznik K
Stark J
Stefaniuk N
Stoyanova DA
Strauss M
Suter L
Svoisky P
Titov M
Tokmenin VV
Tsai Y-T
Tsybychev D
Tuchming B
Tully C
Uvarov L
Uvarov S
Uzunyan S
Van Kooten R
van Leeuwen WM
Varelas N
Varnes EW
Vasilyev IA
Verkheev AY
Vertogradov LS
Verzocchi M
Vesterinen M
Vilanova D
Vokac P
Wahl HD
Wang MHLS
Warchol J
Watts G
Wayne M
Weichert J
Welty-Rieger L
Williams MRJ
Wilson GW
Wobisch M
Wood DR
Wyatt TR
Xie Y
Yamada R
Yang S
Yasuda T
Yatsunenko YA
Ye W
Ye Z
Yin H
Yip K
Youn SW
Yu JM
Zennamo J
Zhao TG
Zhou B
Zhu J
Zielinski M
Zieminska D
Zivkovic L
Publication venue: 'American Physical Society (APS)'
Publication date: 10/06/2016

We present a measurement of the top quark mass in pp̄ collisions at a center-of-mass energy of 1.96 TeV at the Fermilab Tevatron collider. The data were collected by the D0 experiment corresponding to an integrated luminosity of 9.7 fb⁻¹. The matrix element technique is applied to tt̄ events in the final state containing leptons (electrons or muons) with high transverse momenta and at least two jets. The calibration of the jet energy scale determined in the lepton+jets final state of tt̄ decays is applied to jet energies. This correction provides a substantial reduction in systematic uncertainties. We obtain a top quark mass of m_t = 173.93 ± 1.84 GeV.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas