Search CORE

908 research outputs found

Interrogating domain-domain interactions with parsimony based approaches

Author: A Bateman
AC Gavin
AJ Enright
BA Shoemaker
C Chothia
C von Mering
CM Deane
D Ekman
E Sprinzak
EM Marcotte
EV Koonin
G Butland
H Lee
H Wang
J Felsenstein
JR Bock
JR Nevins
JS Bader
Katia S Guimarães
KS Guimaraes
L Giot
L Lo Conte
L Salwinski
M Deng
M Gerstein
M Hayashida
MA Ikeda
NJ Krogan
OZ Peng K Vucetic S.
P Uetz
R Jothi
R Mrowka
R Riley
RD Finn
RX Luo
S Li
SA Teichmann
SM Gomez
SP Chellappan
T Ito
T Le Gall
Teresa M Przytycka
TM Nye
X Cheng
Y Ho
Z Itzhaki
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background The identification and characterization of interacting domain pairs is an important step towards understanding protein interactions. In the last few years, several methods to predict domain interactions have been proposed. Understanding the power and the limitations of these methods is key to the development of improved approaches and better understanding of the nature of these interactions. Results Building on the previously published Parsimonious Explanation method (PE) to predict domain-domain interactions, we introduced a new Generalized Parsimonious Explanation (GPE) method, which (i) adjusts the granularity of the domain definition to the granularity of the input data set and (ii) permits domain interactions to have different costs. This allowed for preferential selection of the so-called "co-occurring domains" as possible mediators of interactions between proteins. The performance of both variants of the parsimony method are competitive to the performance of the top algorithms for this problem even though parsimony methods use less information than some of the other methods. We also examined possible enrichment of co-occurring domains and homo-domains among domain interactions mediating the interaction of proteins in the network. The corresponding study was performed by surveying domain interactions predicted by the GPE method as well as by using a combinatorial counting approach independent of any prediction method. Our findings indicate that, while there is a considerable propensity towards these special domain pairs among predicted domain interactions, this overrepresentation is significantly lower than in the iPfam dataset. Conclusion The Generalized Parsimonious Explanation approach provides a new means to predict and study domain-domain interactions. We showed that, under the assumption that all protein interactions in the network are mediated by domain interactions, there exists a significant deviation of the properties of domain interactions mediating interactions in the network from that of iPfam data.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DOMINE: a comprehensive collection of known and predicted domain-domain interactions

Author: Apic
Asba Tasneem
Balaji Raghavachari
Chen
Deng
Dmitri V. Zaykin
Finn
Guimaraes
Guimaraes
Jothi
Kann
Lee
Liu
Ng
Nye
Pagel
Pawson
Raghavachari
Raja Jothi
Riley
Sailu Yellaboina
Schuster-Bockler
Shoemaker
Singhal
Sprinzak
Stein
Wang
Zhao
Publication venue: Oxford University Press
Publication date
Field of study

DOMINE is a comprehensive collection of known and predicted domain–domain interactions (DDIs) compiled from 15 different sources. The updated DOMINE includes 2285 new domain–domain interactions (DDIs) inferred from experimentally characterized high-resolution three-dimensional structures, and about 3500 novel predictions by five computational approaches published over the last 3 years. These additions bring the total number of unique DDIs in the updated version to 26 219 among 5140 unique Pfam domains, a 23% increase compared to 20 513 unique DDIs among 4346 unique domains in the previous version. The updated version now contains 6634 known DDIs, and features a new classification scheme to assign confidence levels to predicted DDIs. DOMINE will serve as a valuable resource to those studying protein and domain interactions. Most importantly, DOMINE will not only serve as an excellent reference to bench scientists testing for new interactions but also to bioinformaticans seeking to predict novel protein–protein interactions based on the DDIs. The contents of the DOMINE are available at http://domine.utdallas.edu

Crossref

PubMed Central

Predicting domain-domain interactions using a parsimony approach

Author: Guimarães Katia S
Jothi Raja
Przytycka Teresa M
Zotenko Elena
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

We propose a novel approach to predict domain-domain interactions from a protein-protein interaction network. In our method we apply a parsimony-driven explanation of the network, where the domain interactions are inferred using linear programming optimization, and false positives in the protein network are handled by a probabilistic construction. This method outperforms previous approaches by a considerable margin. The results indicate that the parsimony principle provides a correct approach for detecting domain-domain contacts

Springer - Publisher Connector

PubMed Central

A Top-Down Approach to Infer and Compare Domain-Domain Interactions across Eight Model Organisms

Author: Guda Chittibabu
Guda Purnima
King Brian R.
Pal Lipika R.
Publication venue: Public Library of Science
Publication date: 31/03/2009
Field of study

Knowledge of specific domain-domain interactions (DDIs) is essential to understand the functional significance of protein interaction networks. Despite the availability of an enormous amount of data on protein-protein interactions (PPIs), very little is known about specific DDIs occurring in them. Here, we present a top-down approach to accurately infer functionally relevant DDIs from PPI data. We created a comprehensive, non-redundant dataset of 209,165 experimentally-derived PPIs by combining datasets from five major interaction databases. We introduced an integrated scoring system that uses a novel combination of a set of five orthogonal scoring features covering the probabilistic, evolutionary, evidence-based, spatial and functional properties of interacting domains, which can map the interacting propensity of two domains in many dimensions. This method outperforms similar existing methods both in the accuracy of prediction and in the coverage of domain interaction space. We predicted a set of 52,492 high-confidence DDIs to carry out cross-species comparison of DDI conservation in eight model species including human, mouse, Drosophila, C. elegans, yeast, Plasmodium, E. coli and Arabidopsis. Our results show that only 23% of these DDIs are conserved in at least two species and only 3.8% in at least 4 species, indicating a rather low conservation across species. Pair-wise analysis of DDI conservation revealed a ‘sliding conservation’ pattern between the evolutionarily neighboring species. Our methodology and the high-confidence DDI predictions generated in this study can help to better understand the functional significance of PPIs at the modular level, thus can significantly impact further experimental investigations in systems biology research

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

InSite: a computational method for identifying protein-protein interaction binding sites on a proteome-wide scale

Author: Asa Ben-hur
Daphne Koller
Eran Segal
Eran Segal
Haidong Wang
Haidong Wang
Marc Vidal
Marc Vidal
Qianru Li
Qianru Li
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

InSite is a computational method that integrates high-throughput protein and sequence data to infer the specific binding regions of interacting protein pairs

CiteSeerX

Crossref

Harvard University - DASH

Springer - Publisher Connector

PubMed Central

Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences

Author: A Ben-Hur
A Ben-Hur
A Henschel
A-C Gavin
AJ Enright
AK Ramani
AS Aytuna
C-C Chang
C-S Goh
D Betel
D Juan
E Akiva
E Sprinzak
ED Levy
EM Marcotte
F Pazos
F Pazos
H Lee
H Li
H Wang
H Yu
J Espadaler
J Guo
J Shen
J Wojcik
J-F Rual
JP Miller
JR Bock
K Guimaraes
K Tarassov
K-C Chou
L Burger
L Giot
L Salwinski
M Ashburner
M Deng
M Iqbal
M Pellegrini
M Singhal
NJ Krogan
P Aloy
P Uetz
R Jansen
R Riley
R Sharan
S Li
S Martin
S Peri
S Pitre
S Pitre
S Zanivan
S-E Schelhorn
S-K Ng
SM Gomez
T Dandekar
T Hastie
T Ito
T-T Soong
TMW Nye
U Stelzl
W Li
WK Kim
WK Kim
X-W Chen
Y Guo
Y Liu
Yungki Park
Publication venue: BioMed Central
Publication date: 01/12/2009
Field of study

Abstract Background Protein-protein interactions underlie many important biological processes. Computational prediction methods can nicely complement experimental approaches for identifying protein-protein interactions. Recently, a unique category of sequence-based prediction methods has been put forward - unique in the sense that it does not require homologous protein sequences. This enables it to be universally applicable to all protein sequences unlike many of previous sequence-based prediction methods. If effective as claimed, these new sequence-based, universally applicable prediction methods would have far-reaching utilities in many areas of biology research. Results Upon close survey, I realized that many of these new methods were ill-tested. In addition, newer methods were often published without performance comparison with previous ones. Thus, it is not clear how good they are and whether there are significant performance differences among them. In this study, I have implemented and thoroughly tested 4 different methods on large-scale, non-redundant data sets. It reveals several important points. First, significant performance differences are noted among different methods. Second, data sets typically used for training prediction methods appear significantly biased, limiting the general applicability of prediction methods trained with them. Third, there is still ample room for further developments. In addition, my analysis illustrates the importance of complementary performance measures coupled with right-sized data sets for meaningful benchmark tests. Conclusions The current study reveals the potentials and limits of the new category of sequence-based protein-protein interaction prediction methods, which in turn provides a firm ground for future endeavours in this important area of contemporary bioinformatics.</p

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

Organization of Physical Interactomes as Uncovered by Network Schemas

Author: A Barabasi
A Bateman
A Ferro
A Inokuchi
A Rives
A Tong
B Breitkreutz
B Dost
B Kelley
Bernard Chazelle
Burkhard Rost
C Stark
D Cook
E Banks
E Bornberg-Bauer
E Hong
E Patton
E Sprinzak
E Sprinzak
E Yeger-Lotem
EI Boyle
Elena Nabieva
Eric Banks
J Flannick
J Fong
J Huan
J Huan
J Pandey
J Ptacek
J Scott
J Wojcik
JD Han
K Guimaraes
K Mitsui
L Despons
L Giot
L Hartwell
L Kiemer
L Zhang
M Ashburner
M Deng
M Kuramochi
M Kuramochi
M Stefen
Mona Singh
N Luscombe
O Garcia
P Chomez
P Kim
P Kim
P Pagel
P Shannon
R Milo
R Pinter
R Riley
R Sharan
R Sharan
R Singh
S Ghazizadeh
S Gomez
S Maslov
S Pao
S Shen-Orr
S Wuchty
T Ito
T Lee
T Nye
T Pawson
T Przytycka
T Sandmann
T Shlomi
U Sivars
V Lacroix
V Neduva
V Spirin
W He
X Yang
X Zhu
Y Tian
Z Itzhaki
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Large-scale protein-protein interaction networks provide new opportunities for understanding cellular organization and functioning. We introduce network schemas to elucidate shared mechanisms within interactomes. Network schemas specify descriptions of proteins and the topology of interactions among them. We develop algorithms for systematically uncovering recurring, over-represented schemas in physical interaction networks. We apply our methods to the S. cerevisiae interactome, focusing on schemas consisting of proteins described via sequence motifs and molecular function annotations and interacting with one another in one of four basic network topologies. We identify hundreds of recurring and over-represented network schemas of various complexity, and demonstrate via graph-theoretic representations how more complex schemas are organized in terms of their lower-order constituents. The uncovered schemas span a wide range of cellular activities, with many signaling and transport related higher-order schemas. We establish the functional importance of the schemas by showing that they correspond to functionally cohesive sets of proteins, are enriched in the frequency with which they have instances in the H. sapiens interactome, and are useful for predicting protein function. Our findings suggest that network schemas are a powerful paradigm for organizing, interrogating, and annotating cellular networks

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Contextual Specificity in Peptide-Mediated Protein Interactions

Author: Aloy Patrick
Stein Amelie
Publication venue: Public Library of Science
Publication date: 02/07/2008
Field of study

Most biological processes are regulated through complex networks of transient protein interactions where a globular domain in one protein recognizes a linear peptide from another, creating a relatively small contact interface. Although sufficient to ensure binding, these linear motifs alone are usually too short to achieve the high specificity observed, and additional contacts are often encoded in the residues surrounding the motif (i.e. the context). Here, we systematically identified all instances of peptide-mediated protein interactions of known three-dimensional structure and used them to investigate the individual contribution of motif and context to the global binding energy. We found that, on average, the context is responsible for roughly 20% of the binding and plays a crucial role in determining interaction specificity, by either improving the affinity with the native partner or impeding non-native interactions. We also studied and quantified the topological and energetic variability of interaction interfaces, finding a much higher heterogeneity in the context residues than in the consensus binding motifs. Our analysis partially reveals the molecular mechanisms responsible for the dynamic nature of peptide-mediated interactions, and suggests a global evolutionary mechanism to maximise the binding specificity. Finally, we investigated the viability of non-native interactions and highlight cases of potential cross-reaction that might compensate for individual protein failure and establish backup circuits to increase the robustness of cell networks

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

One Explanation Does Not Fit All The Promise of Interactive Explanations for Machine Learning Transparency

Author: Flach Peter
Sokol Kacper
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/01/2020
Field of study

The need for transparency of predictive systems based on Machine Learning algorithms arises as a consequence of their ever-increasing proliferation in the industry. Whenever black-box algorithmic predictions influence human affairs, the inner workings of these algorithms should be scrutinised and their decisions explained to the relevant stakeholders, including the system engineers, the system's operators and the individuals whose case is being decided. While a variety of interpretability and explainability methods is available, none of them is a panacea that can satisfy all diverse expectations and competing objectives that might be required by the parties involved. We address this challenge in this paper by discussing the promises of Interactive Machine Learning for improved transparency of black-box systems using the example of contrastive explanations -- a state-of-the-art approach to Interpretable Machine Learning. Specifically, we show how to personalise counterfactual explanations by interactively adjusting their conditional statements and extract additional explanations by asking follow-up "What if?" questions. Our experience in building, deploying and presenting this type of system allowed us to list desired properties as well as potential limitations, which can be used to guide the development of interactive explainers. While customising the medium of interaction, i.e., the user interface comprising of various communication channels, may give an impression of personalisation, we argue that adjusting the explanation itself and its content is more important. To this end, properties such as breadth, scope, context, purpose and target of the explanation have to be considered, in addition to explicitly informing the explainee about its limitations and caveats...Comment: Published in the Kunstliche Intelligenz journal, special issue on Challenges in Interactive Machine Learnin

arXiv.org e-Print Archive

Explore Bristol Research

Mining Protein-Protein Interactions at Domain and Residue Levels by Machine Learning Methods

Author: Le Thi Tu, Kien
レーティートゥーキエン
Publication venue
Publication date: 30/09/2013
Field of study

13301甲第4027号博士（工学）金沢大学博士論文要旨Abstrac

Kanazawa University Repository for Academic Resources