43 research outputs found

    Cross-Over between Discrete and Continuous Protein Structure Space: Insights into Automatic Classification and Networks of Protein Structures

    Get PDF
    Structural classifications of proteins assume the existence of the fold, which is an intrinsic equivalence class of protein domains. Here, we test in which conditions such an equivalence class is compatible with objective similarity measures. We base our analysis on the transitive property of the equivalence relationship, requiring that similarity of A with B and B with C implies that A and C are also similar. Divergent gene evolution leads us to expect that the transitive property should approximately hold. However, if protein domains are a combination of recurrent short polypeptide fragments, as proposed by several authors, then similarity of partial fragments may violate the transitive property, favouring the continuous view of the protein structure space. We propose a measure to quantify the violations of the transitive property when a clustering algorithm joins elements into clusters, and we find out that such violations present a well defined and detectable cross-over point, from an approximately transitive regime at high structure similarity to a regime with large transitivity violations and large differences in length at low similarity. We argue that protein structure space is discrete and hierarchic classification is justified up to this cross-over point, whereas at lower similarities the structure space is continuous and it should be represented as a network. We have tested the qualitative behaviour of this measure, varying all the choices involved in the automatic classification procedure, i.e., domain decomposition, alignment algorithm, similarity score, and clustering algorithm, and we have found out that this behaviour is quite robust. The final classification depends on the chosen algorithms. We used the values of the clustering coefficient and the transitivity violations to select the optimal choices among those that we tested. Interestingly, this criterion also favours the agreement between automatic and expert classifications. As a domain set, we have selected a consensus set of 2,890 domains decomposed very similarly in SCOP and CATH. As an alignment algorithm, we used a global version of MAMMOTH developed in our group, which is both rapid and accurate. As a similarity measure, we used the size-normalized contact overlap, and as a clustering algorithm, we used average linkage. The resulting automatic classification at the cross-over point was more consistent than expert ones with respect to the structure similarity measure, with 86% of the clusters corresponding to subsets of either SCOP or CATH superfamilies and fewer than 5% containing domains in distinct folds according to both SCOP and CATH. Almost 15% of SCOP superfamilies and 10% of CATH superfamilies were split, consistent with the notion of fold change in protein evolution. These results were qualitatively robust for all choices that we tested, although we did not try to use alignment algorithms developed by other groups. Folds defined in SCOP and CATH would be completely joined in the regime of large transitivity violations where clustering is more arbitrary. Consistently, the agreement between SCOP and CATH at fold level was lower than their agreement with the automatic classification obtained using as a clustering algorithm, respectively, average linkage (for SCOP) or single linkage (for CATH). The networks representing significant evolutionary and structural relationships between clusters beyond the cross-over point may allow us to perform evolutionary, structural, or functional analyses beyond the limits of classification schemes. These networks and the underlying clusters are available at http://ub.cbm.uam.es/research/ProtNet.ph

    Cerebral malaria: insights from host-parasite protein-protein interactions

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Cerebral malaria is a form of human malaria wherein <it>Plasmodium falciparum</it>-infected red blood cells adhere to the blood capillaries in the brain, potentially leading to coma and death. Interactions between parasite and host proteins are important in understanding the pathogenesis of this deadly form of malaria. It is, therefore, necessary to study available protein-protein interactions to identify lesser known interactions that could throw light on key events of cerebral malaria.</p> <p>Methods</p> <p>Sequestration, haemostasis dysfunction, systemic inflammation and neuronal damage are key processes of cerebral malaria. Key events were identified from literature as being crucial to these processes. An integrated interactome was created using available experimental and predicted datasets as well as from literature. Interactions from this interactome were filtered based on Gene Ontology and tissue-specific annotations, and further analysed for relevance to the key events.</p> <p>Results</p> <p>PfEMP1 presentation, platelet activation and astrocyte dysfunction were identified as the key events influencing the disease. 48896 host-parasite along with other host-parasite, host-host and parasite-parasite protein-protein interactions obtained from a disease-specific corpus were combined to form an integrated interactome. Filtering of the interactome resulted in five host-parasite PPI, six parasite-parasite and two host-host PPI. The analysis of these interactions revealed the potential significance of apolipoproteins and temperature/Hsp expression on efficient PfEMP1 presentation; role of MSP-1 in platelet activation; effect of parasite proteins in TGF-β regulation and the role of albumin in astrocyte dysfunction.</p> <p>Conclusions</p> <p>This work links key host-parasite, parasite-parasite and host-host protein-protein interactions to key processes of cerebral malaria and generates hypotheses for disease pathogenesis based on a filtered interaction dataset. These hypotheses provide novel and significant insights to cerebral malaria.</p

    Functional clustering of yeast proteins from the protein-protein interaction network

    Get PDF
    BACKGROUND: The abundant data available for protein interaction networks have not yet been fully understood. New types of analyses are needed to reveal organizational principles of these networks to investigate the details of functional and regulatory clusters of proteins. RESULTS: In the present work, individual clusters identified by an eigenmode analysis of the connectivity matrix of the protein-protein interaction network in yeast are investigated for possible functional relationships among the members of the cluster. With our functional clustering we have successfully predicted several new protein-protein interactions that indeed have been reported recently. CONCLUSION: Eigenmode analysis of the entire connectivity matrix yields both a global and a detailed view of the network. We have shown that the eigenmode clustering not only is guided by the number of proteins with which each protein interacts, but also leads to functional clustering that can be applied to predict new protein interactions

    Progress in computational studies of host-pathogen interactions

    No full text
    10.1142/S0219720012300018Journal of Bioinformatics and Computational Biology112JBCB
    corecore