35 research outputs found
PrePPI: a structure-informed database of protein–protein interactions
PrePPI (http://bhapp.c2b2.columbia.edu/PrePPI) is a database that combines predicted and experimentally determined protein–protein interactions (PPIs) using a Bayesian framework. Predicted interactions are assigned probabilities of being correct, which are derived from calculated likelihood ratios (LRs) by combining structural, functional, evolutionary and expression information, with the most important contribution coming from structure. Experimentally determined interactions are compiled from a set of public databases that manually collect PPIs from the literature and are also assigned LRs. A final probability is then assigned to every interaction by combining the LRs for both predicted and experimentally determined interactions. The current version of PrePPI contains ∼2 million PPIs that have a probability more than ∼0.1 of which ∼60 000 PPIs for yeast and ∼370 000 PPIs for human are considered high confidence (probability greater than 0.5). The PrePPI database constitutes an integrated resource that enables users to examine aggregate information on PPIs, including both known and potentially novel interactions, and that provides structural models for many of the PPIs
Recommended from our members
A computational interactome and functional annotation for the human proteome
We present a database, PrePPI (Predicting Protein-Protein Interactions), of more than 1.35 million predicted protein-protein interactions (PPIs). Of these at least 127,000 are expected to constitute direct physical interactions although the actual number may be much larger (~500,000). The current PrePPI, which contains predicted interactions for about 85% of the human proteome, is related to an earlier version but is based on additional sources of interaction evidence and is far larger in scope. The use of structural relationships allows PrePPI to infer numerous previously unreported interactions. PrePPI has been subjected to a series of validation tests including reproducing known interactions, recapitulating multi-protein complexes, analysis of disease associated SNPs, and identifying functional relationships between interacting proteins. We show, using Gene Set Enrichment Analysis (GSEA), that predicted interaction partners can be used to annotate a protein’s function. We provide annotations for most human proteins, including many annotated as having unknown function
The WD40 Domain Is Required for LRRK2 Neurotoxicity
BACKGROUND:Mutations in leucine-rich repeat kinase 2 (LRRK2) are the most common genetic cause of Parkinson disease (PD). LRRK2 contains an "enzymatic core" composed of GTPase and kinase domains that is flanked by leucine-rich repeat (LRR) and WD40 protein-protein interaction domains. While kinase activity and GTP-binding have both been implicated in LRRK2 neurotoxicity, the potential role of other LRRK2 domains has not been as extensively explored. PRINCIPAL FINDINGS:We demonstrate that LRRK2 normally exists in a dimeric complex, and that removing the WD40 domain prevents complex formation and autophosphorylation. Moreover, loss of the WD40 domain completely blocks the neurotoxicity of multiple LRRK2 PD mutations. CONCLUSION:These findings suggest that LRRK2 dimerization and autophosphorylation may be required for the neurotoxicity of LRRK2 PD mutations and highlight a potential role for the WD40 domain in the mechanism of LRRK2-mediated cell death
Recommended from our members
ER-mitochondria tethering by PDZD8 regulates Ca2+ dynamics in mammalian neurons
Interfaces between organelles are emerging as critical platforms for many biological responses in eukaryotic cells. In yeast, the ERMES complex is an endoplasmic reticulum (ER)-mitochondria tether composed of four proteins, three of which contain a SMP (synaptotagmin-like mitochondrial-lipid binding protein) domain. No functional ortholog for any ERMES protein has been identified in metazoans. Here, we identified PDZD8 as an ER protein present at ER-mitochondria contacts. The SMP domain of PDZD8 is functionally orthologous to the SMP domain found in yeast Mmm1. PDZD8 was necessary for the formation of ER-mitochondria contacts in mammalian cells. In neurons, PDZD8 was required for calcium ion (Ca2+) uptake by mitochondria after synaptically induced Ca2+-release from ER and thereby regulated cytoplasmic Ca2+ dynamics. Thus, PDZD8 represents a critical ER-mitochondria tethering protein in metazoans. We suggest that ER-mitochondria coupling is involved in the regulation of dendritic Ca2+ dynamics in mammalian neurons
Using Structure to Explore the Sequence Alignment Space of Remote Homologs
Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting the applicability of modeling methods. A central problem is that the alignment that is “optimal” in terms of the DP score does not necessarily correspond to the alignment that produces the most accurate structural model. That is, the correct alignment based on structural superposition will generally have a lower score than the optimal alignment obtained from sequence. Variations of the DP algorithm have been developed that generate alternative alignments that are “suboptimal” in terms of the DP score, but these still encounter difficulties in detecting the correct structural alignment. We present here a new alternative sequence alignment method that relies heavily on the structure of the template. By initially aligning the query sequence to individual fragments in secondary structure elements and combining high-scoring fragments that pass basic tests for “modelability”, we can generate accurate alignments within a small ensemble. Our results suggest that the set of sequences that can currently be modeled by homology can be greatly extended
Genetic Drivers of Kidney Defects in the DiGeorge Syndrome
Background The DiGeorge syndrome, the most common of the microdeletion syndromes, affects multiple organs, including the heart, the nervous system, and the kidney. It is caused by deletions on chromosome 22q11.2; the genetic driver of the kidney defects is unknown. Methods We conducted a genomewide search for structural variants in two cohorts: 2080 patients with congenital kidney and urinary tract anomalies and 22,094 controls. We performed exome and targeted resequencing in samples obtained from 586 additional patients with congenital kidney anomalies. We also carried out functional studies using zebrafish and mice. Results We identified heterozygous deletions of 22q11.2 in 1.1% of the patients with congenital kidney anomalies and in 0.01% of population controls (odds ratio, 81.5; P=4.5×10(-14)). We localized the main drivers of renal disease in the DiGeorge syndrome to a 370-kb region containing nine genes. In zebrafish embryos, an induced loss of function in snap29, aifm3, and crkl resulted in renal defects; the loss of crkl alone was sufficient to induce defects. Five of 586 patients with congenital urinary anomalies had newly identified, heterozygous protein-altering variants, including a premature termination codon, in CRKL. The inactivation of Crkl in the mouse model induced developmental defects similar to those observed in patients with congenital urinary anomalies. Conclusions We identified a recurrent 370-kb deletion at the 22q11.2 locus as a driver of kidney defects in the DiGeorge syndrome and in sporadic congenital kidney and urinary tract anomalies. Of the nine genes at this locus, SNAP29, AIFM3, and CRKL appear to be critical to the phenotype, with haploinsufficiency of CRKL emerging as the main genetic driver. (Funded by the National Institutes of Health and others.)