Search CORE

13 research outputs found

PrePPI: a structure-informed database of protein–protein interactions

Author: Deng Lei
Garzon Canas Jose I.
Honig Barry
Petrey Donald S.
Zhang Qiangfeng Cliff
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2012
Field of study

PrePPI (http://bhapp.c2b2.columbia.edu/PrePPI) is a database that combines predicted and experimentally determined protein–protein interactions (PPIs) using a Bayesian framework. Predicted interactions are assigned probabilities of being correct, which are derived from calculated likelihood ratios (LRs) by combining structural, functional, evolutionary and expression information, with the most important contribution coming from structure. Experimentally determined interactions are compiled from a set of public databases that manually collect PPIs from the literature and are also assigned LRs. A final probability is then assigned to every interaction by combining the LRs for both predicted and experimentally determined interactions. The current version of PrePPI contains ∼2 million PPIs that have a probability more than ∼0.1 of which ∼60 000 PPIs for yeast and ∼370 000 PPIs for human are considered high confidence (probability greater than 0.5). The PrePPI database constitutes an integrated resource that enables users to examine aggregate information on PPIs, including both known and potentially novel interactions, and that provides structural models for many of the PPIs

Crossref

Columbia University Academic Commons

PubMed Central

Recommended from our members

A computational interactome and functional annotation for the human proteome

Author: Deng Lei
Garzon Canas Jose I.
Honig Barry
Murray Diana
Petrey Donald S.
Shapira Sagi
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2016
Field of study

We present a database, PrePPI (Predicting Protein-Protein Interactions), of more than 1.35 million predicted protein-protein interactions (PPIs). Of these at least 127,000 are expected to constitute direct physical interactions although the actual number may be much larger (~500,000). The current PrePPI, which contains predicted interactions for about 85% of the human proteome, is related to an earlier version but is based on additional sources of interaction evidence and is far larger in scope. The use of structural relationships allows PrePPI to infer numerous previously unreported interactions. PrePPI has been subjected to a series of validation tests including reproducing known interactions, recapitulating multi-protein complexes, analysis of disease associated SNPs, and identifying functional relationships between interacting proteins. We show, using Gene Set Enrichment Analysis (GSEA), that predicted interaction partners can be used to annotate a protein’s function. We provide annotations for most human proteins, including many annotated as having unknown function

Columbia University Academic Commons

PubMed Central

Recommended from our members

ER-mitochondria tethering by PDZD8 regulates Ca2+ dynamics in mammalian neurons

Author: Erfani Parsa
Hirabayashi Yusuke
Kwon Seok-Kyu
Lee Jinoh
Paek Hunki
Paul Maëla A.
Pernice Wolfgang Maximilian
Petrey Donald S.
Polleux Franck
Pon Liza A.
Raczkowski Ashleigh
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2017
Field of study

Interfaces between organelles are emerging as critical platforms for many biological responses in eukaryotic cells. In yeast, the ERMES complex is an endoplasmic reticulum (ER)-mitochondria tether composed of four proteins, three of which contain a SMP (synaptotagmin-like mitochondrial-lipid binding protein) domain. No functional ortholog for any ERMES protein has been identified in metazoans. Here, we identified PDZD8 as an ER protein present at ER-mitochondria contacts. The SMP domain of PDZD8 is functionally orthologous to the SMP domain found in yeast Mmm1. PDZD8 was necessary for the formation of ER-mitochondria contacts in mammalian cells. In neurons, PDZD8 was required for calcium ion (Ca2+) uptake by mitochondria after synaptically induced Ca2+-release from ER and thereby regulated cytoplasmic Ca2+ dynamics. Thus, PDZD8 represents a critical ER-mitochondria tethering protein in metazoans. We suggest that ER-mitochondria coupling is involved in the regulation of dendritic Ca2+ dynamics in mammalian neurons

Columbia University Academic Commons

Using Structure to Explore the Sequence Alignment Space of Remote Homologs

Author: A Mac Sweeney
AG Murzin
AM Lesk
Andrew Kuziemko
AR Panchenko
AS Yang
B John
B Qian
B Rost
Barry Honig
CL Tang
D Chivian
D Eisenberg
D Kihara
D Petrey
D Petrey
Donald Petrey
DT Jones
F Melo
GJ Barton
H Chen
H Lee
H Zhou
H Zhou
HM Berman
I Friedberg
J Moult
J Shi
J Söding
JM Sauder
JU Bowie
L Jaroszewski
MA Marti-Renom
MA Saqi
MS Madhusudhan
MS Waterman
MS Waterman
N Mirkovic
NC Goonesekere
P Bork
Philip E. Bourne
R Sanchez
RB Russell
RC Edgar
S Liu
SA Benner
SB Williams
T Madej
WRP Scott
Y Zhang
Y Zhang
Y Zhang
Publication venue: Public Library of Science
Publication date: 01/10/2011
Field of study

Protein structure modeling by homology requires an accurate sequence alignment between the query protein and its structural template. However, sequence alignment methods based on dynamic programming (DP) are typically unable to generate accurate alignments for remote sequence homologs, thus limiting the applicability of modeling methods. A central problem is that the alignment that is “optimal” in terms of the DP score does not necessarily correspond to the alignment that produces the most accurate structural model. That is, the correct alignment based on structural superposition will generally have a lower score than the optimal alignment obtained from sequence. Variations of the DP algorithm have been developed that generate alternative alignments that are “suboptimal” in terms of the DP score, but these still encounter difficulties in detecting the correct structural alignment. We present here a new alternative sequence alignment method that relies heavily on the structure of the template. By initially aligning the query sequence to individual fragments in secondary structure elements and combining high-scoring fragments that pass basic tests for “modelability”, we can generate accurate alignments within a small ensemble. Our results suggest that the set of sequences that can currently be modeled by homology can be greatly extended

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Genetic Drivers of Kidney Defects in the DiGeorge Syndrome

Author: Allegri Landino
Anderson Blair R
Arapovic Adela
Barasch Jonathan M
Barton David
Bodria Monica
Capone Valentina P
Carpentier Wassila
Carrea Alba
Casolari Emilio
Crowley Terrence B
Cusi Daniele
D'Agati Vivette
Darlow John M
Deng Rong
Drozdz Dorota
Drummond Iain A
Fasel David A
Flogelova Hana
Furth Susan L
Gaillard Dominique
GESUALDO Loreto
Gharavi Ali G
Ghiggeri Gian Marco
Gillies Christopher E
Gucev Zoran
Hakonarson Hakon
Heidet Laurence
Hildebrandt Friedhelm
Honig Barry
Imamoto Akira
Izzi Claudia
Jeanpierre Cecile
Katsanis Nicholas
Kiryluk Krzysztof
Krzemien Grazyna
Kunac Nenad
Latos Bielenska Anna
Lifton Richard P
Liu Qingxue
Liu Yangfan P
Lopez Rivera Esther
Lozanovski Vladimir J
Maiorana Mariarosa
Makar Gabriel S
Martino Jeremiah
Materna Kiryluk Anna
McDonald McGinn Donna M
Miklaszewska Monika
MITROTTI ADELE
Mizerska Wasiak Malgorzata
Morrow Bernice E
Otto Edgar A
Papaioannou Virginia E
Petrey Donald S
Puri Prem
Racedo Silvia E
Salomon Rémi
Samii Ali
Sampson Matthew G
Sanna Cherchi Simone
Saraga Babic Mirna
Saraga Marijan
Scolari Francesco
Sikora Przemyslaw
Steers Nicholas J
Szczepanska Maria
Szmigielska Agnieszka
Tasic Velibor
Tkaczyk Marcin
van Wijk Joanna A. E
Vega Warner Virginia
Verbitsky Miguel
Vivante Asaf
Vukojevic Katarina
Warady Bradley A
Werth Max
Westland Rik
Wong Craig S
Yan Zhonghai
Zackai Elaine H
Zaniew Marcin
Publication venue: 'Massachusetts Medical Society'
Publication date: 01/01/2017
Field of study

Background The DiGeorge syndrome, the most common of the microdeletion syndromes, affects multiple organs, including the heart, the nervous system, and the kidney. It is caused by deletions on chromosome 22q11.2; the genetic driver of the kidney defects is unknown. Methods We conducted a genomewide search for structural variants in two cohorts: 2080 patients with congenital kidney and urinary tract anomalies and 22,094 controls. We performed exome and targeted resequencing in samples obtained from 586 additional patients with congenital kidney anomalies. We also carried out functional studies using zebrafish and mice. Results We identified heterozygous deletions of 22q11.2 in 1.1% of the patients with congenital kidney anomalies and in 0.01% of population controls (odds ratio, 81.5; P=4.5×10(-14)). We localized the main drivers of renal disease in the DiGeorge syndrome to a 370-kb region containing nine genes. In zebrafish embryos, an induced loss of function in snap29, aifm3, and crkl resulted in renal defects; the loss of crkl alone was sufficient to induce defects. Five of 586 patients with congenital urinary anomalies had newly identified, heterozygous protein-altering variants, including a premature termination codon, in CRKL. The inactivation of Crkl in the mouse model induced developmental defects similar to those observed in patients with congenital urinary anomalies. Conclusions We identified a recurrent 370-kb deletion at the 22q11.2 locus as a driver of kidney defects in the DiGeorge syndrome and in sporadic congenital kidney and urinary tract anomalies. Of the nine genes at this locus, SNAP29, AIFM3, and CRKL appear to be critical to the phenotype, with haploinsufficiency of CRKL emerging as the main genetic driver. (Funded by the National Institutes of Health and others.)

Archivio istituzionale della ricerca - Università di Bari

Predicting transmembrane beta-barrels in proteomes

Author: Bigelow Henry R.
Liu Jinfeng
Petrey Donald S.
Przybylski Dariusz
Rost Burkhard
Publication venue: Oxford University Press
Publication date: 01/01/2004
Field of study

Very few methods address the problem of predicting beta-barrel membrane proteins directly from sequence. One reason is that only very few high-resolution structures for transmembrane beta-barrel (TMB) proteins have been determined thus far. Here we introduced the design, statistics and results of a novel profile-based hidden Markov model for the prediction and discrimination of TMBs. The method carefully attempts to avoid over-fitting the sparse experimental data. While our model training and scoring procedures were very similar to a recently published work, the architecture and structure-based labelling were significantly different. In particular, we introduced a new definition of beta- hairpin motifs, explicit state modelling of transmembrane strands, and a log-odds whole-protein discrimination score. The resulting method reached an overall four-state (up-, down-strand, periplasmic-, outer-loop) accuracy as high as 86%. Furthermore, accurately discriminated TMB from non-TMB proteins (45% coverage at 100% accuracy). This high precision enabled the application to 72 entirely sequenced Gram-negative bacteria. We found over 164 previously uncharacterized TMB proteins at high confidence. Database searches did not implicate any of these proteins with membranes. We challenge that the vast majority of our 164 predictions will eventually be verified experimentally. All proteome predictions and the PROFtmb prediction method are available at http://www.rostlab.org/services/PROFtmb/

CiteSeerX

Crossref

PubMed Central

De novo missense variants in PPP2R5D are associated with intellectual disability, macrocephaly, hypotonia, and autism

Author: Asaikar Shailesh
Carmichael Jason
Cho Megan T
Chung Wendy K
Folk Leandra
Fong Chin-To
Haude Katrina M
Hauser Natalie
Henderson Lindsay B
Innis Jeffrey
Lundberg Julie
Monaghan Kristin G
Pearson Margaret
Petrey Donald S
Retterer Kyle
Schuette Jane
Shang Linshan
Shur Natasha
Wu Yvonne W
Publication venue: eScholarship, University of California
Publication date: 01/01/2016
Field of study

Protein phosphatase 2A (PP2A) is a heterotrimeric protein serine/threonine phosphatase and is involved in a broad range of cellular processes. PPP2R5D is a regulatory B subunit of PP2A and plays an important role in regulating key neuronal and developmental regulation processes such as PI3K/AKT and glycogen synthase kinase 3 beta (GSK3β)-mediated cell growth, chromatin remodeling, and gene transcriptional regulation. Using whole-exome sequencing (WES), we identified four de novo variants in PPP2R5D in a total of seven unrelated individuals with intellectual disability (ID) and other shared clinical characteristics, including autism spectrum disorder, macrocephaly, hypotonia, seizures, and dysmorphic features. Among the four variants, two have been previously reported and two are novel. All four amino acids are highly conserved among the PP2A subunit family, and all change a negatively charged acidic glutamic acid (E) to a positively charged basic lysine (K) and are predicted to disrupt the PP2A subunit binding and impair the dephosphorylation capacity. Our data provides further support for PPP2R5D as a genetic cause of ID

PubMed Central

eScholarship - University of California

Bi-allelic missense disease-causing variants in RPL3L associate neonatal dilated cardiomyopathy with muscle-specific ribosome biogenesis

Author: Ahimaz Priyanka
Argyriou Loukas
Auber Bernd
Buchovecky Christie M.
Burfeind Peter
Cabezas-Herrera Juan
Cyganek Lukas
Ganapathi Mythily
Hasenfuss Gerd
Honig Barry
Iglesias Alejandro D.
Lee Teresa M.
Li Yun
Martinez-Azorin Francisco
Morlot Susanne
Petrey Donald S.
Sabater-Molina Maria
Siegelin Markus D.
Sorli-Garcia Moises
Thiele Holger
von Gise Alexander
Wollnik Bernd
Yigit Gokhan
Zibat Arne
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Dilated cardiomyopathy (DCM) belongs to the most frequent forms of cardiomyopathy mainly characterized by cardiac dilatation and reduced systolic function. Although most cases of DCM are classified as sporadic, 20-30% of cases show a heritable pattern. Familial forms of DCM are genetically heterogeneous, and mutations in several genes have been identified that most commonly play a role in cytoskeleton and sarcomere-associated processes. Still, a large number of familial cases remain unsolved. Here, we report five individuals from three independent families who presented with severe dilated cardiomyopathy during the neonatal period. Using whole-exome sequencing (WES), we identified causative, compound heterozygous missense variants in RPL3L (ribosomal protein L3-like) in all the affected individuals. The identified variants co-segregated with the disease in each of the three families and were absent or very rare in the human population, in line with an autosomal recessive inheritance pattern. They are located within the conserved RPL3 domain of the protein and were classified as deleterious by several in silico prediction software applications. RPL3L is one of the four non-canonical riboprotein genes and it encodes the 60S ribosomal protein L3-like protein that is highly expressed only in cardiac and skeletal muscle. Three-dimensional homology modeling and in silico analysis of the affected residues in RPL3L indicate that the identified changes specifically alter the interaction of RPL3L with the RNA components of the 60S ribosomal subunit and thus destabilize its binding to the 60S subunit. In conclusion, we report that bi-allelic pathogenic variants in RPL3L are causative of an early-onset, severe neonatal form of dilated cardiomyopathy, and we show for the first time that cytoplasmic ribosomal proteins are involved in the pathogenesis of non-syndromic cardiomyopathies

Kölner UniversitätsPublikationsServer

Hochschulbibliothekszentrum des Landes Nordrhein-Westfalen (hbz)

Structure-based prediction of protein–protein interactions on a genome-wide scale

Author: Andrea Califano
AS Yang
BA Shoemaker
BA Shoemaker
Barry Honig
Brygida Bisikirska
C Lefebvre
C von Mering
C von Mering
Celine Lefebvre
Chan Aye Thu
CM Deane
D Petrey
Domenico Accili
Donald Petrey
E Krissinel
F Enault
FP Davis
G Stolovitzky
H Yu
HL Chen
HM Berman
HW Mewes
I Letunic
K Henrick
L Bonetta
L Lu
L Salwinski
L Sun
Lei Deng
Li Qiang
M Gao
M Huynen
M Levitt
M Vidal
MN Wass
N Mirkovic
N Tuncbag
O Keskin
P Aloy
P Braun
QC Zhang
QC Zhang
Qiangfeng Cliff Zhang
R Apweiler
R Jansen
R Sanchez
RM Ewing
S Liang
SF Altschul
T Barrett
T Reguly
The Gene Ontology Consortium
Tom Maniatis
Tony Hunter
U Pieper
Yu Shi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Precise parallel volumetric comparison of molecular surfaces and electrostatic isopotentials

Author: A Shrake
A Stark
A-S Yang
B Lee
BE Nolan
BY Chen
BY Chen
BY Chen
BY Chen
BY Chen
CL Bajaj
Donald Petrey
EF Pettersen
F Aurenhammer
F Ferre
F Kaiser
H Edelsbrunner
J Goldfeather
J Konc
J Venkateswaran
K Kinoshita
L Ellingson
L He
L Li
M Brylinski
M Fischer
M Moll
M Nayal
MF Sanner
ML Connolly
NL Max
RJ Morris
S Blumenthal
SL Chan
T Lu
T Madej
TA Binkowski
Tao Ju
W Rocchia
W Tian
William E. Lorensen
Y Liu
Y Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref