Search CORE

Public Library of Science (PLOS)

Cohesive versus Flexible Evolution of Functional Modules in Eukaryotes

Author: A Hirsh
AC Gavin
AH Singh
B Adamcsek
B Snel
Berend Snel
BJ Monahan
Christian von Mering
E Neher
ESGA Snitkin
F Pazos
GV Glazko
HM Bourbon
HW Mewes
L Li
Like Fokkens
M Ashburner
M Campillos
M Kroiss
M Pellegrini
M Remm
MA Huynen
NJ Krogan
OX Cordero
P Aloy
P Shannon
P Smits
R Jothi
R Tatusov
S Collins
T Gabaldon
T Rognes
T Tanaka
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

Although functionally related proteins can be reliably predicted from phylogenetic profiles, many functional modules do not seem to evolve cohesively according to case studies and systematic analyses in prokaryotes. In this study we quantify the extent of evolutionary cohesiveness of functional modules in eukaryotes and probe the biological and methodological factors influencing our estimates. We have collected various datasets of protein complexes and pathways in Saccheromyces cerevisiae. We define orthologous groups on 34 eukaryotic genomes and measure the extent of cohesive evolution of sets of orthologous groups of which members constitute a known complex or pathway. Within this framework it appears that most functional modules evolve flexibly rather than cohesively. Even after correcting for uncertain module definitions and potentially problematic orthologous groups, only 46% of pathways and complexes evolve more cohesively than random modules. This flexibility seems partly coupled to the nature of the functional module because biochemical pathways are generally more cohesively evolving than complexes

The ADAMTS (A Disintegrin and Metalloproteinase with Thrombospondin motifs) family

Author: A Colige
A Didangelos
A Luque
A Moncada-Pazos
A Noel
A Tan Ide
AC Jonsson-Rylander
AC Nicholson
AJ Fosang
B Joe
BH Koo
BH Koo
BV Nusgens
C Casal
C Esselens
C Gendron
C Kintakas
C Lopez-Otin
C Ricciardelli
CB Kern
CG Viloria
CH Mjaatvedt
CJ Liu
CJ Liu
CL Jacobi
CM Dancevic
CN Brocker
CR Flannery
D Wagsater
DL Russell
DL Silver
DR Edwards
DR McCulloch
DT Jones
Dylan R Edwards
EA Lin
EC Arner
ED Karagiannis
EM Majerus
F Guo
F Vazquez
FG Brunet
FX Gomis-Ruth
FX Gomis-Ruth
G Gao
G Gao
G Hashimoto
G Murphy
GC Choi
GE Lind
GG Levy
GJ Wayne
Grant N Wheeler
H Enomoto
H Jin
H Stanton
H Stanton
HL Lung
HM Brown
I Abbaszade
I Peluso
Ines Desanlis
J Dubail
J Dubail
J Felsenstein
J Hofsteenge
J Huxley-Jones
J Huxley-Jones
J Morales
JA Pyun
JC Adams
JC Rodriguez-Manzaneque
JC Rodriguez-Manzaneque
JC Rodriguez-Manzaneque
JD Sandy
JM Longpre
K Demircan
K Fujikawa
K Gopalakrishnan
K Kuno
K Kuno
K Stankunas
K Tamura
K Yamamoto
K Yamamoto
L Angerer
L Faivre
L Mittaz
L Mosyak
L Troeberg
L Troeberg
L Troeberg
L Wagstaff
LA Collins-Racie
LE Dupuis
LM Ricketts
M Cudic
M Hour El
M Kashiwagi
M Krampert
M Llamazares
MA Aldahmesh
MD Tortorella
MN Vankemmelbeke
N Dagoneau
N Hattori
N Rocks
N Rocks
N Stupka
NH Lim
NV Lee
NV Lee
P Wang
PH Lo
PJ Koshy
PS Chockalingam
R Kelwick
RC Salter
RH Song
Richard Kelwick
RL Robker
RP Somerville
RP Somerville
RS Patel
S Abdul-Majeed
S Gerhardt
S Kumar
S Kumar
S Nandadasa
S Porter
S Porter
SS Apte
SS Glasson
T Fontanil
T Shindo
TN Wight
U Auf Dem Keller
W Du
W Zeng
WE Kutz
WM Wang
X Lu
X Pu
XL Zheng
YJ Liu
YP Hsu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

The ADAMTS (A Disintegrin and Metalloproteinase with Thrombospondin motifs) enzymes are secreted, multi-domain matrix-associated zinc metalloendopeptidases that have diverse roles in tissue morphogenesis and patho-physiological remodeling, in inflammation and in vascular biology. The human family includes 19 members that can be sub-grouped on the basis of their known substrates, namely the aggrecanases or proteoglycanases (ADAMTS1, 4, 5, 8, 9, 15 and 20), the procollagen N-propeptidases (ADAMTS2, 3 and 14), the cartilage oligomeric matrix protein-cleaving enzymes (ADAMTS7 and 12), the von-Willebrand Factor proteinase (ADAMTS13) and a group of orphan enzymes (ADAMTS6, 10, 16, 17, 18 and 19). Control of the structure and function of the extracellular matrix (ECM) is a central theme of the biology of the ADAMTS, as exemplified by the actions of the procollagen-N-propeptidases in collagen fibril assembly and of the aggrecanases in the cleavage or modification of ECM proteoglycans. Defects in certain family members give rise to inherited genetic disorders, while the aberrant expression or function of others is associated with arthritis, cancer and cardiovascular disease. In particular, ADAMTS4 and 5 have emerged as therapeutic targets in arthritis. Multiple ADAMTSs from different sub-groupings exert either positive or negative effects on tumorigenesis and metastasis, with both metalloproteinase-dependent and -independent actions known to occur. The basic ADAMTS structure comprises a metalloproteinase catalytic domain and a carboxy-terminal ancillary domain, the latter determining substrate specificity and the localization of the protease and its interaction partners; ancillary domains probably also have independent biological functions. Focusing primarily on the aggrecanases and proteoglycanases, this review provides a perspective on the evolution of the ADAMTS family, their links with developmental and disease mechanisms, and key questions for the future

Spiral - Imperial College Digital Repository

University of East Anglia digital repository

Efficient Identification of Critical Residues Based Only on Protein Structure by Network Analysis

Author: A del Sol
A del Sol Mesa
A Mittermaier
AH Elcock
AR Poteete
B Thibert
Boris Thibert
C Mattos
D Christendat
D Devos
D Eisenberg
D Pal
D Rennell
Dale E. Bredesen
DD Loeb
F Glaser
F Pazos
G Amitai
Gabriel del Rio
HM Berman
J Dundas
JM Thornton
K Suhre
L Holm
M Phillippsen
M Vendruscolo
MA Weiss
Michael P. Cusack
RA George
RI Sadreyev
Richard Copley
SJ Benkovic
W Huang
Publication venue: Public Library of Science
Publication date: 01/05/2007
Field of study

Despite the increasing number of published protein structures, and the fact that each protein's function relies on its three-dimensional structure, there is limited access to automatic programs used for the identification of critical residues from the protein structure, compared with those based on protein sequence. Here we present a new algorithm based on network analysis applied exclusively on protein structures to identify critical residues. Our results show that this method identifies critical residues for protein function with high reliability and improves automatic sequence-based approaches and previous network-based approaches. The reliability of the method depends on the conformational diversity screened for the protein of interest. We have designed a web site to give access to this software at http://bis.ifc.unam.mx/jamming/. In summary, a new method is presented that relates critical residues for protein function with the most traversed residues in networks derived from protein structures. A unique feature of the method is the inclusion of the conformational diversity of proteins in the prediction, thus reproducing a basic feature of the structure/function relationship of proteins

Public Library of Science (PLOS)

Hal - Université Grenoble Alpes

Supervised multivariate analysis of sequence groups to identify specificity determining residues

Author: A Carro
A del Sol Mesa
AC Culhane
AC Culhane
AR Fersht
CD Livingstone
CL Tucker
D Charif
Desmond G Higgins
DG Higgins
DH Morgan
E Beitz
F Pazos
G Casari
G Zhang
H Yao
HM Wilks
Iain M Wallace
J Thioulouse
JC Gower
JD Thompson
JG Henikoff
KM Mayer
L Yuan
LA Mirny
M Clamp
N Saitou
O Lichtarge
OV Kalinina
OV Kalinina
RC Gentleman
RD Finn
RJ Edwards
S Dolédec
S Henikoff
SJ Hubbard
SS Hannenhalli
TD Schneider
V Vacic
W Pirovano
WR Atchley
X Gu
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specifities using multiple sequence alignments. Results We demonstrate the usefulness of this method on three different test cases. Two of these test cases, the Lactate/Malate dehydrogenase family and Nucleotidyl Cyclases, consist of two functional groups. The other family, Serine Proteases consists of three groups. BGA was used to analyse and visualise these three families using two different encoding schemes for the amino acids. Conclusion This overall combination of methods in this paper is powerful and flexible while being computationally very fast and simple. BGA is especially useful because it can be used to analyse any number of functional classes. In the examples we used in this paper, we have only used 2 or 3 classes for demonstration purposes but any number can be used and visualised.</p

Automated functional classification of experimental and predicted protein structures

Author: A Andreeva
A Godzik
A Stark
AG Murzin
AR Ortiz
B Zhang
D Fischer
D Pal
D Xu
EC Webb
EF Pettersen
F Pazos
GJ Bartlett
H Hegyi
HM Berman
IN Shindyalov
J Gough
JA Di Gennaro
JC Whisstock
JD Thompson
JD Watson
JM Bujnicki
JM Bujnicki
JM Chandonia
JS Fetrow
JV Ponomarenko
K Ginalski
K Ginalski
K Ginalski
K Pawlowski
K Wang
Kai Wang
L Liao
L Rychlewski
L Rychlewski
L Xie
LH Hung
LH Hung
M Ashburner
MJ Ondrechen
N Nagano
N Nagano
R Kuang
Ram Samudrala
S Cheek
SE Brenner
SF Altschul
SK Burley
SR Eddy
WR Pearson
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Proteins that are similar in sequence or structure may perform different functions in nature. In such cases, function cannot be inferred from sequence or structural similarity. RESULTS: We analyzed experimental structures belonging to the Structural Classification of Proteins (SCOP) database and showed that about half of them belong to multi-functional fold families for which protein similarity alone is not adequate to assign function. We also analyzed predicted structures from the LiveBench and the PDB-CAFASP experiments and showed that accurate homology-based functional assignments cannot be achieved approximately one third of the time, when the protein is a member of a multi-functional fold family. We then conducted extended performance evaluation and comparisons on both experimental and predicted structures using our Functional Signatures from Structural Alignments (FSSA) algorithm that we previously developed to handle the problem of classifying proteins belonging to multi-functional fold families. CONCLUSION: The results indicate that the FSSA algorithm has better accuracy when compared to homology-based approaches for functional classification of both experimental and predicted protein structures, in part due to its use of local, as opposed to global, information for classifying function. The FSSA algorithm has also been implemented as a webserver and is available at

A new approach to assess and predict the functional roles of proteins across all known structures

Author: A Bairoch
A Medrano-Soto
A Preumont
AG Murzin
AS Juncker
B Rost
BH Dessailly
C Radauer
CF Schaefer
D Devos
D Lee
D Pal
D Petrey
D Yarullina
Elchin S. Julfayev
EM Marcotte
F Pazos
H Takahashi
HM Berman
I Friedberg
I Levin
J Benach
JS Richardson
JU Bowie
L Aravind
L Jaroszewski
L Xie
M Ashburner
M Chruszcz
M Kanehisa
M Levitt
P Yue
PD Karp
R Nair
R Rentzsch
RA Laskowski
RD Finn
RE Schapire
RL Marsden
RM Ward
Ryan J. McLaughlin
S Singh
SF Altschul
SK Burley
TC Terwilliger
VA McKusick
William A. McLaughlin
Yi-Ping Tao
YYA Godzik
Publication venue: Springer Netherlands
Publication date: 01/01/2011
Field of study

The three dimensional atomic structures of proteins provide information regarding their function; and codified relationships between structure and function enable the assessment of function from structure. In the current study, a new data mining tool was implemented that checks current gene ontology (GO) annotations and predicts new ones across all the protein structures available in the Protein Data Bank (PDB). The tool overcomes some of the challenges of utilizing large amounts of protein annotation and measurement information to form correspondences between protein structure and function. Protein attributes were extracted from the Structural Biology Knowledgebase and open source biological databases. Based on the presence or absence of a given set of attributes, a given protein’s functional annotations were inferred. The results show that attributes derived from the three dimensional structures of proteins enhanced predictions over that using attributes only derived from primary amino acid sequence. Some predictions reflected known but not completely documented GO annotations. For example, predictions for the GO term for copper ion binding reflected used information a copper ion was known to interact with the protein based on information in a ligand interaction database. Other predictions were novel and require further experimental validation. These include predictions for proteins labeled as unknown function in the PDB. Two examples are a role in the regulation of transcription for the protein AF1396 from Archaeoglobus fulgidus and a role in RNA metabolism for the protein psuG from Thermotoga maritima

Characterization of pathogenic germline mutations in human Protein Kinases

Author: A Baudot
A Fiser
A Hamosh
A Rausell
A Torkamani
A Zankl
Alfonso Valencia
Andrew CR Martin
Anja Baresic
C Ferrer-Costa
C Ferrer-Costa
C Greenman
C Mao
Christine A Orengo
CT Porter
D Vitkup
E Jain
F Pazos
FS Collins
G López
G Manning
G Manning
HM Berman
I Ezkurdia
J Hurst
J Izarzugaza
J Ptacek
JA Ubersax
JDR Knight
JG Leroy
Jose MG Izarzugaza
K Garber
LD Wood
Lisa EM Hopcroft
M Caceres
M Krallinger
MJ Karkkainen
P Taillon-Miller
P Yue
PA Futreal
R Fonseca
RC Edgar
S Forbes
SF Altschul
ST Sherry
VG Krishnan
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Background Protein Kinases are a superfamily of proteins involved in crucial cellular processes such as cell cycle regulation and signal transduction. Accordingly, they play an important role in cancer biology. To contribute to the study of the relation between kinases and disease we compared pathogenic mutations to neutral mutations as an extension to our previous analysis of cancer somatic mutations. First, we analyzed native and mutant proteins in terms of amino acid composition. Secondly, mutations were characterized according to their potential structural effects and finally, we assessed the location of the different classes of polymorphisms with respect to kinase-relevant positions in terms of subfamily specificity, conservation, accessibility and functional sites. Results Pathogenic Protein Kinase mutations perturb essential aspects of protein function, including disruption of substrate binding and/or effector recognition at family-specific positions. Interestingly these mutations in Protein Kinases display a tendency to avoid structurally relevant positions, what represents a significant difference with respect to the average distribution of pathogenic mutations in other protein families. Conclusions Disease-associated mutations display sound differences with respect to neutral mutations: several amino acids are specific of each mutation type, different structural properties characterize each class and the distribution of pathogenic mutations within the consensus structure of the Protein Kinase domain is substantially different to that for non-pathogenic mutations. This preferential distribution confirms previous observations about the functional and structural distribution of the controversial cancer driver and passenger somatic mutations and their use as a proxy for the study of the involvement of somatic mutations in cancer development.</p&gt

CiteSeerX

Full-text Institutional Repository of the Ruđer Bošković Institute

Enlighten

Concomitant prediction of function and fold at the domain level with GO-based profiles

Author: A Andreeva
A Valencia
B Rost
B Rost
C Notredame
CT Porter
D Devos
D Lopez
D Pal
DA de Lima Morais
Daniel Lopez
DM Martin
EM Marcotte
F Pazos
Florencio Pazos
G Apic
HM Berman
JD Watson
JM Chandonia
L Holm
L Xie
M Ashburner
M Chagoyen
M Wistrand
MN Wass
R Rentzsch
S Gotz
SC Schuster
SF Altschul
T Fawcett
T Hawkins
W Tian
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Protein docking prediction using predicted protein-protein interface

Author: A Berchanski
A Porollo
A Szilagyi
A Tovchigrechko
AA Bogan
AM Bonvin
B Huang
B Pierce
Bin Li
C Dominguez
C Zhang
CL Hutchinson
CL Lo
D Eisenberg
D Fischer
D Kozakov
D Kozakov
D La
D Schneidman-Duhovny
Daisuke Kihara
DR Caffrey
DW Ritchie
DW Ritchie
E Karaca
EJ Gardiner
EJ Gardiner
EV Pletneva
F Jiang
F Pazos
F Pazos
FK Pettit
GS Anand
H Hwang
H Neuvirth
H Tjong
H Wolfson
HA Gabb
HM Berman
HX Zhou
HX Zhou
I Andre
I Ezkurdia
I Halperin
I Halperin
I Kufareva
I Mihalek
I Res
J Esquivel-Rodriguez
J Esquivel-Rodriguez
J Janin
J Mintseris
JI Garzon
JJ Gray
JR Bradford
K Henrick
K Wiehe
L Giot
M Meyer
M Tress
MF Lensink
MH Li
N Andrusier
NA Meenan
NJ Burgoyne
O Schueler-Furman
P Aloy
P Heuser
P Uetz
R Chen
R Das
R Mendez
RB Russell
RC Edgar
RD Finn
RD Finn
RT Bradshaw
S Dhungana
S Jones
S Jones
S Liang
S Liang
S Qin
SH Speck
SJ de Vries
SJ de Vries
SR Comeau
SR Comeau
SS Negi
SY Huang
T Ito
T Lazaridis
Uniprot Consortium
V Chelliah
V Collura
V Venkatraman
W Kabsch
W Tong
WL Delano
X Li
Y Inbar
Y Shen
Z Shentu
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background Many important cellular processes are carried out by protein complexes. To provide physical pictures of interacting proteins, many computational protein-protein prediction methods have been developed in the past. However, it is still difficult to identify the correct docking complex structure within top ranks among alternative conformations. Results We present a novel protein docking algorithm that utilizes imperfect protein-protein binding interface prediction for guiding protein docking. Since the accuracy of protein binding site prediction varies depending on cases, the challenge is to develop a method which does not deteriorate but improves docking results by using a binding site prediction which may not be 100% accurate. The algorithm, named PI-LZerD (using Predicted Interface with Local 3D Zernike descriptor-based Docking algorithm), is based on a pair wise protein docking prediction algorithm, LZerD, which we have developed earlier. PI-LZerD starts from performing docking prediction using the provided protein-protein binding interface prediction as constraints, which is followed by the second round of docking with updated docking interface information to further improve docking conformation. Benchmark results on bound and unbound cases show that PI-LZerD consistently improves the docking prediction accuracy as compared with docking without using binding site prediction or using the binding site prediction as post-filtering. Conclusion We have developed PI-LZerD, a pairwise docking algorithm, which uses imperfect protein-protein binding interface prediction to improve docking accuracy. PI-LZerD consistently showed better prediction accuracy over alternative methods in the series of benchmark experiments including docking using actual docking interface site predictions as well as unbound docking cases.</p