Search CORE

UCL Discovery

arXiv.org e-Print Archive

Composite structural motifs of binding sites for delineating biological functions of proteins

Author: A Bairoch
A Fiorillo
A Rausell
A Stark
AC Joerger
AC Wallace
AG Murzin
Akira R. Kinjo
AM Schnoes
AR Kinjo
AR Kinjo
AR Kinjo
B Bollobás
B Dasgupta
B Louie
B Rost
BH Dessailly
C Branden
C Winter
CV Robinson
D Petrey
DJ Schuller
DM Chipman
E Krissinel
E Toyota
FP Davis
FP Davis
GM Santos
H Berman
H Kettenberger
Haruki Nakamura
I Friedberg
J Janin
J Shi
J Westbrook
JI Yeh
K Chen
K Henrick
K Kinoshita
K Kinoshita
K Kinoshita
K Okazaki
K Stenberg
L Xie
M Bashton
M Brylinski
M Kitayner
M Levitt
M Moertl
M Nardini
M Tyagi
M Yang
N Nagano
N Tuncbag
N Tuncbag
N Zhao
ND Gold
O Keskin
O Keskin
OC Redfern
Ozlem Keskin
P Cramer
P Shannon
PD Pawelek
R Koike
R Koike
R Rentzsch
R Sinha
RR Thangudu
S Kadono
SF Altschul
T Amemiya
T Kawabata
T Kawabata
TA Holland
TC Terwilliger
Y Loewenstein
Z Aung
ZX Xia
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011
Field of study

Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs which represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures.Comment: 34 pages, 7 figure

CiteSeerX

Homology Inference of Protein-Protein Interactions via Conserved Binding Sites

Author: A Marchler-Bauer
A Marchler-Bauer
A Marchler-Bauer
A Shulman-Peleg
AJ Walhout
Anna R. Panchenko
B Burgess
BA Shoemaker
BA Shoemaker
BA Shoemaker
BG Ma
BH Dessailly
D Kemmer
Dachuan Zhang
E Krissinel
E Krissinel
ED Levy
ER Jefferson
H Chen
H Neuvirth
H Yu
H Zhu
HM Berman
I Ispolatov
J Chen
J Kim
J Kirn
JE Dayhoff
JF Gibrat
K Hashimoto
K Henrick
L Xue
LR Matthews
M Gribskov
M Persico
Manoj Tyagi
MP Stumpf
N Slonim
P Aloy
P Fariselli
Q Xu
QC Zhang
Ratna R. Thangudu
RH Holm
RR Thangudu
S Henikoff
S Liang
S Mika
S Mintz
SF Altschul
Stephen H. Bryant
T Reguly
Thomas Madej
Vladimir N. Uversky
WE Newton
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The coverage and reliability of protein-protein interactions determined by high-throughput experiments still needs to be improved, especially for higher organisms, therefore the question persists, how interactions can be verified and predicted by computational approaches using available data on protein structural complexes. Recently we developed an approach called IBIS (Inferred Biomolecular Interaction Server) to predict and annotate protein-protein binding sites and interaction partners, which is based on the assumption that the structural location and sequence patterns of protein-protein binding sites are conserved between close homologs. In this study first we confirmed high accuracy of our method and found that its accuracy depends critically on the usage of all available data on structures of homologous complexes, compared to the approaches where only a non-redundant set of complexes is employed. Second we showed that there exists a trade-off between specificity and sensitivity if we employ in the prediction only evolutionarily conserved binding site clusters or clusters supported by only one observation (singletons). Finally we addressed the question of identifying the biologically relevant interactions using the homology inference approach and demonstrated that a large majority of crystal packing interactions can be correctly identified and filtered by our algorithm. At the same time, about half of biological interfaces that are not present in the protein crystallographic asymmetric unit can be reconstructed by IBIS from homologous complexes without the prior knowledge of crystal parameters of the query protein

CiteSeerX

FigShare

A new approach to assess and predict the functional roles of proteins across all known structures

Author: A Bairoch
A Medrano-Soto
A Preumont
AG Murzin
AS Juncker
B Rost
BH Dessailly
C Radauer
CF Schaefer
D Devos
D Lee
D Pal
D Petrey
D Yarullina
Elchin S. Julfayev
EM Marcotte
F Pazos
H Takahashi
HM Berman
I Friedberg
I Levin
J Benach
JS Richardson
JU Bowie
L Aravind
L Jaroszewski
L Xie
M Ashburner
M Chruszcz
M Kanehisa
M Levitt
P Yue
PD Karp
R Nair
R Rentzsch
RA Laskowski
RD Finn
RE Schapire
RL Marsden
RM Ward
Ryan J. McLaughlin
S Singh
SF Altschul
SK Burley
TC Terwilliger
VA McKusick
William A. McLaughlin
Yi-Ping Tao
YYA Godzik
Publication venue: Springer Netherlands
Publication date: 01/01/2011
Field of study

The three dimensional atomic structures of proteins provide information regarding their function; and codified relationships between structure and function enable the assessment of function from structure. In the current study, a new data mining tool was implemented that checks current gene ontology (GO) annotations and predicts new ones across all the protein structures available in the Protein Data Bank (PDB). The tool overcomes some of the challenges of utilizing large amounts of protein annotation and measurement information to form correspondences between protein structure and function. Protein attributes were extracted from the Structural Biology Knowledgebase and open source biological databases. Based on the presence or absence of a given set of attributes, a given protein’s functional annotations were inferred. The results show that attributes derived from the three dimensional structures of proteins enhanced predictions over that using attributes only derived from primary amino acid sequence. Some predictions reflected known but not completely documented GO annotations. For example, predictions for the GO term for copper ion binding reflected used information a copper ion was known to interact with the protein based on information in a ligand interaction database. Other predictions were novel and require further experimental validation. These include predictions for proteins labeled as unknown function in the PDB. Two examples are a role in the regulation of transcription for the protein AF1396 from Archaeoglobus fulgidus and a role in RNA metabolism for the protein psuG from Thermotoga maritima

Springer - Publisher Connector

Protein ligand-specific binding residue predictions by an ensemble classifier

Author: A Roy
AT Laurie
B Huang
B Panwar
BH Dessailly
BK Dukka
C Fang
C-C Chang
C-H Ngan
CH Lu
D Xu
DW Buchan
GY Wong
HR Ansari
I Mayrose
J Konc
J Yang
J Yang
J Yang
JA Capra
JA Capra
JA Horst
JS Chauhan
JS Chauhan
K Chen
K Chen
Kai Wang
L Fu
M Brylinski
N Shu
NK Mishra
P Chen
PW Rose
Q Dong
Qiwen Dong
R Liu
R Wang
S Leis
S Wu
S Wu
SF Altschul
T Gallo Cassarino
T Pupko
T Schmidt
U Consortium
V Sobolev
V Sobolev
VN Vapnik
W Nemoto
X Ma
Xiuzhen Hu
Y Freund
Z Zhang
ZR Xie
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

From protein sequences to 3D-structures and beyond: the example of the UniProt Knowledgebase

Author: A Andreeva
A Ben-Shem
A Chatr-aryamontri
A Cuff
A Gavin
A Hamosh
A Juncker
A Matte
AD Moore
AE Todd
AK Dunker
B Boeckmann
B Wollscheid
BH Dessailly
C Alfarano
C Bru
C Chothia
C Dodge
C Sala
C Yeats
CB Anfinsen
CH Wu
D Barrell
D Wilson
DA Benson
DH Haft
DH Shin
E Jain
E Zito
EA Bruford
EL Ulrich
F Chiti
F Kiefer
G Cochrane
G Lopez
G Zanotti
GE Tusnády
H Berman
H Boutselakis
H Mi
H Yu
HM Berman
I Letunic
I Xenarios
J Piatigorsky
J Rual
J Tamames
J White
JD Watson
JJW Wiltzius
JL Markley
JS Garavelli
K Degtyarenko
KD Pruitt
KD Pruitt
M Bauer
M Grabowski
M Hendlich
M Mueller
M Punta
M Revington
M Sickmeier
MA Hadders
ME Cusick
MI Ivanova
MJ Fogg
ML Benson
MR Sawaya
N Dephoure
N Farriol-Mathis
N Hulo
N Simonis
NJ Mulder
O Gileadi
OC Redfern
P Flicek
PG Bagos
R Gerber
R Nair
R Nelson
R Olson
R Rentzsch
RA Laskowski
RD Finn
S Addou
S Braconi Quintaje
S Dutta
S Hiller
S Hunter
S Kerrien
S Orchard
S Topiol
SB Long
SE Antonarakis
SI O’Donoghue
SJ Wodak
SM Johnson
ST Sherry
SW Cowan-Jacob
T Köcher
T Lima
The Uni Prot Consortium
U Pieper
Ursula Hinz
Y Jiang
Y Wang
YL Yip
Publication venue: SP Birkhäuser Verlag Basel
Publication date: 01/01/2009
Field of study

With the dramatic increase in the volume of experimental results in every domain of life sciences, assembling pertinent data and combining information from different fields has become a challenge. Information is dispersed over numerous specialized databases and is presented in many different formats. Rapid access to experiment-based information about well-characterized proteins helps predict the function of uncharacterized proteins identified by large-scale sequencing. In this context, universal knowledgebases play essential roles in providing access to data from complementary types of experiments and serving as hubs with cross-references to many specialized databases. This review outlines how the value of experimental data is optimized by combining high-quality protein sequences with complementary experimental results, including information derived from protein 3D-structures, using as an example the UniProt knowledgebase (UniProtKB) and the tools and links provided on its website (http://www.uniprot.org/). It also evokes precautions that are necessary for successful predictions and extrapolations

Springer - Publisher Connector

Serveur académique lausannois

UCL Discovery

PoPMuSiC 2.1: a web server for the estimation of protein stability changes upon mutation and sequence optimality

ABSTRACT:Journal ArticleSCOPUS: ar.jinfo:eu-repo/semantics/publishe

Springer - Publisher Connector

DI-fusion

The Vein Patterning 1 (VEP1) Gene Family Laterally Spread through an Ecological Network

Author: A Boc
A Thorn
A Wagner
AG Murzin
AS Konagurthu
AS Lang
B Becker
B McSpadden Gardener
B Persson
BA Babst
BH Dessailly
C Chothia
C Filling
C Notredame
CA Jacobi
CA Orengo
CG Kurland
CH Lecellier
CM Liba
CM Thomas
CR Woese
CX Chan
CX Chan
DE Gärtner
DE Gärtner
DK Choudhary
DL Wheeler
DM Gardiner
DR Maddison
DS Heckman
E Bapteste
E Burda
E Ferrada
E Torrents
EE Allen
F Abascal
F Armougom
F Pearl
F Rodríguez-Trelles
F Rodríguez-Trelles
Francisco J. Ayala
Francisco Rodríguez-Trelles
G Caetano-Anollés
G Emiliani
G Talavera
GA Reeves
GE Crooks
GJ Seifert
GL Holliday
H Jörnvall
H Ochman
H Reichenbach
HK Lamb
I Gavidia
I González
I Nobeli
I Schmitt
J Castresana
J Felsenstein
J Gough
J Huang
J Raes
JA Draghi
JA Ranea
JBH Martiny
JD Bloom
JE Bray
JE Stajich
JH Jun
JO Andersson
John W. Stiller
JP Gogarten
JP Huelsenbeck
K-Y Yang
KL Kavanagh
L Boto
L Holm
L Riechmann
L Roca-Pérez
L Selbmann
LA Lewis
LS Frost
M Anisimova
M Bashton
M Cardinale
M Cardinale
M Gerstein
M Grube
M Grube
M Hertzberg
M Madera
M Marcet-Houben
M Podar
MA DePristo
MA Ragan
MB Swindells
MC Rivera
MD Hendy
MG Rossmann
MJ Betts
MM Lee
N Galtier
N Goldenfeld
N Gottig
N Guex
N Tanaka
NA Moran
NU Frigaard
O Poirot
OX Cordero
P Bednarek
P Bonfante
P Pérez-Bermúdez
P Yarza
PA Grimont
PF Gherardini
PJ Keeling
PJ Keeling
PJ Lockhart
R Bock
R Sorek
R Tarrío
RD Finn
RG Beiko
RI Sadreyev
RL Tatusov
Rosa Tarrío
S Chaffron
S Guindon
S Guindon
S Gustavsson
S Kumar
S Perotto
S Rachid
S Yang
SB Hedges
SD Hooper
SF Altschul
SF Altschul
SG Ralph
SQ Le
SR Eddy
T Dagan
T Girke
T Kloesges
T Nishiyama
T Nosenko
TA Richards
TY James
V Herl
V Kunin
VID Ros
W Dawid
W Hao
W Reiter
WC Lima
WF Doolittle
X Zheng
Y Boucher
Y Kallberg
Y Kallberg
Y Nakamura
Publication venue: Public Library of Science
Publication date: 26/07/2011
Field of study

Lateral gene transfer (LGT) is a major evolutionary mechanism in prokaryotes. Knowledge about LGT— particularly, multicellular— eukaryotes has only recently started to accumulate. A widespread assumption sees the gene as the unit of LGT, largely because little is yet known about how LGT chances are affected by structural/functional features at the subgenic level. Here we trace the evolutionary trajectory of VEin Patterning 1, a novel gene family known to be essential for plant development and defense. At the subgenic level VEP1 encodes a dinucleotide-binding Rossmann-fold domain, in common with members of the short-chain dehydrogenase/reductase (SDR) protein family. We found: i) VEP1 likely originated in an aerobic, mesophilic and chemoorganotrophic α-proteobacterium, and was laterally propagated through nets of ecological interactions, including multiple LGTs between phylogenetically distant green plant/fungi-associated bacteria, and five independent LGTs to eukaryotes. Of these latest five transfers, three are ancient LGTs, implicating an ancestral fungus, the last common ancestor of land plants and an ancestral trebouxiophyte green alga, and two are recent LGTs to modern embryophytes. ii) VEP1's rampant LGT behavior was enabled by the robustness and broad utility of the dinucleotide-binding Rossmann-fold, which provided a platform for the evolution of two unprecedented departures from the canonical SDR catalytic triad. iii) The fate of VEP1 in eukaryotes has been different in different lineages, being ubiquitous and highly conserved in land plants, whereas fungi underwent multiple losses. And iv) VEP1-harboring bacteria include non-phytopathogenic and phytopathogenic symbionts which are non-randomly distributed with respect to the type of harbored VEP1 gene. Our findings suggest that VEP1 may have been instrumental for the evolutionary transition of green plants to land, and point to a LGT-mediated ‘Trojan Horse’ mechanism for the evolution of bacterial pathogenesis against plants. VEP1 may serve as tool for revealing microbial interactions in plant/fungi-associated environments

eScholarship - University of California

Development of a semi-analytical algorithm for the retrieval of suspended particulate matter from remote sensing over clear to very turbid waters

Author: Bryere P.
Dessailly D.
Han B.
Loisel Hubert
Meriaux X.
Ouillon Sylvain
Vantrepotte V.
Xing Q. G.
Zhu J. H.
Publication venue
Publication date: 01/01/2016
Field of study

Remote sensing of suspended particulate matter, SPM, from space has long been used to assess its spatio-temporal variability in various coastal areas. The associated algorithms were generally site specific or developed over a relatively narrow range of concentration, which make them inappropriate for global applications (or at least over broad SPM range). In the frame of the GlobCoast project, a large in situ data set of SPM and remote sensing reflectance, R-rs(lambda), has been built gathering together measurements from various coastal areas around Europe, French Guiana, North Canada, Vietnam, and China. This data set covers various contrasting coastal environments diversely affected by different biogeochemical and physical processes such as sediment resuspension, phytoplankton bloom events, and rivers discharges (Amazon, Mekong, Yellow river, MacKenzie, etc.). The SPM concentration spans about four orders of magnitude, from 0.15 to 2626 g center dot m(-3). Different empirical and semi-analytical approaches developed to assess SPM from R-rs(lambda) were tested over this in situ data set. As none of them provides satisfactory results over the whole SPM range, a generic semi-analytical approach has been developed. This algorithm is based on two standard semi-analytical equations calibrated for low-to-medium and highly turbid waters, respectively. A mixing law has also been developed for intermediate environments. Sources of uncertainties in SPM retrieval such as the bio-optical variability, atmospheric correction errors, and spectral bandwidth have been evaluated. The coefficients involved in these different algorithms have been calculated for ocean color (SeaWiFS, MODIS-A/T, MERIS/OLCI, VIIRS) and high spatial resolution (LandSat8-OLI, and Sentinel2-MSI) sensors. The performance of the proposed algorithm varies only slightly from one sensor to another demonstrating the great potential applicability of the proposed approach over global and contrasting coastal waters

Multidisciplinary Digital Publishing Institute