Search CORE

16 research outputs found

New methods to measure residues coevolution in proteins

Author: Dou Yongchao
Gao Hongyun
Wang Jun
Yang Jialiang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The covariation of two sites in a protein is often used as the degree of their coevolution. To quantify the covariation many methods have been developed and most of them are based on residues position-specific frequencies by using the mutual information (MI) model. Results In the paper, we proposed several new measures to incorporate new biological constraints in quantifying the covariation. The first measure is the mutual information with the amino acid background distribution (MIB), which incorporates the amino acid background distribution into the marginal distribution of the MI model. The modification is made to remove the effect of amino acid evolutionary pressure in measuring covariation. The second measure is the mutual information of residues physicochemical properties (MIP), which is used to measure the covariation of physicochemical properties of two sites. The third measure called MIBP is proposed by applying residues physicochemical properties into the MIB model. Moreover, scores of our new measures are applied to a robust indicator <it>conn(k) </it>in finding the covariation signal of each site. Conclusions We find that incorporating amino acid background distribution is effective in removing the effect of evolutionary pressure of amino acids. Thus the MIB measure describes more biological background information for the coevolution of residues. Besides, our analysis also reveals that the covariation of physicochemical properties is a new aspect of coevolution information.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Epitope mapping using combinatorial phage-display libraries: a graph-based algorithm

Author: Alon
Barbas
Burritt
Chen
De Groot
Durbin
Enshell-Seijffers
Eytan Ruppin
Goldsby
Grantham
Halperin
Henikoff
Irving
Itay Mayrose
Jonathan M. Gershoni
Jones
Kotz
Kwong
Lang
Madabushi
Miller
Moreau
Mumey
Neuvirth
Nimrod
Nimrod D. Rubinstein
Padlan
Pizzi
Pupko
Rickles
Riemer
Roded Sharan
Schreiber
Shlomi
Sobolev
Sussman
Takenaka
Tal Pupko
Tomer Shlomi
Tsodikov
Villard
Westwood
Publication venue: Oxford University Press
Publication date: 06/12/2006
Field of study

A phage-display library of random peptides is a combinatorial experimental technique that can be harnessed for studying antibody–antigen interactions. In this technique, a phage peptide library is scanned against an antibody molecule to obtain a set of peptides that are bound by the antibody with high affinity. This set of peptides is regarded as mimicking the genuine epitope of the antibody's interacting antigen and can be used to define it. Here we present PepSurf, an algorithm for mapping a set of affinity-selected peptides onto the solved structure of the antigen. The problem of epitope mapping is converted into the task of aligning a set of query peptides to a graph representing the surface of the antigen. The best match of each peptide is found by aligning it against virtually all possible paths in the graph. Following a clustering step, which combines the most significant matches, a predicted epitope is inferred. We show that PepSurf accurately predicts the epitope in four cases for which the epitope is known from a solved antibody–antigen co-crystal complex. We further examine the capabilities of PepSurf for predicting other types of protein–protein interfaces. The performance of PepSurf is compared to other available epitope mapping programs

Crossref

PubMed Central

Using Shifts in Amino Acid Frequency and Substitution Rate to Identify Latent Structural Characters in Base-Excision Repair Enzymes

Author: A Bateman
A del Sol
A Gutteridge
A Gutteridge
A Marchler-Bauer
A Stamatakis
AB Robertson
AH Elcock
AN Barclay
AR Panchenko
AT Laurie
B Kolaczkowski
B Reva
C Branden
C Notredame
DA Kraut
DE Pumo
DM Standley
DO Zharkov
DO Zharkov
DO Zharkov
DP Brown
DR Caffrey
DT Jones
E Deu
E Hodis
E Martz
E Youn
EA Gaucher
EC Friedberg
F Coste
G Casari
G Golan
G Nimrod
GJ Naylor
H Hirano
I Mihalek
IN Sarkar
J Felsenstein
J Ko
J Pei
JA Capra
JA Capra
JC Fromme
Jeffrey P. Bond
JM Koshi
K Imamura
K Katoh
K Pereira de Jesus
KD Pruitt
KP Peters
KY Kropachev
L Rabow
LA Mirny
LE Limbird
M Clamp
M Guharoy
M Landau
M Rogacheva
M Saparbaev
M Sugahara
MJ Ondrechen
N Galtier
N Miyatake
NV Petrova
O Lichtarge
O Rahat
O Schueler-Furman
OM Sidorkina
OV Kalinina
P Aloy
P Amara
P Lio
P Lopez
P Marttinen
Q Cheng
R Gilboa
R Landgraf
RA George
Ramiro Barrantes-Reynolds
S Ahmad
S Burgess
S Doublie
S Gribaldo
S Henikoff
S Madabushi
S Sankararaman
S Wolfram
SD Kathe
Sebastian D. Fugmann
SF Altschul
SS Hannenhalli
SS Wallace
Susan S. Wallace
SV Kuznetsov
V Bandaru
V Ruano-Rubio
WL Delano
X Gu
X Gu
X Gu
X Gu
X Gu
X Gu
X Gu
Z Yang
Publication venue: Public Library of Science
Publication date: 06/10/2011
Field of study

Protein evolution includes the birth and death of structural motifs. For example, a zinc finger or a salt bridge may be present in some, but not all, members of a protein family. We propose that such transitions are manifest in sequence phylogenies as concerted shifts in substitution rates of amino acids that are neighbors in a representative structure. First, we identified rate shifts in a quartet from the Fpg/Nei family of base excision repair enzymes using a method developed by Xun Gu and coworkers. We found the shifts to be spatially correlated, more precisely, associated with a flexible loop involved in bacterial Fpg substrate specificity. Consistent with our result, sequences and structures provide convincing evidence that this loop plays a very different role in other family members. Second, then, we developed a method for identifying latent protein structural characters (LSC) given a set of homologous sequences based on Gu's method and proximity in a high-resolution structure. Third, we identified LSC and assigned states of LSC to clades within the Fpg/Nei family of base excision repair enzymes. We describe seven LSC; an accompanying Proteopedia page (http://proteopedia.org/wiki/index.php/Fpg_Nei_Protein_Family) describes these in greater detail and facilitates 3D viewing. The LSC we found provided a surprisingly complete picture of the interaction of the protein with the DNA capturing familiar examples, such as a Zn finger, as well as more subtle interactions. Their preponderance is consistent with an important role as phylogenetic characters. Phylogenetic inference based on LSC provided convincing evidence of independent losses of Zn fingers. Structural motifs may serve as important phylogenetic characters and modeling transitions involving structural motifs may provide a much deeper understanding of protein evolution

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

An association-adjusted consensus deleterious scheme to classify homozygous Mis-sense mutations for personal genome interpretation

Author: Greg Gibson
Thanawadee Preeprem
Publication venue: Springer Nature
Publication date: 23/12/2013
Field of study

BACKGROUND: Personal genome analysis is now being considered for evaluation of disease risk in healthy individuals, utilizing both rare and common variants. Multiple scores have been developed to predict the deleteriousness of amino acid substitutions, using information on the allele frequencies, level of evolutionary conservation, and averaged structural evidence. However, agreement among these scores is limited and they likely over-estimate the fraction of the genome that is deleterious. METHOD: This study proposes an integrative approach to identify a subset of homozygous non-synonymous single nucleotide polymorphisms (nsSNPs). An 8-level classification scheme is constructed from the presence/absence of deleterious predictions combined with evidence of association with disease or complex traits. Detailed literature searches and structural validations are then performed for a subset of homozygous 826 mis-sense mutations in 575 proteins found in the genomes of 12 healthy adults. RESULTS: Implementation of the Association-Adjusted Consensus Deleterious Scheme (AACDS) classifies 11% of all predicted highly deleterious homozygous variants as most likely to influence disease risk. The number of such variants per genome ranges from 0 to 8 with no significant difference between African and Caucasian Americans. Detailed analysis of mutations affecting the APOE, MTMR2, THSB1, CHIA, αMyHC, and AMY2A proteins shows how the protein structure is likely to be disrupted, even though the associated phenotypes have not been documented in the corresponding individuals. CONCLUSIONS: The classification system for homozygous nsSNPs provides an opportunity to systematically rank nsSNPs based on suggestive evidence from annotations and sequence-based predictions. The ranking scheme, in-depth literature searches, and structural validations of highly prioritized mis-sense mutations compliment traditional sequence-based approaches and should have particular utility for the development of individualized health profiles. An online tool reporting the AACDS score for any variant is provided at the authors’ website

Springer - Publisher Connector

PubMed Central

Classifying RNA-Binding Proteins Based on Electrostatic Properties

Author: A Andreeva
A Lingel
A Szilagyi
AG Murzin
BL Staker
BM Lunde
C Maris
CE Felder
CH Ding
CZ Cai
D Tworowski
DE Brodersen
DE Draper
EW Stawiski
F Bono
F Cazals
G Bejerano
G Nimrod
GB Robb
HP Shanahan
I Friedberg
I Guyon
IB Kuznetsov
J Cavarelli
JD Keene
JJ Ellis
JR Bock
JS Mattick
JS Parker
L Corsini
L ElAntak
L Wang
LY Han
M Ruff
M Terribilini
M Terribilini
N Bhardwaj
NM Luscombe
P Sampath
P Sanchez-Diaz
PM Dehe
R Karchin
RN De Guzman
S Ahmad
S Ahmad
S Jones
S Jones
S Jones
S Ramaswamy
S Shazman
Shula Shazman
T Burckin
U Hobohm
Uwe Ohler
X Yang
X Yu
Y Chen
Y Hargous
Y Mandel-Gutfreund
Y Nakamura
Y Xing
Yael Mandel-Gutfreund
YC Chen
YD Cai
YD Cai
YD Cai
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Protein structure can provide new insight into the biological function of a protein and can enable the design of better experiments to learn its biological roles. Moreover, deciphering the interactions of a protein with other molecules can contribute to the understanding of the protein's function within cellular processes. In this study, we apply a machine learning approach for classifying RNA-binding proteins based on their three-dimensional structures. The method is based on characterizing unique properties of electrostatic patches on the protein surface. Using an ensemble of general protein features and specific properties extracted from the electrostatic patches, we have trained a support vector machine (SVM) to distinguish RNA-binding proteins from other positively charged proteins that do not bind nucleic acids. Specifically, the method was applied on proteins possessing the RNA recognition motif (RRM) and successfully classified RNA-binding proteins from RRM domains involved in protein–protein interactions. Overall the method achieves 88% accuracy in classifying RNA-binding proteins, yet it cannot distinguish RNA from DNA binding proteins. Nevertheless, by applying a multiclass SVM approach we were able to classify the RNA-binding proteins based on their RNA targets, specifically, whether they bind a ribosomal RNA (rRNA), a transfer RNA (tRNA), or messenger RNA (mRNA). Finally, we present here an innovative approach that does not rely on sequence or structural homology and could be applied to identify novel RNA-binding proteins with unique folds and/or binding motifs

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Functional region prediction with a set of appropriate homologous sequences-an index for sequence selection by integrating structure and sequence information with spatial statistics

Author: Hiroyuki Toh
Wataru Nemoto
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Crossref

Springer - Publisher Connector

Computer-aided identification of the binding sites of protein-ligand complexes

Author: Lumipuu Markus
Publication venue: University of Eastern Finland
Publication date
Field of study

UEF Electronic Publications

Recommended from our members

Molecular characterization and evolutionary plasticity of protein-protein interfaces

Author: Bickerton George Richard James
Publication venue: University of Cambridge
Publication date: 01/01/2010
Field of study

Abstract The sequencing of the human genome provides the parts list for understanding cellular processes. However, as 70% of eukaryotic genes work through multi-protein systems, it is only through detailed study of the interactions of these components, that a more complete, systems-level understanding can be gained. This thesis is centred on the establishment of PICCOLO - a comprehensive database of structurally characterized protein interactions. In generating the resource, issues of interface definition, quaternary structure, data redundancy, structural environment and interaction type are addressed. The resource enables a variety of analyses to be performed concerning interface properties including residue propensity, hydropathy, polarity, interface size, sequence entropy and residue contact preference. PICCOLO has been applied to probing the patterns of substitutions that are accepted in protein interfaces across evolution, and whether these patterns are distinguishable from those seen in other structural environments. The derivation of a high-quality set of multiple structural alignments in the form of the database TOCCATA, a prerequisite for such analysis, is described, as well as procedures to derive environment-specific substitution tables. The Blundell group has contributed a series of methods to predict the likely effect of non-synonymous Single Nucleotide Polymorphisms (nsSNPs) on protein stability, function and interactions in order to triage the large volumes of data created from high-throughput genetic screening studies, enabling prioritization of those nsSNPs most likely to be phenotypically detrimental. PICCOLO's contribution to these predictions is described. Historically there has been little focus on protein-protein interactions as drug targets for small-molecule therapeutics. However, alanine-scanning mutagenesis studies have revealed that only a subset of residues contribute the greater part of free energy to binding - so-called "hot-spots". Molecular characterization of hot-spots performed using PICCOLO, probes the molecular basis underlying this important phenomenon leading to the possibility of predictive methods to identify hot-spots 'in silico'

Apollo (Cambridge)

OpenGrey Repository

A broadly applicable artificial selection system for biomolecule evolution

Author: Selles Vidal Lara
Publication venue: Life Sciences, Imperial College London
Publication date: 01/03/2020
Field of study

Biocatalysis offers an attractive alternative to traditional chemical catalysis. However, it is often found that an enzyme with the optimal properties for a specific application is not available within the natural repertoire of enzymes. It is then desirable to obtain an improved variant by altering the sequence of a known enzyme, in a process known as protein engineering. Directed evolution is one of the most powerful tools for protein engineering. In directed evolution, the process of natural evolution is mimicked in the laboratory at a much shorter timescale and selecting for properties that make the enzyme (or any other type of biomolecule) more suitable for an application of human interest. The main bottleneck of directed evolution is the identification of the desired variants amongst a majority of variants without the sought altered or improved property. Selection approaches link the desired activity to an increased survival rate or improved growth. While in principle such methodologies allow for ultra high-throughput analysis of libraries, most selection techniques have a limited scope, and can only be applied to a relatively reduced set of biomolecules or properties. This thesis presents the most broadly-applicable artificial selection system for the evolution of biomolecules ever reported. The selection platform is based on an engineered E. coli strain with impaired regeneration of NAD+, causing a conditional growth defect during anaerobic fermentation. By directly or indirectly linking the activity of the biomolecules of interest to the oxidation of NADH, cells can be rescued from this growth defect. The efficacy of such selection system has been demonstrated by using it to select alcohol dehydrogenase, imine reductase and nitroreductase variants with altered or enhanced catalytic properties, as well as an isopropanol-producing metabolic pathway with optimised regulatory elements leading to a maximised yield of isopropanol. These results confirm the wide scope of the developed selection system, which can replace conventional screening currently used in many cases of direct relevance for industrial processes. Increasing the throughput of the variant search process by many orders of magnitude will lead to the discovery of novel biomolecules and accelerate the implementation of biocatalysis.Open Acces

Spiral - Imperial College Digital Repository