Search CORE

80 research outputs found

Identification of HCV protease inhibitor resistance mutations by selection pressure-based method

Author: A. Skelton
Altschuh
C. Cullen
Chen
E. Xia
Ewing
Fried
Hadziyannis
Herrmann
J. Greene
Kass
Kieffer
Li
Li
Liu
Lohmann
Malcolm
Manns
Neher
Nei
P. Qiu
Prongay
Qiu
R. Ralston
S. Curry
S. Liu
TAREMI
Tong
Tong
V. Sanfiorenzo
Wong
X. Tong
Z. Guo
Zhang
Publication venue: Oxford University Press
Publication date
Field of study

A major challenge to successful antiviral therapy is the emergence of drug-resistant viruses. Recent studies have developed several automated analyses of HIV sequence polymorphism based on calculations of selection pressure (Ka/Ks) to predict drug resistance mutations. Similar resistance analysis programs for HCV inhibitors are not currently available. Taking advantage of the recently available sequence data of patient HCV samples from a Phase II clinical study of protease inhibitor boceprevir, we calculated the selection pressure for all codons in the HCV protease region (amino acid 1–181) to identify potential resistance mutations. The correlation between mutations was also calculated to evaluate linkage between any two mutations. Using this approach, we identified previously known major resistant mutations, including a recently reported mutation V55A. In addition, a novel mutation V158I was identified, and we further confirmed its resistance to boceprevir in protease enzyme and replicon assay. We also extended the approach to analyze potential interactions between individual mutations and identified three pairs of correlated changes. Our data suggests that selection pressure-based analysis and correlation mapping could provide useful tools to analyze large amount of sequencing data from clinical samples and to identify new drug resistance mutations as well as their linkage and correlations

Crossref

PubMed Central

Direct-coupling analysis of residue co-evolution captures native contacts across many protein families

Author: A. Bertolino
A. Pagnani
Altschuh
Atchley
B. Lunt
Berman
Burger
C. Sander
Campbell
D. S. Marks
Dima
Eddy
F. Morcos
Go
Gouveia-Oliveira
G bel
Halabi
Herrou
Hoch
J. N. Onuchic
Lashuel
Lee
Lockless
Lunt
M. Weigt
Maris
Miyazawa
Neher
Pai
Pasternak
Procaccini
R. Zecchina
Schug
Shindyalov
Skerker
T. Hwa
Tame
Tanaka
Tillier
White
Wisedchaisri
Wollenberg
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/01/2011
Field of study

The similarity in the three-dimensional structures of homologous proteins imposes strong constraints on their sequence variability. It has long been suggested that the resulting correlations among amino acid compositions at different sequence positions can be exploited to infer spatial contacts within the tertiary protein structure. Crucial to this inference is the ability to disentangle direct and indirect correlations, as accomplished by the recently introduced Direct Coupling Analysis (DCA) (Weigt et al. (2009) Proc Natl Acad Sci 106:67). Here we develop a computationally efficient implementation of DCA, which allows us to evaluate the accuracy of contact prediction by DCA for a large number of protein domains, based purely on sequence information. DCA is shown to yield a large number of correctly predicted contacts, recapitulating the global structure of the contact map for the majority of the protein domains examined. Furthermore, our analysis captures clear signals beyond intra- domain residue contacts, arising, e.g., from alternative protein conformations, ligand- mediated residue couplings, and inter-domain interactions in protein oligomers. Our findings suggest that contacts predicted by DCA can be used as a reliable guide to facilitate computational predictions of alternative protein conformations, protein complex formation, and even the de novo prediction of protein domain structures, provided the existence of a large number of homologous sequences which are being rapidly made available due to advances in genome sequencing.Comment: 28 pages, 7 figures, to appear in PNA

arXiv.org e-Print Archive

Crossref

PubMed Central

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

HAL-Paris1

Hal-Diderot

PORTO Publications Open Repository TOrino

Correlated Mutations: A Hallmark of Phenotypic Amino Acid Substitutions

Author: A Bairoch
A Fuchs
A Hamosh
A Lapedes
A Lupi
A Tanoue
A Tanoue
AA Fodor
Andreas Kowarsch
Angelika Fuchs
BC Lee
C von Mering
D Altschuh
D Altschuh
D Vitkup
DD Pollock
DD Pollock
Dmitrij Frishman
EE Winter
F Endo
F Pazos
GB Gloor
H Huang
HM Berman
I Feldman
I Kass
IN Shindyalov
JG Caporaso
LC Martin
M Krzywinski
M Socolich
MH Knaggs
MS Singer
N Lopez-Bigas
NGC Smith
O Noivirt
O Noivirt-Brik
O Olmea
O Olmea
P Fariselli
P Ledoux
P Tuffery
P Wong
PC Ng
PC Ng
PD Stenson
Philipp Pagel
PJ Kundrotas
RC Edgar
RE Steward
RR Gutell
S Henikoff
S Sunyaev
S Vicatos
SAA Travers
SD Dunn
SK Ng
SM Larson
T Hershkovitz
Thomas Lengauer
U Göbel
V Ramensky
W Kabsch
WP Russ
WR Taylor
ZO Wang
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Point mutations resulting in the substitution of a single amino acid can cause severe functional consequences, but can also be completely harmless. Understanding what determines the phenotypical impact is important both for planning targeted mutation experiments in the laboratory and for analyzing naturally occurring mutations found in patients. Common wisdom suggests using the extent of evolutionary conservation of a residue or a sequence motif as an indicator of its functional importance and thus vulnerability in case of mutation. In this work, we put forward the hypothesis that in addition to conservation, co-evolution of residues in a protein influences the likelihood of a residue to be functionally important and thus associated with disease. While the basic idea of a relation between co-evolution and functional sites has been explored before, we have conducted the first systematic and comprehensive analysis of point mutations causing disease in humans with respect to correlated mutations. We included 14,211 distinct positions with known disease-causing point mutations in 1,153 human proteins in our analysis. Our data show that (1) correlated positions are significantly more likely to be disease-associated than expected by chance, and that (2) this signal cannot be explained by conservation patterns of individual sequence positions. Although correlated residues have primarily been used to predict contact sites, our data are in agreement with previous observations that (3) many such correlations do not relate to physical contacts between amino acid residues. Access to our analysis results are provided at http://webclu.bio.wzw.tum.de/~pagel/supplements/correlated-positions/

Crossref

Directory of Open Access Journals

PubMed Central

PuSH

H2r: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments

Author: A del Sol Mesa
AL Barabási
B Rost
C Notredame
C Ouzounis
C Sander
C Steegborn
CC Hyde
CE Shannon
D Altschuh
DR Caffrey
E Eyal
E Neher
E Weber-Ban
E Zuckerkandl
ER Tillier
F Pearl
GB Gloor
GM Süel
HO Villar
I Kass
IM Wallace
J Tsai
JA Capra
JP Dekker
K Katoh
K Wang
LA Kelley
LC Martin
M Landau
Matthias Zwick
MC Saraf
ME Noble
O Noivirt
O Olmea
OV Kalinina
OV Kalinina
R Merkl
RA Estabrook
RA Laskowski
Rainer Merkl
RD Finn
RI Dima
S Henikoff
SJ Fleishman
SM Larson
SW Lockless
T Lassmann
T Sato
TD Schneider
U Göbel
V Kulik
V Kulik
WH Press
WR Atchley
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: A multiple sequence alignment (MSA) generated for a protein can be used to characterise residues by means of a statistical analysis of single columns. In addition to the examination of individual positions, the investigation of co-variation of amino acid frequencies offers insights into function and evolution of the protein and residues. RESULTS: We introduce conn(k), a novel parameter for the characterisation of individual residues. For each residue k, conn(k) is the number of most extreme signals of co-evolution. These signals were deduced from a normalised mutual information (MI) value U(k, l) computed for all pairs of residues k, l. We demonstrate that conn(k) is a more robust indicator than an individual MI-value for the prediction of residues most plausibly important for the evolution of a protein. This proposition was inferred by means of statistical methods. It was further confirmed by the analysis of several proteins. A server, which computes conn(k)-values is available at http://www-bioinf.uni-regensburg.de. CONCLUSION: The algorithms H2r, which analyses MSAs and computes conn(k)-values, characterises a specific class of residues. In contrast to strictly conserved ones, these residues possess some flexibility in the composition of side chains. However, their allocation is sensibly balanced with several other positions, as indicated by conn(k)

University of Regensburg Publication Server

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Protein 3D Structure Computed from Evolutionary Sequence Variation

Author: A Kryshtafovych
A Roy
A Schug
A Zemla
AA Fodor
AF Poon
AF Poon
Andrea Pagnani
Andrej Sali
AP Kamat
AR Ortiz
AR Ortiz
ASGB Lapedes
AT Brunger
B Reva
BG Giraud
C Chothia
Chris Sander
CS Miller
D Altschuh
D Altschuh
D Cozzetto
DE Kim
DE Shaw
Debora S. Marks
E Neher
E Schneidman
EI Shakhnovich
F Morcos
G Kolesov
H Fehlhammer
HRFB Kappen
IN Shindyalov
J DeBartolo
J Moult
J Moult
J Moult
J Qiu
J Skolnick
JM Duarte
JM Skerker
JS Yang
JW Locasale
KT Simons
L Burger
L Burger
L Holm
Lucy J. Colwell
M Mezard
M Miyano
M Vendruscolo
M Weigt
MMT Mezard
N Halabi
N Siew
P Bradley
P Bradley
P Fariselli
P Joost
PMJW Ravikumar
R Das
R Nair
R Sathyapriya
RD Finn
Riccardo Zecchina
RO Dror
Robert Sheridan
S Raman
S Raman
S Wu
S Wu
S Yooseph
SD Dunn
T Mora
TF Havel
Thomas A. Hopf
TR Lezon
TR Lezon
U Göbel
V Morea
VMR Sessak
WP Russ
WR Atchley
WR Taylor
WR Taylor
Y Duan
Y Zhang
Y Zhang
YJAH Roudi
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Surface plasmon resonance imaging of cells and surface-associated fibronectin

Author: A Krishnan
Alessandro Tona
Alexander W Peterson
Anne L Plant
BG Keselowsky
BP Nelson
BP Nelson
C Lheveder
CEH Berger
CS Chen
D Altschuh
D Malacara
DG Myszka
DW DeSimone
F Grinnell
FG Giancotti
FS Jones
G Baneyx
GJ Wegner
H Noda
H Raether
I Wierzbicka-Patynowski
JA Ruemmele
JS Burmeister
JS Shumaker-Parry
JT Elliott
KA Peterlinz
KF Giebel
Kiran Bhadriraju
L Guemouri
L Renner
LK Wolf
LK Wolf
M Gotoh
M Halter
M Harke
M Mrksich
M Yamamoto
MA Model
Michael Halter
MR Siegel
N El Kadi
NJ Boudreau
PA Dimilla
SP Palecek
T Ohashi
Y Harpaz
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background A critical challenge in cell biology is quantifying the interactions of cells with their extracellular matrix (ECM) environment and the active remodeling by cells of their ECM. Fluorescence microscopy is a commonly employed technique for examining cell-matrix interactions. A label-free imaging method would provide an alternative that would eliminate the requirement of transfected cells and modified biological molecules, and if collected nondestructively, would allow long term observation and analysis of live cells. Results Using surface plasmon resonance imaging (SPRI), the deposition of protein by vascular smooth muscle cells (vSMC) cultured on fibronectin was quantified as a function of cell density and distance from the cell periphery. We observed that as much as 120 ng/cm2 of protein was deposited by cells in 24 h. Conclusion SPRI is a real-time, low-light-level, label-free imaging technique that allows the simultaneous observation and quantification of protein layers and cellular features. This technique is compatible with live cells such that it is possible to monitor cellular modifications to the extracellular matrix in real-time.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Structural and Functional Roles of Coevolved Sites in Proteins

Author: A Marchler-Bauer
A Valencia
Anna R. Panchenko
AS Kondrashov
B Bulka
BC Lee
BT Korber
C Ferrer-Costa
CA Voigt
CH Yeang
CM Buslje
CS Goh
D Altschuh
DB Johnson
DD Pollock
DJ Watts
DY Little
ER Tillier
G Chelvanayagam
GB Gloor
GL Moore
IN Shindyalov
K Fukami-Kobayashi
K Henrick
K Mizuguchi
KR Wollenberg
L Pritchard
LA Amaral
LC Martin
M Kimura
M Vendruscolo
MC Saraf
MD Daily
MG Kann
N Mathias
Narcis Fernandez-Fuentes
O Olmea
P Shannon
R Gouveia-Oliveira
S Chakrabarti
S Chakrabarti
S Govindarajan
Saikat Chakrabarti
SD Dunn
SN Fatakia
SS Choi
TA Castoe
U Gobel
WL DeLano
WM Fitch
WR Atchley
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Understanding the residue covariations between multiple positions in protein families is very crucial and can be helpful for designing protein engineering experiments. These simultaneous changes or residue coevolution allow protein to maintain its overall structural-functional integrity while enabling it to acquire specific functional modifications. Despite the significant efforts in the field there is still controversy in terms of the preferable locations of coevolved residues on different regions of protein molecules, the strength of coevolutionary signal and role of coevolution in functional diversification.In this paper we study the scale and nature of residue coevolution in maintaining the overall functionality and structural integrity of proteins. We employed a large scale study to investigate the structural and functional aspects of coevolved residues. We found that the networks representing the coevolutionary residue connections within our dataset are in general of 'small-world' type as they have clustering coefficient values higher than random networks and also show smaller mean shortest path lengths similar and/or lower than random and regular networks. We also found that altogether 11% of functionally important sites are coevolved with any other sites. Active sites are found more frequently to coevolve with any other sites (15%) compared to protein (11%) and ligand (9%) binding sites. Metal binding and active sites are also found to be more frequently coevolved with other metal binding and active sites, respectively. Analysis of the coupling between coevolutionary processes and the spatial distribution of coevolved sites reveals that a high fraction of coevolved sites are located close to each other. Moreover, approximately 80% of charge compensatory substitutions within coevolved sites are found at very close spatial proximity (<or= 5A), pointing to the possible preservation of salt bridges in evolution.Our findings show that a noticeable fraction of functionally important sites undergo coevolution and also point towards compensatory substitutions as a probable coevolutionary mechanism within spatially proximal coevolved functional sites

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins

Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

VBN

Efficient representation of uncertainty in multiple sequence alignments using directed acyclic graphs

Author: A Dress
A Godzik
A Löytynoja
A Löytynoja
A Novák
A Novák
A Sali
A Siepel
A Tramontano
Adrienn Szabó
AS Schwartz
AS Schwartz
B Dwivedi
B Knudsen
B Larget
B Misof
B Schwikowski
BD Redelings
BD Redelings
BJM Webb
BP Blackburne
C Dessimoz
C Notredame
C Notredame
CB Do
CJ Challis
D Altschuh
D Chivian
D DeBlasio
D Lupyan
D Metzler
D Metzler
D Robinson
DA Morrison
DF Feng
E Levy Karin
G Jordan
G Landan
G Lunter
G Lunter
G Lunter
G Raghava
G Talavera
GA Churchill
GA Lunter
Hall B G
HT Mevissen
I Holmes
I Miklós
I Miklós
IL Dryden
IM Wallace
István Miklós
J Castresana
J Felsenstein
J Gatesy
J Hein
J Kim
J Zhu
JA Lake
JD Thompson
JD Thompson
JL Thorne
JL Thorne
JL Thorne
JL Thorne
Joseph L Herman
Jotun Hein
K Bucka-Lassen
K Liu
K Liu
KM Wong
L Wang
L Yu
LE Carvalho
LS Wang
M Hamada
M Hamada
M Hamada
M Höhl
M Vingron
M Vingron
M Wu
M Zuker
MA Suchard
MJ Wise
MO Dayhoff
MP Simmons
MS Waterman
MSY Lee
O Gotoh
O Penn
O Penn
O Penn
P Ajawatanawong
P Arunapuram
P Collingridge
PJ Green
PJ Green
PP Gardner
R Durbin
R Satija
R Satija
R Schwarzenbacher
RA Cartwright
RC Edgar
RJ Dickson
RJ Dickson
RK Bradley
Rune Lyngsø
S Capella-Gutiérrez
S Karlin
S Miyazawa
S Needleman
S Sinha
Silla-Martínez Capella-Gutiérrez S
SME Sahraeian
TA Hopf
TH Ogden
TL Blundell
U Roshan
V Ahola
W Fletcher
WC Wheeler
Y Liu
Y Ruffieux
Ádám Novák
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Background A standard procedure in many areas of bioinformatics is to use a single multiple sequence alignment (MSA) as the basis for various types of analysis. However, downstream results may be highly sensitive to the alignment used, and neglecting the uncertainty in the alignment can lead to significant bias in the resulting inference. In recent years, a number of approaches have been developed for probabilistic sampling of alignments, rather than simply generating a single optimum. However, this type of probabilistic information is currently not widely used in the context of downstream inference, since most existing algorithms are set up to make use of a single alignment. Results In this work we present a framework for representing a set of sampled alignments as a directed acyclic graph (DAG) whose nodes are alignment columns; each path through this DAG then represents a valid alignment. Since the probabilities of individual columns can be estimated from empirical frequencies, this approach enables sample-based estimation of posterior alignment probabilities. Moreover, due to conditional independencies between columns, the graph structure encodes a much larger set of alignments than the original set of sampled MSAs, such that the effective sample size is greatly increased. Conclusions The alignment DAG provides a natural way to represent a distribution in the space of MSAs, and allows for existing algorithms to be efficiently scaled up to operate on large sets of alignments. As an example, we show how this can be used to compute marginal probabilities for tree topologies, averaging over a very large number of MSAs. This framework can also be used to generate a statistically meaningful summary alignment; example applications show that this summary alignment is consistently more accurate than the majority of the alignment samples, leading to improvements in downstream tree inference. Implementations of the methods described in this article are available at http://statalign.github.io/WeaveAlign webcite

Crossref

SZTAKI Publication Repository

Springer - Publisher Connector

PubMed Central

Oxford University Research Archive

Computing Highly Correlated Positions Using Mutual Information and Graph Theory for G Protein-Coupled Receptors

Author: A Pagano
AA Ivanov
AK Ramani
AR Ortiz
B Galitsky
BTM Korber
C Goh
C Hemmerich
C Yeang
Carson C. Chow
CD Strader
CE Shannon
CJ Harris
CS Sum
D Altschuh
DD Pollock
DD Pollock
DKY Chiu
DM Rosenbaum
E Neher
F Horn
F Knoflach
F Pazos
F Pazos
FY Carroll
G Casari
G Kleinau
G Kleinau
G Suel
G Swaminath
GB Gloor
H Herzel
H Jaschke
I Halperin
I Kass
IG Tikhonova
IN Shindyalov
J Dutheil
J Kim
J Thomas
JA Ballesteros
JA Ballesteros
JA Capra
JE Donald
JE Donald
JE Donald
JS Surgand
JW Kelly
JX Hu
K Palczewski
K Ray
K Ray
K Sjolander
K Ye
K Ye
KD Pruitt
KL Pierce
KY Yip
L Lewyn
L Oliveira
L Oliveira
L Oliveira
L Pritchard
LA Mirny
LC Martin
LH Heitman
M Raviscioni
M Scarselli
M Socolich
MA Hanson
Matthieu Louis
ME Olah
MJ Buck
ML Lopez-Rodriguez
MS Roulston
MW Dimmic
ND Clarke
NG Hoffman
O Lichtarge
O Lichtarge
O Noivirt
OF Lange
OV Kalinina
PJ Kundrotas
PR Gouldson
R Banerjee
R Brun
R Fredriksson
R Jothi
R Steuer
RI Dima
RM Williamson
RR Gutell
S Chakrabarti
S Costanzi
S Costanzi
S Costanzi
S Costanzi
S Govindarajan
S Litschig
S Madabushi
S Moore
S Moro
S Ohno
S Takeda
Sarosh N. Fatakia
SB Nagl
SB Nagl
SD Dunn
SGF Rasmussen
SJ Fleishman
SS Hannenhalli
Stefano Costanzi
SW Lockless
T Klabunde
T Klabunde
T Sato
T Warne
TD Schneider
TM Cover
V Batageli
V Cherezov
VP Jaakola
WP Russ
WR Atchley
WR Atchley
WR Taylor
Y Liu
Y Qi
Publication venue: Public Library of Science
Publication date: 05/03/2009
Field of study

G protein-coupled receptors (GPCRs) are a superfamily of seven transmembrane-spanning proteins involved in a wide array of physiological functions and are the most common targets of pharmaceuticals. This study aims to identify a cohort or clique of positions that share high mutual information. Using a multiple sequence alignment of the transmembrane (TM) domains, we calculated the mutual information between all inter-TM pairs of aligned positions and ranked the pairs by mutual information. A mutual information graph was constructed with vertices that corresponded to TM positions and edges between vertices were drawn if the mutual information exceeded a threshold of statistical significance. Positions with high degree (i.e. had significant mutual information with a large number of other positions) were found to line a well defined inter-TM ligand binding cavity for class A as well as class C GPCRs. Although the natural ligands of class C receptors bind to their extracellular N-terminal domains, the possibility of modulating their activity through ligands that bind to their helical bundle has been reported. Such positions were not found for class B GPCRs, in agreement with the observation that there are not known ligands that bind within their TM helical bundle. All identified key positions formed a clique within the MI graph of interest. For a subset of class A receptors we also considered the alignment of a portion of the second extracellular loop, and found that the two positions adjacent to the conserved Cys that bridges the loop with the TM3 qualified as key positions. Our algorithm may be useful for localizing topologically conserved regions in other protein families

Public Library of Science (PLOS)

Crossref

PubMed Central