Search CORE

79 research outputs found

FoldGPCR: Structure prediction protocol for the transmembrane domain of G protein-coupled receptors from class A

Author: Archer
Baldwin
Ballesteros
Bhattacharya
Bissantz
Brooks
Brooks
Bu
Bu
Bu
Chen
Chen
Cherezov
Chopra
Farrens
Feig
Fleishman
Fredriksson
Gloriam
Grigorieff
Hanson
Hawtin
Herzyk
Im
Im
Jaakola
Kobilka
Kobilka
Kopp
Lefkowitz
Li
Lomize
Lomize
MacKerell
Mackerell
Marti-Renom
Michino
Michino
Mirzadegan
Mitsutake
Momany
Murakami
Okada
Palczewski
Park
Pierce
Pogozheva
Roche
Rohl
Rosenbaum
Salom
Scheerer
Shacham
Shi
Suel
Sugita
Surgand
Tress
Tyndall
Vaidehi
Vortherms
Warne
Zagrovic
Zhang
Zhang
Zhang
Publication venue: 'Wiley'
Publication date: 01/08/2010
Field of study

Building reliable structural models of G protein-coupled receptors (GPCRs) is a difficult task because of the paucity of suitable templates, low sequence identity, and the wide variety of ligand specificities within the superfamily. Template-based modeling is known to be the most successful method for protein structure prediction. However, refinement of homology models within 1–3 Å CΑ RMSD of the native structure remains a major challenge. Here, we address this problem by developing a novel protocol (foldGPCR) for modeling the transmembrane (TM) region of GPCRs in complex with a ligand, aimed to accurately model the structural divergence between the template and target in the TM helices. The protocol is based on predicted conserved inter-residue contacts between the template and target, and exploits an all-atom implicit membrane force field. The placement of the ligand in the binding pocket is guided by biochemical data. The foldGPCR protocol is implemented by a stepwise hierarchical approach, in which the TM helical bundle and the ligand are assembled by simulated annealing trials in the first step, and the receptor-ligand complex is refined with replica exchange sampling in the second step. The protocol is applied to model the human Β 2 -adrenergic receptor (Β 2 AR) bound to carazolol, using contacts derived from the template structure of bovine rhodopsin. Comparison with the X-ray crystal structure of the Β 2 AR shows that our protocol is particularly successful in accurately capturing helix backbone irregularities and helix-helix packing interactions that distinguish rhodopsin from Β 2 AR. Proteins 2010. © 2010 Wiley-Liss, Inc.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/77435/1/22731_ftp.pd

Crossref

PubMed Central

Deep Blue Documents at the University of Michigan

In Silico Veritas: The Pitfalls and Challenges of Predicting

Recently the first community-wide assessments of the prediction of the structures of complexes between proteins and small molecule ligands have been reported in the so-called GPCR Dock 2008 and 2010 assessments. In the current review we discuss the different steps along the protein-ligand modeling workflow by critically analyzing the modeling strategies we used to predict the structures of protein-ligand complexes we submitted to the recent GPCR Dock 2010 challenge. These representative test cases, focusing on the pharmaceutically relevant G Protein-Coupled Receptors, are used to demonstrate the strengths and challenges of the different modeling methods. Our analysis indicates that the proper performance of the sequence alignment, introduction of structural adjustments guided by experimental data, and the usage of experimental data to identify protein-ligand interactions are critical steps in the protein-ligand modeling protocol. © 2011 by the authors; licensee MDPI, Basel, Switzerland

VU Research Portal

Crossref

Directory of Open Access Journals

PubMed Central

Radboud Repository

Global Analysis of Small Molecule Binding to Related Protein Targets

Author: A Bateman
A Halevy
A Hopkins
AJ Vilella
AL Hopkins
AM Davis
CF George
D Tanramluk
D Vidal
DE Gloriam
DL Nersesian
E Koonin
E van der Horst
E van der Horst
Felix A. Kruger
Greg Tucker-Kellogg
I Shamovsky
J Cozzi
J Fox
J Overington
JD Wichard
John P. Overington
JP Overington
JS Surgand
JT Metz
JU Bowie
KL McGary
KR Taylor
L Ireland-Denny
L Zhi
LA Black
LJ Bellis
M Davies
M Ekroos
M Kimura
M Vieth
MA Fabian
ME Peterson
MM Hann
MP Gleeson
MS Stephens
MW Karaman
N Brooijmans
PR Caron
RD Finn
RL Tatusov
TA Esbenshade
TW Lovenberg
W Sujansky
WM Fitch
X Ligneau
Y Arens
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

We report on the integration of pharmacological data and homology information for a large scale analysis of small molecule binding to related targets. Differences in small molecule binding have been assessed for curated pairs of human to rat orthologs and also for recently diverged human paralogs. Our analysis shows that in general, small molecule binding is conserved for pairs of human to rat orthologs. Using statistical tests, we identified a small number of cases where small molecule binding is different between human and rat, some of which had previously been reported in the literature. Knowledge of species specific pharmacology can be advantageous for drug discovery, where rats are frequently used as a model system. For human paralogs, we demonstrate a global correlation between sequence identity and the binding of small molecules with equivalent affinity. Our findings provide an initial general model relating small molecule binding and sequence divergence, containing the foundations for a general model to anticipate and predict within-target-family selectivity

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Multidimensional Scaling Reveals the Main Evolutionary Pathways of Class A G-Protein-Coupled Receptors

Author: A Gogos
A Rokas
A Rokas
B Wu
C Kuiken
C Tuffley
David Thybert
DE Gloriam
DG Higgins
DJ Smith
DK Vassilatis
E Susko
F Lu
F Murtagh
G Blackshields
G Casari
H Abdi
H Abdi
H Rompler
HC Wang
Hervé Abdi
I Domazet
I Kass
IG Choi
J Devillé
J Hou
J Hou
J Tzeng
JC Gower
JH Park
JS Surgand
Julien Pelé
JW DeLano
K Palczewski
K Ye
KB Nicholas
KJ Woolley
Kolakowski LF Jr
M Anctil
M Greenacre
M Nei
MA Larkin
Marie Chabbert
Matthieu Moreau
MC Peeters
MW Trosset
P Rousseeuw
P Scheerer
R Fredriksson
R Fredriksson
RA Studer
RP Metpally
S Yohannan
S Yohannan
SC Sealfon
SG Rasmussen
T Haitina
U Gether
V Cherezov
Vladimir N. Uversky
VN Grishin
W Shi
WM Fitch
WS Togerson
WS Valdar
Y Takane
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Class A G-protein-coupled receptors (GPCRs) constitute the largest family of transmembrane receptors in the human genome. Understanding the mechanisms which drove the evolution of such a large family would help understand the specificity of each GPCR sub-family with applications to drug design. To gain evolutionary information on class A GPCRs, we explored their sequence space by metric multidimensional scaling analysis (MDS). Three-dimensional mapping of human sequences shows a non-uniform distribution of GPCRs, organized in clusters that lay along four privileged directions. To interpret these directions, we projected supplementary sequences from different species onto the human space used as a reference. With this technique, we can easily monitor the evolutionary drift of several GPCR sub-families from cnidarians to humans. Results support a model of radiative evolution of class A GPCRs from a central node formed by peptide receptors. The privileged directions obtained from the MDS analysis are interpretable in terms of three main evolutionary pathways related to specific sequence determinants. The first pathway was initiated by a deletion in transmembrane helix 2 (TM2) and led to three sub-families by divergent evolution. The second pathway corresponds to the differentiation of the amine receptors. The third pathway corresponds to parallel evolution of several sub-families in relation with a covarion process involving proline residues in TM2 and TM5. As exemplified with GPCRs, the MDS projection technique is an important tool to compare orthologous sequence sets and to help decipher the mutational events that drove the evolution of protein families

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

Molecular Evolution of the Neuropeptide S Receptor

Author: A Krogh
A Oksche
AD Macrae
AJ Drummond
AV Sverdlov
AY Guo
B Beck
B Chini
B Mouillac
B Neron
B Vroling
C Ruzza
CH Hoyle
D Nepomuceno
D Sanchez
DA Schwarz
DE Gloriam
DE Gloriam
DG Bichet
DM Duangdao
E Melen
E Stafflinger
E Ufer
F Abascal
F Ronquist
G Cazzamali
G Gimpl
G Sundström
GF Schertler
GJ Roch
HC Pape
HH Zingg
I Erlenbach
J Gupte
J Majewski
J Vendelin
JA Ballesteros
JI Park
JS Surgand
K Katoh
K Ukena
KK Hansen
KP Hofmann
KPD Tamura
L Oliveira
L Wang
M Conner
M D'Amato
M Vanetti
MJ Aikins
MJ Betts
MP Bokoch
MR Elphick
N Cotte
N Okamura
Narayanan Manoj
P Gouet
P Scheerer
R Postina
RK Reinscheid
RK Reinscheid
RK Reinscheid
RW Han
S Ahuja
S Dreborg
S Sato
SD Clark
SR Eddy
T Kawada
T Laitinen
T Laitinen
T Warne
Thejkiran Pitti
V Bernier
V Brendel
V Camarda
V Pulkkinen
Vladimir N. Uversky
W Li
W Li
WG Qiu
X Gu
X Gu
X Gu
X Gu
YL Peng
YL Xu
YL Xu
Publication venue: Public Library of Science
Publication date: 30/03/2012
Field of study

The neuropeptide S receptor (NPSR) is a recently deorphanized member of the G protein-coupled receptor (GPCR) superfamily and is activated by the neuropeptide S (NPS). NPSR and NPS are widely expressed in central nervous system and are known to have crucial roles in asthma pathogenesis, locomotor activity, wakefulness, anxiety and food intake. The NPS-NPSR system was previously thought to have first evolved in the tetrapods. Here we examine the origin and the molecular evolution of the NPSR using in-silico comparative analyses and document the molecular basis of divergence of the NPSR from its closest vertebrate paralogs. In this study, NPSR-like sequences have been identified in a hemichordate and a cephalochordate, suggesting an earlier emergence of a NPSR-like sequence in the metazoan lineage. Phylogenetic analyses revealed that the NPSR is most closely related to the invertebrate cardioacceleratory peptide receptor (CCAPR) and the group of vasopressin-like receptors. Gene structure features were congruent with the phylogenetic clustering and supported the orthology of NPSR to the invertebrate NPSR-like and CCAPR. A site-specific analysis between the vertebrate NPSR and the well studied paralogous vasopressin-like receptor subtypes revealed several putative amino acid sites that may account for the observed functional divergence between them. The data can facilitate experimental studies aiming at deciphering the common features as well as those related to ligand binding and signal transduction processes specific to the NPSR

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Chemogenomic Analysis of G-Protein Coupled Receptors and Their Ligands Deciphers Locks and Keys Governing Diverse Aspects of Signalling

Author: A Ghose
A Zurn
AJ Harmar
AL Hopkins
Antonius ter Laak
AO Chugunov
AV Smrcka
B Kobilka
B Kuhn
BK Kobilka
CJ van Koppen
CL Worth
D Massotte
D Verzijl
DE Gloriam
DM Perez
DM Rosenbaum
E Jacoby
E Urizar
F Horn
G Kleinau
G Kleinau
G Kleinau
G Vauquelin
Gerd Krause
GF Schertler
Gunnar Kleinau
H Kubinyi
H Unal
HE van der
ID Pogozheva
J Hu
J Huang
J Huynh
J Wess
J Wess
JA Ballesteros
JD Tyndall
JH Park
JR Raymond
JS Surgand
Jörg D. Wichard
K Balakin
K Hogan
K Kristiansen
K Wenzel-Seifert
K Ye
KP Hofmann
L Hu
L Jacob
L Lowell
L Oliveira
L Pardo
Leo Lee
LH Hall
M Conner
MA Cascieri
MA Hanson
MC Lagerstrom
MJ Keiser
MJ Smit
MP Bokoch
N Lehmann
N Van Eps
Nikolaus Heinrich
OM Becker
P Scheerer
P Scheerer
P Tarnow
PD Evans
PW Hildebrand
Q Jiang
R Fredriksson
R Seifert
R Steuer
R Todeschini
RB Westkaemper
RD Finn
Ronald Kühne
RP Bywater
RT Dorsam
S Ahuja
S Bhattacharya
S Moller
S Robb
S Schlyer
S Ye
SA Nickolls
SK Wong
SN Fatakia
SP Runyon
SR Eddy
T Klabunde
T Schoneberg
T Schoneberg
TE Angel
TK Bjarnadottir
TW Schwartz
V Cherezov
VP Jaakola
W Li
WM Oldham
Y Okuno
Y Sun
YK Yang
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Understanding the molecular mechanism of signalling in the important super-family of G-protein-coupled receptors (GPCRs) is causally related to questions of how and where these receptors can be activated or inhibited. In this context, it is of great interest to unravel the common molecular features of GPCRs as well as those related to an active or inactive state or to subtype specific G-protein coupling. In our underlying chemogenomics study, we analyse for the first time the statistical link between the properties of G-protein-coupled receptors and GPCR ligands. The technique of mutual information (MI) is able to reveal statistical inter-dependence between variations in amino acid residues on the one hand and variations in ligand molecular descriptors on the other. Although this MI analysis uses novel information that differs from the results of known site-directed mutagenesis studies or published GPCR crystal structures, the method is capable of identifying the well-known common ligand binding region of GPCRs between the upper part of the seven transmembrane helices and the second extracellular loop. The analysis shows amino acid positions that are sensitive to either stimulating (agonistic) or inhibitory (antagonistic) ligand effects or both. It appears that amino acid positions for antagonistic and agonistic effects are both concentrated around the extracellular region, but selective agonistic effects are cumulated between transmembrane helices (TMHs) 2, 3, and ECL2, while selective residues for antagonistic effects are located at the top of helices 5 and 6. Above all, the MI analysis provides detailed indications about amino acids located in the transmembrane region of these receptors that determine G-protein signalling pathway preferences

Crossref

Directory of Open Access Journals

PubMed Central

Hochschulbibliothekszentrum des Landes Nordrhein-Westfalen (hbz)

Computing Highly Correlated Positions Using Mutual Information and Graph Theory for G Protein-Coupled Receptors

Author: A Pagano
AA Ivanov
AK Ramani
AR Ortiz
B Galitsky
BTM Korber
C Goh
C Hemmerich
C Yeang
Carson C. Chow
CD Strader
CE Shannon
CJ Harris
CS Sum
D Altschuh
DD Pollock
DD Pollock
DKY Chiu
DM Rosenbaum
E Neher
F Horn
F Knoflach
F Pazos
F Pazos
FY Carroll
G Casari
G Kleinau
G Kleinau
G Suel
G Swaminath
GB Gloor
H Herzel
H Jaschke
I Halperin
I Kass
IG Tikhonova
IN Shindyalov
J Dutheil
J Kim
J Thomas
JA Ballesteros
JA Ballesteros
JA Capra
JE Donald
JE Donald
JE Donald
JS Surgand
JW Kelly
JX Hu
K Palczewski
K Ray
K Ray
K Sjolander
K Ye
K Ye
KD Pruitt
KL Pierce
KY Yip
L Lewyn
L Oliveira
L Oliveira
L Oliveira
L Pritchard
LA Mirny
LC Martin
LH Heitman
M Raviscioni
M Scarselli
M Socolich
MA Hanson
Matthieu Louis
ME Olah
MJ Buck
ML Lopez-Rodriguez
MS Roulston
MW Dimmic
ND Clarke
NG Hoffman
O Lichtarge
O Lichtarge
O Noivirt
OF Lange
OV Kalinina
PJ Kundrotas
PR Gouldson
R Banerjee
R Brun
R Fredriksson
R Jothi
R Steuer
RI Dima
RM Williamson
RR Gutell
S Chakrabarti
S Costanzi
S Costanzi
S Costanzi
S Costanzi
S Govindarajan
S Litschig
S Madabushi
S Moore
S Moro
S Ohno
S Takeda
Sarosh N. Fatakia
SB Nagl
SB Nagl
SD Dunn
SGF Rasmussen
SJ Fleishman
SS Hannenhalli
Stefano Costanzi
SW Lockless
T Klabunde
T Klabunde
T Sato
T Warne
TD Schneider
TM Cover
V Batageli
V Cherezov
VP Jaakola
WP Russ
WR Atchley
WR Atchley
WR Taylor
Y Liu
Y Qi
Publication venue: Public Library of Science
Publication date: 05/03/2009
Field of study

G protein-coupled receptors (GPCRs) are a superfamily of seven transmembrane-spanning proteins involved in a wide array of physiological functions and are the most common targets of pharmaceuticals. This study aims to identify a cohort or clique of positions that share high mutual information. Using a multiple sequence alignment of the transmembrane (TM) domains, we calculated the mutual information between all inter-TM pairs of aligned positions and ranked the pairs by mutual information. A mutual information graph was constructed with vertices that corresponded to TM positions and edges between vertices were drawn if the mutual information exceeded a threshold of statistical significance. Positions with high degree (i.e. had significant mutual information with a large number of other positions) were found to line a well defined inter-TM ligand binding cavity for class A as well as class C GPCRs. Although the natural ligands of class C receptors bind to their extracellular N-terminal domains, the possibility of modulating their activity through ligands that bind to their helical bundle has been reported. Such positions were not found for class B GPCRs, in agreement with the observation that there are not known ligands that bind within their TM helical bundle. All identified key positions formed a clique within the MI graph of interest. For a subset of class A receptors we also considered the alignment of a portion of the second extracellular loop, and found that the two positions adjacent to the conserved Cys that bridges the loop with the TM3 qualified as key positions. Our algorithm may be useful for localizing topologically conserved regions in other protein families

Public Library of Science (PLOS)

Crossref

PubMed Central

Développement de nouvelles méthodes bioinformatiques pour l’étude des récepteurs couplés aux protéines G

Author: SURGAND Jean-Sébastien
Publication venue
Publication date: 24/05/2006
Field of study

Les récepteurs couplés aux protéines G (RCPG) sont des protéines membranaires responsables de la transduction de signaux de l'extérieur vers l'intérieur de la cellule. Localisés partout dans l'organisme, ils sont impliqués dans de très nombreuses fonc tions physiologiques comme la vision, l'olfaction, la croissance et l'adhésion cellulaire, la régulation hormonale, etc. Ils sont cibles d'une grande diversité de ligands : des photons, des ions, des amines biogènes, des hormones, des glycoprotéines, des molécules odorantes et gustatives, etc. Un RCPG est formé de 7 hélices α transmembranaires reliées par des boucles intra- et extra-cellulaires. Ces 7 hélices α délimitent une cavité. Les ligands se fixent soit dans cette cavité soit au niveau des boucles extra-cellulaires, activant le récepteur qui va se lier à une protéine G à l'intérieur de la cellule, initiant une cascade de réactions chimiques. Un seul RCPG a été cristallisé jusqu'µa présent, la rhodopsine bovine, apportant une information structurale précieuse. Les RCPG présentent un grand intérêt pharmacologique. Leur diversité et les nom- breuses fonctions qu'ils contrôlent les font intervenir dans de nombreuses pathologies. Plus de 30% des nouveaux médicaments mis sur le marché ciblent des RCPG. De plus, beaucoup de RCPG sont encore orphelins, c'est-à-dire sans ligand connu, et possèdent donc un fort potentiel pharmacologique. Les RCPG forment une super-famille de plus d'un millier de membres, dont la grande majorité sont responsables de la perception des molécules olfactives. Le site de liaison de la plupart des RCPG est situé dans la cavité transmembranaire ; mais pour certains, le site est externe à la membrane et le ligand se fixe aux boucles extra- cellulaires. Mais même dans ces cas, les récepteurs possèdent une cavité transmembranaire et c'est sur elle que nous focalisons notre travail. Plusieurs classifications des RCPG ont déjà été proposées : phylogénétiques, basées sur des automates statistiques, sur des empreintes physico-chimiques ou sur la compo- sition en acides aminés. Mais aucune ne prend en compte précisément le point de vue pharmacologique, axé sur le ligand (le médicament). C'est pourquoi nous proposons une nouvelle classification des RCPG orientée phar- macologie. Elle est basée sur l'étude de résidus supposés critiques de la cavité trans- membranaire, qu'on pense interagir avec des ligands. Nous partons d'un jeu de données de 369 séquences de RCPG humains non-olfactifs, aussi (( propre )) que possible. Puis nous alignons les parties transmembranaires de fa»con automatique mais avec une étape de vérification et de raffinement manuels. Ensuite nous extrayons 30 résidus critiques en étudiant la cavité de la rhodopsine bovine, dont les parties transmembranaires ont une très forte identité avec celles de la rhodopsine humaine (94%). Nous supposons que ces 30 résidus sont critiques pour tous les RCPG, c'est-à-dire que la forme des cavités de tous les récepteurs est globalement conservée. Cette supposition est étayée par diverses publications. Enfin nous classifions ces séquences discontinues par une méthode de clustering hiérarchique agglomératif (la méthode UPGMA). Les distances entre séquences sont simplement les scores d'identité entre elles. Une procédure de bootstrapping vient apporter un poids statistique à la classification qui aboutit à 22 clusters bien distincts. Notre classification est en accord avec la classification GRAFS publiée récemment (analyse phylogénétique exhaustive de 342 RCPG humains non-olfactifs), c'est-à-dire que les clusters obtenus correspondent aux familles et sous-familles déjà identifiees, à quelques différences près. On peut appliquer cette classification à la recherche de nouvelles cibles pour des ligands partageant des sous-structures communes (on parle de structures privilégiées). Partant de ligands dont on connait des récepteurs, on recherche parmi les 30 résidus critiques de ces récepteurs ceux qui interviennent dans la liaison. Ensuite on recherche d'autres récepteurs qui partagent les mêmes résidus. On peut proposer ces nouveaux récepteurs comme cibles potentielles pour les ligands de départ. Une deuxième application immédiate de notre classification est la désorphanisation de récepteurs, c'est-à-dire la découverte d'un premier ligand pour un récepteur orphelin. Nous proposons comme point de départ pour un récepteur orphelin les ligands connus des récepteurs de la même classe. La pertinence d'une classification basée sur les cavités transmembranaires pour des récepteurs dont le site de liaison n'est pas la cavité est discutable. Pourtant nous constatons que les familles pour lesquelles le ligand se fixe à l'extérieur de la membrane (famille de classe secrétine et glutamate) sont très bien identifiées et séparées les unes des autres. Dans un deuxième temps, nous avons construit une autre classification des RCPG en utilisant, non plus les séquences, mais des modèles 3D des récepteurs, construits par homologie. Pour schématiser, nous recherchons le point de vue qu'aurait un ligand placé dans la cavité. Nous plaçons donc une sphère conceptuelle, qui représente le ligand, au centre de gravité de la cavité. Cette sphère est découpée en 80 éléments triangulaires, de même taille et disposés uniformément sur la sphère. Ensuite nous projetons, à partir du carbone β des résidus, des informations physico-chimiques et géométriques de la cavité dans les triangles qui intersectent la demi-droite partant du centre de la sphère et allant vers le carbone β du résidu considéré. Enfin nous comparons les sphères deux à deux, chaque comparaison donnant un score utilisé pour générer une matrice de distances et finalement une classification par la même méthode que précédemment. La méthode est assez floue (discrétisation peu poussée, projection à partir des carbones β et non à partir de tous les atomes) pour masquer les erreurs dues à la modélisation. Les modèles utilisés sont préalignés sur la structure qui a servi de point de départ, la structure cristallographique de la rhodopsine bovine. Malheureusement ces alignements ne sont pas suffisamment précis pour la génération d'une matrice de distances. Nous avons donc construit un outil d'alignement structural pour les raffiner. Cet outil est guidé par le score décrit ci-dessus pour trouver l'alignement entre deux cavités. Le meilleur score donnera le résultat de l'alignement. La classification est homogène avec la précédente, mais le nombre de récepteurs non classés (singletons) est plus grand. Intéressant est le fait que le récepteur DUFFY, non classé par notre précédente classification, est classé ici avec les récepteurs des chimiokines, en accord avec la classification de la base Swiss-Prot. Notre classification a en outre donné naissance à un outil d'alignement structural de cavité adapté au travail sur des modèles, mais qui peut aussi aligner des structures cristallographiques.G-protein-coupled receptors (GPCRs) are membrane proteins responsible for the transduction of signals from outside into the cell. Distributed in the whole body, they are implied in various physiological functions, like vision, olfaction, cellular growth and adhesion, etc. They are targeted by a tremendous diversity of possible ligands: photons, ions, biogenic amines, hormones, glycoproteins, olfactive and gustative molecules, etc. A GPCR is composed of 7 transmembrane α-helices linked together by intra- and extracellular loops. These 7 α-helices delineate a cavity. Ligands bind either in this cavity, or on the extracellular loops, activating the receptor that will link to a G-protein inside the cell, initiating a cascade of secondary messengers. Up to now, the bovine rhodopsin is the only known crystallographic structure, bringing a valuable structural information. GPCRs present a big pharmacological interest. Their diversity and the numerous functions they control get them involved in numerous pathologies. More than 30% of new commercialized drugs target GPCRs. Furthermore, many GPCRs are still orphan, without known ligand, and hence constitute potential pharmacological targets. GPCRs form a superfamily of more than one thousand members. Three out of four are involved in the perception of olfactive molecules. The binding site of most GPCRs is located in the transmembrane cavity; but for some of them, it is located outside the membrane and the ligand binds an extracellular loop. But even for these cases, all receptors show a transmembrane cavity on which we will concentrate during this work. Several classifications of GPCRs have been proposed: phylogenetic classifications, or based on statistical automata, or on physicochemical fingerprints, or based on their amino-acid composition. But none of them takes into account precisely the pharmacolo- gical point of view of the ligand (the drug). That's why we propose a new classification of GPCRs which is pharmacologically- oriented. It is based on the study of some residues of the transmembrane cavity supposed to be critical and to interact with the ligand. We start from a dataset of 369 sequences of human nonolfactory GPCRs, as \clean" as possible. Then we align automatically the transmembrane parts, but with a manual check following. Then we extract 30 critical residues by studying the cavity of bovine rhodopsin, which transmembrane parts share a high identity score with the human rhodopsin (94%). We suppose that these 30 residues are critical for all GPCRs, i.e. the fold of all GPCRs is overall conserved. This hypothesis is supported by several publications. Eventually, we classify these sequences by an agglo merative hierarchical clustering algorithm (UPGMA). The distances between sequences are simply the identity scores between them. A bootstrap procedure brings a statistical support to the classification, that leads to 22 well-defined clusters. Our classification agrees the recently published GRAFS classification (a phylogenetic analysis of 342 human non-olfactory GPCRs): resulting clusters correspond to already identified families and subfamilies, with slight differences. This classi¯cation can be applied to ¯nd new targets to ligands that share some common substructures (called priviledged structures). We start from ligands with known receptors, we seek among the 30 critical residues of these receptors those that participate to the binding. Then we seek for other receptors that share the same residues. We can eventually propose these new receptors as putative targets for the ligands we started with. A second straightforward application of the classification is the deorphanization of receptors, i.e. the discovery of a fisrt ligand for an orphan receptor. We propose, as a starting point for an orphan receptor, the known ligands of the receptors of the same cluster. The relevence of a classification based on the transmembrane cavities for receptors which binding site is not the this cavity is questionable. However we find that even the families for which the ligand bind outside the membrane (secretin and glutamate families) are well identified and well separated. In a second time, we built another classification of GPCRs, based on 3D homology models rather than on sequences. To simplify, we try to take the point of view of a ligand inside a cavity. We put a conceptual sphere, that stands for the ligand, at the center of gravity of the cavity. This sphere is tessellated into 80 triangles, with same size, homogenously dispatched. Then we project from the β carbon of the residues some phy sicochemical and geometrical information into the triangles. Eventually we compare the spheres, each comparison gives a score used to build a distance matrix and a classification with the same method as for the previous one. This method is quite fuzzy (low resolution discretization, projection from the β carbons and not from all atoms) to hide the errors due to modelling. The models are prealigned on the cristal structure of bovine rhodopsin. But unfortu- nately these alignments were not precise enough to build a distance matrix. So we coded a structural alignment tool to refine the alignments. This tool is guided by the previsouly- described score to find an alignment between two cavities. The best score gives the output alignment. The resulting clustering is homogenous with the previous one, but the number of non-classified receptors (singletons) is higher. Interestingly, the receptor DUFFY, not classified by our previous clustering, is classified here in the Chemokine cluster, in agree- ment with the Swiss-Prot database classification. Our classification leads to a structural alignment tool adapted to work on models, but that can also work on crystal structures

Theses de l'ULP

Thèses-Unistra : thèses et mémoires électroniques de l'Université de Strasbourg

Development of new bioinformatics methods for the study of G-protein-coupled receptors

Author: Surgand Jean-Sébastien
Publication venue
Publication date: 24/05/2006
Field of study

Thèses-Unistra : thèses et mémoires électroniques de l'Université de Strasbourg

Development of new bioinformatics methods for the study of G-protein-coupled receptors

Author: Surgand Jean-Sébastien
Publication venue
Publication date: 24/05/2006
Field of study

Les récepteurs couplés aux protéines G (RCPG) sont des protéines membranaires responsables de la transduction de signaux de l'extérieur vers l'intérieur de la cellule. Localisés partout dans l'organisme, ils sont impliqués dans de très nombreuses fonc tions physiologiques comme la vision, l'olfaction, la croissance et l'adhésion cellulaire, la régulation hormonale, etc. Ils sont cibles d'une grande diversité de ligands : des photons, des ions, des amines biogènes, des hormones, des glycoprotéines, des molécules odorantes et gustatives, etc. Un RCPG est formé de 7 hélices a transmembranaires reliées par des boucles intra- et extra-cellulaires. Ces 7 hélices a délimitent une cavité. Les ligands se fixent soit dans cette cavité soit au niveau des boucles extra-cellulaires, activant le récepteur qui va se lier à une protéine G à l'intérieur de la cellule, initiant une cascade de réactions chimiques. Un seul RCPG a été cristallisé jusqu' a présent, la rhodopsine bovine, apportant une information structurale précieuse. Les RCPG présentent un grand intérêt pharmacologique. Leur diversité et les nom- breuses fonctions qu'ils contrôlent les font intervenir dans de nombreuses pathologies. Plus de 30% des nouveaux médicaments mis sur le marché ciblent des RCPG. De plus, beaucoup de RCPG sont encore orphelins, c'est-à-dire sans ligand connu, et possèdent donc un fort potentiel pharmacologique. Les RCPG forment une super-famille de plus d'un millier de membres, dont la grande majorité sont responsables de la perception des molécules olfactives. Le site de liaison de la plupart des RCPG est situé dans la cavité transmembranaire ; mais pour certains, le site est externe à la membrane et le ligand se fixe aux boucles extra- cellulaires. Mais même dans ces cas, les récepteurs possèdent une cavité transmembranaire et c'est sur elle que nous focalisons notre travail. Plusieurs classifications des RCPG ont déjà été proposées : phylogénétiques, basées sur des automates statistiques, sur des empreintes physico-chimiques ou sur la compo- sition en acides aminés. Mais aucune ne prend en compte précisément le point de vue pharmacologique, axé sur le ligand (le médicament). C'est pourquoi nous proposons une nouvelle classification des RCPG orientée phar- macologie. Elle est basée sur l'étude de résidus supposés critiques de la cavité trans- membranaire, qu'on pense interagir avec des ligands. Nous partons d'un jeu de données de 369 séquences de RCPG humains non-olfactifs, aussi (( propre )) que possible. Puis nous alignons les parties transmembranaires de fa con automatique mais avec une étape de vérification et de raffinement manuels. Ensuite nous extrayons 30 résidus critiques en étudiant la cavité de la rhodopsine bovine, dont les parties transmembranaires ont une très forte identité avec celles de la rhodopsine humaine (94%). Nous supposons que ces 30 résidus sont critiques pour tous les RCPG, c'est-à-dire que la forme des cavités de tous les récepteurs est globalement conservée. Cette supposition est étayée par diverses publications. Enfin nous classifions ces séquences discontinues par une méthode de clustering hiérarchique agglomératif (la méthode UPGMA). Les distances entre séquences sont simplement les scores d'identité entre elles. Une procédure de bootstrapping vient apporter un poids statistique à la classification qui aboutit à 22 clusters bien distincts. Notre classification est en accord avec la classification GRAFS publiée récemment (analyse phylogénétique exhaustive de 342 RCPG humains non-olfactifs), c'est-à-dire que les clusters obtenus correspondent aux familles et sous-familles déjà identifiees, à quelques différences près. On peut appliquer cette classification à la recherche de nouvelles cibles pour des ligands partageant des sous-structures communes (on parle de structures privilégiées). Partant de ligands dont on connait des récepteurs, on recherche parmi les 30 résidus critiques de ces récepteurs ceux qui interviennent dans la liaison. Ensuite on recherche d'autres récepteurs qui partagent les mêmes résidus. On peut proposer ces nouveaux récepteurs comme cibles potentielles pour les ligands de départ. Une deuxième application immédiate de notre classification est la désorphanisation de récepteurs, c'est-à-dire la découverte d'un premier ligand pour un récepteur orphelin. Nous proposons comme point de départ pour un récepteur orphelin les ligands connus des récepteurs de la même classe. La pertinence d'une classification basée sur les cavités transmembranaires pour des récepteurs dont le site de liaison n'est pas la cavité est discutable. Pourtant nous constatons que les familles pour lesquelles le ligand se fixe à l'extérieur de la membrane (famille de classe secrétine et glutamate) sont très bien identifiées et séparées les unes des autres. Dans un deuxième temps, nous avons construit une autre classification des RCPG en utilisant, non plus les séquences, mais des modèles 3D des récepteurs, construits par homologie. Pour schématiser, nous recherchons le point de vue qu'aurait un ligand placé dans la cavité. Nous plaçons donc une sphère conceptuelle, qui représente le ligand, au centre de gravité de la cavité. Cette sphère est découpée en 80 éléments triangulaires, de même taille et disposés uniformément sur la sphère. Ensuite nous projetons, à partir du carbone b des résidus, des informations physico-chimiques et géométriques de la cavité dans les triangles qui intersectent la demi-droite partant du centre de la sphère et allant vers le carbone b du résidu considéré. Enfin nous comparons les sphères deux à deux, chaque comparaison donnant un score utilisé pour générer une matrice de distances et finalement une classification par la même méthode que précédemment. La méthode est assez floue (discrétisation peu poussée, projection à partir des carbones b et non à partir de tous les atomes) pour masquer les erreurs dues à la modélisation. Les modèles utilisés sont préalignés sur la structure qui a servi de point de départ, la structure cristallographique de la rhodopsine bovine. Malheureusement ces alignements ne sont pas suffisamment précis pour la génération d'une matrice de distances. Nous avons donc construit un outil d'alignement structural pour les raffiner. Cet outil est guidé par le score décrit ci-dessus pour trouver l'alignement entre deux cavités. Le meilleur score donnera le résultat de l'alignement. La classification est homogène avec la précédente, mais le nombre de récepteurs non classés (singletons) est plus grand. Intéressant est le fait que le récepteur DUFFY, non classé par notre précédente classification, est classé ici avec les récepteurs des chimiokines, en accord avec la classification de la base Swiss-Prot. Notre classification a en outre donné naissance à un outil d'alignement structural de cavité adapté au travail sur des modèles, mais qui peut aussi aligner des structures cristallographiques.G-protein-coupled receptors (GPCRs) are membrane proteins responsible for the transduction of signals from outside into the cell. Distributed in the whole body, they are implied in various physiological functions, like vision, olfaction, cellular growth and adhesion, etc. They are targeted by a tremendous diversity of possible ligands: photons, ions, biogenic amines, hormones, glycoproteins, olfactive and gustative molecules, etc. A GPCR is composed of 7 transmembrane a-helices linked together by intra- and extracellular loops. These 7 a-helices delineate a cavity. Ligands bind either in this cavity, or on the extracellular loops, activating the receptor that will link to a G-protein inside the cell, initiating a cascade of secondary messengers. Up to now, the bovine rhodopsin is the only known crystallographic structure, bringing a valuable structural information. GPCRs present a big pharmacological interest. Their diversity and the numerous functions they control get them involved in numerous pathologies. More than 30% of new commercialized drugs target GPCRs. Furthermore, many GPCRs are still orphan, without known ligand, and hence constitute potential pharmacological targets. GPCRs form a superfamily of more than one thousand members. Three out of four are involved in the perception of olfactive molecules. The binding site of most GPCRs is located in the transmembrane cavity; but for some of them, it is located outside the membrane and the ligand binds an extracellular loop. But even for these cases, all receptors show a transmembrane cavity on which we will concentrate during this work. Several classifications of GPCRs have been proposed: phylogenetic classifications, or based on statistical automata, or on physicochemical fingerprints, or based on their amino-acid composition. But none of them takes into account precisely the pharmacolo- gical point of view of the ligand (the drug). That's why we propose a new classification of GPCRs which is pharmacologically- oriented. It is based on the study of some residues of the transmembrane cavity supposed to be critical and to interact with the ligand. We start from a dataset of 369 sequences of human nonolfactory GPCRs, as \clean" as possible. Then we align automatically the transmembrane parts, but with a manual check following. Then we extract 30 critical residues by studying the cavity of bovine rhodopsin, which transmembrane parts share a high identity score with the human rhodopsin (94%). We suppose that these 30 residues are critical for all GPCRs, i.e. the fold of all GPCRs is overall conserved. This hypothesis is supported by several publications. Eventually, we classify these sequences by an agglo merative hierarchical clustering algorithm (UPGMA). The distances between sequences are simply the identity scores between them. A bootstrap procedure brings a statistical support to the classification, that leads to 22 well-defined clusters. Our classification agrees the recently published GRAFS classification (a phylogenetic analysis of 342 human non-olfactory GPCRs): resulting clusters correspond to already identified families and subfamilies, with slight differences. This classi cation can be applied to nd new targets to ligands that share some common substructures (called priviledged structures). We start from ligands with known receptors, we seek among the 30 critical residues of these receptors those that participate to the binding. Then we seek for other receptors that share the same residues. We can eventually propose these new receptors as putative targets for the ligands we started with. A second straightforward application of the classification is the deorphanization of receptors, i.e. the discovery of a fisrt ligand for an orphan receptor. We propose, as a starting point for an orphan receptor, the known ligands of the receptors of the same cluster. The relevence of a classification based on the transmembrane cavities for receptors which binding site is not the this cavity is questionable. However we find that even the families for which the ligand bind outside the membrane (secretin and glutamate families) are well identified and well separated. In a second time, we built another classification of GPCRs, based on 3D homology models rather than on sequences. To simplify, we try to take the point of view of a ligand inside a cavity. We put a conceptual sphere, that stands for the ligand, at the center of gravity of the cavity. This sphere is tessellated into 80 triangles, with same size, homogenously dispatched. Then we project from the b carbon of the residues some phy sicochemical and geometrical information into the triangles. Eventually we compare the spheres, each comparison gives a score used to build a distance matrix and a classification with the same method as for the previous one. This method is quite fuzzy (low resolution discretization, projection from the b carbons and not from all atoms) to hide the errors due to modelling. The models are prealigned on the cristal structure of bovine rhodopsin. But unfortu- nately these alignments were not precise enough to build a distance matrix. So we coded a structural alignment tool to refine the alignments. This tool is guided by the previsouly- described score to find an alignment between two cavities. The best score gives the output alignment. The resulting clustering is homogenous with the previous one, but the number of non-classified receptors (singletons) is higher. Interestingly, the receptor DUFFY, not classified by our previous clustering, is classified here in the Chemokine cluster, in agree- ment with the Swiss-Prot database classification. Our classification leads to a structural alignment tool adapted to work on models, but that can also work on crystal structures

Thèses-Unistra : thèses et mémoires électroniques de l'Université de Strasbourg