Article thumbnail

Multidimensional Scaling Reveals the Main Evolutionary Pathways of Class A G-Protein-Coupled Receptors

By Julien Pelé, Hervé Abdi, Matthieu Moreau, David Thybert and Marie Chabbert


Class A G-protein-coupled receptors (GPCRs) constitute the largest family of transmembrane receptors in the human genome. Understanding the mechanisms which drove the evolution of such a large family would help understand the specificity of each GPCR sub-family with applications to drug design. To gain evolutionary information on class A GPCRs, we explored their sequence space by metric multidimensional scaling analysis (MDS). Three-dimensional mapping of human sequences shows a non-uniform distribution of GPCRs, organized in clusters that lay along four privileged directions. To interpret these directions, we projected supplementary sequences from different species onto the human space used as a reference. With this technique, we can easily monitor the evolutionary drift of several GPCR sub-families from cnidarians to humans. Results support a model of radiative evolution of class A GPCRs from a central node formed by peptide receptors. The privileged directions obtained from the MDS analysis are interpretable in terms of three main evolutionary pathways related to specific sequence determinants. The first pathway was initiated by a deletion in transmembrane helix 2 (TM2) and led to three sub-families by divergent evolution. The second pathway corresponds to the differentiation of the amine receptors. The third pathway corresponds to parallel evolution of several sub-families in relation with a covarion process involving proline residues in TM2 and TM5. As exemplified with GPCRs, the MDS projection technique is an important tool to compare orthologous sequence sets and to help decipher the mutational events that drove the evolution of protein families

Topics: Research Article
Publisher: Public Library of Science
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. (2006). A chemogenomic analysis of the transmembrane binding cavity of human G-protein-coupled receptors.
  2. (2010). A complete analysis of HA and NA genes of influenza A viruses.
  3. (2003). A global representation of the protein fold space.
  4. (1995). A method to predict functional residues in proteins.
  5. (2006). A two-entropies analysis to identify functional positions in the transmembrane region of class A G protein-coupled receptors.
  6. (1986). A use for principal coordinate analysis in the comparison of protein sequences.
  7. (1968). Adding a Point to Vector Diagrams in Multivaraiate Analysis
  8. (2009). An indel in transmembrane helix 2 helps to trace the molecular evolution of class A G-protein-coupled receptors.
  9. (2005). Animal evolution and the molecular signature of radiations compressed in time.
  10. (2000). Assignment of enzyme substrate specificity by principal component analysis of aligned protein sequences: an experimental test using DNA glycosylase homologs.
  11. (2006). Bushes in the tree of life.
  12. (2007). Clustal W and Clustal X version 2.0.
  13. (2007). Correspondance analysis in practice.
  14. (2005). Correspondence Analysis and data Coding with R and Java.
  15. (2008). Crystal structure of opsin in its G-protein-interacting conformation.
  16. (2000). Crystal structure of rhodopsin: A G protein-coupled receptor.
  17. (2008). Crystal structure of the ligand-free G-protein-coupled receptor opsin.
  18. (2002). Euclidian space and grouping of biological objects.
  19. (2006). Evolution of protein structural classes and protein sequence families.
  20. (2005). Evolutionary Distance: Estimation. Encyclopedia of Life Sciences.
  21. (2005). Framework for kernel regularization with application to protein clustering.
  22. (2007). G proteincoupled time travel: evolutionary aspects of GPCR research.
  23. (1994). GCRDb: a G-protein-coupled receptor database.
  24. (1997). GeneDoc: Analysis and Visualization of Genetic Variation.
  25. (2005). Genome wide survey of G protein-coupled receptors in Tetraodon nigroviridis.
  26. (2007). High-resolution crystal structure of an engineered human beta2-adrenergic G protein-coupled receptor.
  27. (2002). Horovitz A
  28. (2011). Importance of the extracellular loops in G protein-coupled receptors for ligand recognition and receptor activation.
  29. (2010). Large-scale analysis of orthologs and paralogs under covarion-like and constant-but-different models of amino acid evolution.
  30. (2004). Mapping the antigenic and genetic evolution of influenza virus.
  31. (2007). Metric multidimensional scaling.
  32. (1998). Modeling the covarion hypothesis of nucleotide substitution.
  33. (2008). Multidimensional scaling for large genomic data sets.
  34. (2009). Multidimensional scaling.
  35. (2004). On inconsistency of the neighbor-joining, least squares, and minimum evolution estimation when substitution processes are incorrectly modeled.
  36. (2010). Principal component analysis.
  37. (2004). Proline substitutions are not easily accommodated in a membrane protein.
  38. (1971). Rate of change of concomitantly variable codons.
  39. (1995). Related contribution of specific helix 2 and 7 residues to conformational activation of the serotonin 5-HT2A receptor.
  40. (2002). Scoring residue conservation.
  41. (2007). Sequence and expression of four coral G protein-coupled receptors distinct from all classifiable members of the rhodopsin family.
  42. (2010). Sequence embedding for fast construction of guide trees for multiple sequence alignment.
  43. (1992). Sequence ordinations: a multivariate analysis approach to analysing large sequence data sets.
  44. (1987). Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis.
  45. (2011). Structure of a nanobody-stabilized active state of the beta(2) adrenoceptor.
  46. (2010). Structures of the CXCR4 chemokine GPCR with small-molecule and cyclic peptide antagonists.
  47. (2004). The evolution of transmembrane helix kinks and the structural diversity of G protein-coupled receptors.
  48. (2003). The G protein-coupled receptor repertoires of human and mouse.
  49. (2009). The G protein-coupled receptor subset of the dog genome is more similar to that in humans than rodents.
  50. (2007). The G protein-coupled receptor subset of the rat genome.
  51. (2003). The Gprotein-coupled receptors in the human genome form five main families. Phylogenetic analysis, paralogon groups, and fingerprints.
  52. (2008). The hepatitis C sequence database in Los Alamos.
  53. (2008). The out-of-sample problem for classical multidimensional scaling.
  54. (2011). The PyMOL molecular graphics system.
  55. (2005). The repertoire of G-protein-coupled receptors in fully sequenced genomes.
  56. (2009). The second transmembrane domain of the human type 1 angiotensin II receptor participates in the formation of the ligand binding pocket and undergoes integral pivoting movement during the process of receptor activation.
  57. (1958). Theory and methods of scaling.
  58. (2008). Topological estimation biases with covarion evolution.
  59. (2000). Uncovering molecular mechanisms involved in activation of G protein-coupled receptors.