Search CORE

4,575 research outputs found

Functional classification of G-Protein coupled receptors, based on their specific ligand coupling patterns

Author: Bakır Burcu
Sezerman Uğur
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2006
Field of study

Functional identification of G-Protein Coupled Receptors (GPCRs) is one of the current focus areas of pharmaceutical research. Although thousands of GPCR sequences are known, many of them re- main as orphan sequences (the activating ligand is unknown). Therefore, classification methods for automated characterization of orphan GPCRs are imperative. In this study, for predicting Level 2 subfamilies of Amine GPCRs, a novel method for obtaining fixed-length feature vectors, based on the existence of activating ligand specific patterns, has been developed and utilized for a Support Vector Machine (SVM)-based classification. Exploiting the fact that there is a non-promiscuous relationship between the specific binding of GPCRs into their ligands and their functional classification, our method classifies Level 2 subfamilies of Amine GPCRs with a high predictive accuracy of 97.02% in a ten-fold cross validation test. The presented machine learning approach, bridges the gulf between the excess amount of GPCR sequence data and their poor functional characterization

Sabanci University Research Database

Prediction and classification for GPCR sequences based on ligand specific features

Author: F. Horn
G.E. Tusnády
K.R. Sreekumar
M. Bouvier
R. Karchin
S. Altshul
T. Gudermann
W. Pearson
Y. Huang
Publication venue: Lecture Notes in Computer Science,
Publication date: 01/01/2006
Field of study

Functional identification of G-Protein Coupled Receptors (GPCRs) is one of the current focus areas of pharmaceutical research. Although thousands of GPCR sequences are known, many of them are orphan sequences (the activating ligand is unknown). Therefore, classification methods for automated characterization of orphan GPCRs are imperative. In this study, for predicting Level 1 subfamilies of GPCRs, a novel method for obtaining class specific features, based on the existence of activating ligand specific patterns, has been developed and utilized for a majority voting classification. Exploiting the fact that there is a non-promiscuous relationship between the specific binding of GPCRs into their ligands and their functional classification, our method classifies Level 1 subfamilies of GPCRs with a high predictive accuracy between 99% and 87% in a three-fold cross validation test. The method also tells us which motifs are significant for class determination which has important design implications. The presented machine learning approach, bridges the gulf between the excess amount of GPCR sequence data and their poor functional characterization

Crossref

Sabanci University Research Database

On the hierarchical classification of G Protein-Coupled Receptors

Author: A. A. Freitas
A. Secker
Attwood
Bhasin
Bhasin
Bissantz
Cardoso
Christopoulos
D. R. Flower
Das
Davies
Flower
Flower
Foord
Gether
Gloriam
Guo
Horn
H bert
J. Timmis
Karchin
Keerthi
Klabunde
Kolakowski
Lapinsh
M. Mendao
M. N. Davies
Milligan
Papasaikas
Prabhu
Sandberg
Schi th
Publication venue: 'Oxford University Press (OUP)'
Publication date: 22/10/2007
Field of study

Motivation: G protein-coupled receptors (GPCRs) play an important role in many physiological systems by transducing an extracellular signal into an intracellular response. Over 50% of all marketed drugs are targeted towards a GPCR. There is considerable interest in developing an algorithm that could effectively predict the function of a GPCR from its primary sequence. Such an algorithm is useful not only in identifying novel GPCR sequences but in characterizing the interrelationships between known GPCRs. Results: An alignment-free approach to GPCR classification has been developed using techniques drawn from data mining and proteochemometrics. A dataset of over 8000 sequences was constructed to train the algorithm. This represents one of the largest GPCR datasets currently available. A predictive algorithm was developed based upon the simplest reasonable numerical representation of the protein's physicochemical properties. A selective top-down approach was developed, which used a hierarchical classifier to assign sequences to subdivisions within the GPCR hierarchy. The predictive performance of the algorithm was assessed against several standard data mining classifiers and further validated against Support Vector Machine-based GPCR prediction servers. The selective top-down approach achieves significantly higher accuracy than standard data mining methods in almost all cases

CiteSeerX

Crossref

Aberystwyth Research Portal

Kent Academic Repository

Systematic analysis of primary sequence domain segments for the discrimination between class C GPCR subtypes

Author: Alquézar Mancho René
Giraldo Arjonilla Jesús
König Caroline
Vellido Alcacena Alfredo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

G-protein-coupled receptors (GPCRs) are a large and diverse super-family of eukaryotic cell membrane proteins that play an important physiological role as transmitters of extracellular signal. In this paper, we investigate Class C, a member of this super-family that has attracted much attention in pharmacology. The limited knowledge about the complete 3D crystal structure of Class C receptors makes necessary the use of their primary amino acid sequences for analytical purposes. Here, we provide a systematic analysis of distinct receptor sequence segments with regard to their ability to differentiate between seven class C GPCR subtypes according to their topological location in the extracellular, transmembrane, or intracellular domains. We build on the results from the previous research that provided preliminary evidence of the potential use of separated domains of complete class C GPCR sequences as the basis for subtype classification. The use of the extracellular N-terminus domain alone was shown to result in a minor decrease in subtype discrimination in comparison with the complete sequence, despite discarding much of the sequence information. In this paper, we describe the use of Support Vector Machine-based classification models to evaluate the subtype-discriminating capacity of the specific topological sequence segments.Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Recommended from our members

Global Analysis of Predicted G Protein-Coupled Receptor Genes in the Filamentous Fungus, Neurospora crassa.

Author: Ahrendt Steven R
Barsoum Andrew H
Borkovich Katherine A
Cabrera Ilva E
Cao Jackie
Carrillo Alexander J
Castillo Ronald
Chen Wan-Ching
Davidson Liza M
Dinkchian Alex
Guerrero Nayeli
Kim Stephanie
Kitada Sho M
Krystofova Svetlana
Lai Taffani H
Li Liande
Lim Andrew
Mach Ashley
Malekyan Cristin
Michkov Alexander V
Moua Toua R
Pacentine Itallia V
Servin Jacqueline A
Torres Carlos Rojas
Yamamoto Alaina
Publication venue: eScholarship, University of California
Publication date: 01/10/2015
Field of study

G protein-coupled receptors (GPCRs) regulate facets of growth, development, and environmental sensing in eukaryotes, including filamentous fungi. The largest predicted GPCR class in these organisms is the Pth11-related, with members similar to a protein required for disease in the plant pathogen Magnaporthe oryzae. However, the Pth11-related class has not been functionally studied in any filamentous fungal species. Here, we analyze phenotypes in available mutants for 36 GPCR genes, including 20 Pth11-related, in the model filamentous fungus Neurospora crassa. We also investigate patterns of gene expression for all 43 predicted GPCR genes in available datasets. A total of 17 mutants (47%) possessed at least one growth or developmental phenotype. We identified 18 mutants (56%) with chemical sensitivity or nutritional phenotypes (11 uniquely), bringing the total number of mutants with at least one defect to 28 (78%), including 15 mutants (75%) in the Pth11-related class. Gene expression trends for GPCR genes correlated with the phenotypes observed for many mutants and also suggested overlapping functions for several groups of co-transcribed genes. Several members of the Pth11-related class have phenotypes and/or are differentially expressed on cellulose, suggesting a possible role for this gene family in plant cell wall sensing or utilization

eScholarship - University of California

Analysis of class C G-protein coupled receptors using supervised classification methods

Author: König Caroline Leonore
Publication venue: Universitat Politècnica de Catalunya
Publication date: 30/10/2018
Field of study

G protein-coupled receptors (GPCRs) are cell membrane proteins with a key role in regulating the function of cells. This is the result of their ability to transmit extracellular signals, which makes them relevant for pharmacology and has led, over the last decade, to active research in the field of proteomics. The current thesis specifically targets class C of GPCRs, which are relevant in therapies for various central nervous system disorders, such as Alzheimer’s disease, anxiety, Parkinson’s disease and schizophrenia. The investigation of protein functionality often relies on the knowledge of crystal three dimensional (3-D) structures, which determine the receptor’s ability for ligand binding responsible for the activation of certain functionalities in the protein. The structural information is therefore paramount, but it is not always known or easily unravelled, which is the case of eukaryotic cell membrane proteins such as GPCRs. In the face of the lack of information about the 3-D structure, research is often bound to the analysis of the primary amino acid sequences of the proteins, which are commonly known and available from curated databases. Much research on sequence analysis has focused on the quantitative analysis of their aligned versions, although, recently, alternative approaches using machine learning techniques for the analysis of alignment-free sequences have been proposed. In this thesis, we focus on the differentiation of class C GPCRs into functional and structural related subgroups based on the alignment-free analysis of their sequences using supervised classification models. In the first part of the thesis, the main topic is the construction of supervised classification models for unaligned protein sequences based on physicochemical transformations and n-gram representations of their amino acid sequences. These models are useful to assess the internal data quality of the externally labeled dataset and to manage the label noise problem from a data curation perspective. In its second part, the thesis focuses on the analysis of the sequences to discover subtype- and region-speci¿c sequence motifs. For that, we carry out a systematic analysis of the topological sequence segments with supervised classification models and evaluate the subtype discrimination capability of each region. In addition, we apply different types of feature selection techniques to the n-gram representation of the amino acid sequence segments to find subtype and region specific motifs. Finally, we compare the findings of this motif search with the partially known 3D crystallographic structures of class C GPCRs.Los receptores acoplados a proteínas G (GPCRs) son proteínas de la membrana celular con un papel clave para la regulación del funcionamiento de una célula. Esto es consecuencia de su capacidad de transmisión de señales extracelulares, lo que les hace relevante en la farmacología y que ha llevado a investigaciones activas en la última década en el área de la proteómica. Esta tesis se centra específicamente en la clase C de GPCRs, que son relevante para terapias de varios trastornos del sistema nervioso central, como la enfermedad de Alzheimer, ansiedad, enfermedad de Parkinson y esquizofrenia. La investigación de la funcionalidad de proteínas muchas veces se basa en el conocimiento de la estructura cristalina tridimensional (3-D), que determina la capacidad del receptor para la unión con ligandos, que son responsables para la activación de ciertas funcionalidades en la proteína. El análisis de secuencias de amino ácidos se ha centrado en muchas investigaciones en el análisis cuantitativo de las versiones alineados de las secuencias, aunque, recientemente, se han propuesto métodos alternativos usando métodos de aprendizaje automático aplicados a las versiones no-alineadas de las secuencias. En esta tesis, nos centramos en la diferenciación de los GPCRs de la clase C en subgrupos funcionales y estructurales basado en el análisis de las secuencias no-alineadas utilizando modelos de clasificación supervisados. Estos modelos son útiles para evaluar la calidad interna de los datos a partir del conjunto de datos etiquetados externamente y para gestionar el problema del 'ruido de datos' desde la perspectiva de la curación de datos. En su segunda parte, la tesis enfoca el análisis de las secuencias para descubrir motivos de secuencias específicos a nivel de subtipo o región. Para eso, llevamos a cabo un análisis sistemático de los segmentos topológicos de la secuencia con modelos supervisados de clasificación y evaluamos la capacidad de discriminar entre subtipos de cada región. Adicionalmente, aplicamos diferentes tipos de técnicas de selección de atributos a las representaciones mediante n-gramas de los segmentos de secuencias de amino ácidos para encontrar motivos específicos a nivel de subtipo y región. Finalmente, comparamos los descubrimientos de la búsqueda de motivos con las estructuras cristalinas parcialmente conocidas para la clase C de GPCRs

Tesis Doctorals en Xarxa

A database for G proteins and their interaction with GPCRs

Author: Bagos Pantelis G
Elefsinioti Antigoni L
Hamodrakas Stavros J
Spyropoulos Ioannis C
Publication venue: BioMed Central
Publication date: 01/12/2004
Field of study

BACKGROUND: G protein-coupled receptors (GPCRs) transduce signals from extracellular space into the cell, through their interaction with G proteins, which act as switches forming hetero-trimers composed of different subunits (α,β,γ). The α subunit of the G protein is responsible for the recognition of a given GPCR. Whereas specialised resources for GPCRs, and other groups of receptors, are already available, currently, there is no publicly available database focusing on G Proteins and containing information about their coupling specificity with their respective receptors. DESCRIPTION: gpDB is a publicly accessible G proteins/GPCRs relational database. Including species homologs, the database contains detailed information for 418 G protein monomers (272 Gα, 87 Gβ and 59 Gγ) and 2782 GPCRs sequences belonging to families with known coupling to G proteins. The GPCRs and the G proteins are classified according to a hierarchy of different classes, families and sub-families, based on extensive literature searchs. The main innovation besides the classification of both G proteins and GPCRs is the relational model of the database, describing the known coupling specificity of the GPCRs to their respective α subunit of G proteins, a unique feature not available in any other database. There is full sequence information with cross-references to publicly available databases, references to the literature concerning the coupling specificity and the dimerization of GPCRs and the user may submit advanced queries for text search. Furthermore, we provide a pattern search tool, an interface for running BLAST against the database and interconnectivity with PRED-TMR, PRED-GPCR and TMRPres2D. CONCLUSIONS: The database will be very useful, for both experimentalists and bioinformaticians, for the study of G protein/GPCR interactions and for future development of predictive algorithms. It is available for academics, via a web browser at the URL

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Residue conservation and dimer-interface analysis of olfactory receptor molecular models

Author: Fukui Kazuhiko
Harini Krishnan
Kannan Sankar
Nemoto Wataru
Sowdhamini Ramanathan
Publication venue: Lorem Ipsum Press
Publication date: 21/10/2012
Field of study

Olfactory Receptors (ORs) are members of the Class A rhodopsin like G-protein coupled receptors (GPCRs) which are the initial players in the signal transduction cascade, leading to the generation of nerve impulses transmitted to the brain and resulting in the detection of odorant molecules. Despite the accumulation of thousands of olfactory receptor sequences, no crystal structures of ORs are known tο date. However, the recent availability of crystallographic models of a few GPCRs allows us to generate homology models of ORs and analyze their amino acid patterns, as there is a huge diversity in OR sequences. In this study, we have generated three-dimensional models of 100 representative ORs from Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans and Sacharomyces cerevisiae which were selected on the basis of a composite classification scheme and phylogenetic analysis. The crystal structure of bovine rhodopsin was used as a template and it was found that the full-length models have more than 90% of their residues in allowed regions of the Ramachandran plot. The structures were further used for analysis of conserved residues in the transmembrane and extracellular loop regions in order to identify functionally important residues. Several ORs are known to be functional as dimers and hence dimer interfaces were predicted for OR models to analyse their oligomeric functional state

Journal of Molecular Biochemistry

Multidimensional Scaling Reveals the Main Evolutionary Pathways of Class A G-Protein-Coupled Receptors

Author: A Gogos
A Rokas
A Rokas
B Wu
C Kuiken
C Tuffley
David Thybert
DE Gloriam
DG Higgins
DJ Smith
DK Vassilatis
E Susko
F Lu
F Murtagh
G Blackshields
G Casari
H Abdi
H Abdi
H Rompler
HC Wang
Hervé Abdi
I Domazet
I Kass
IG Choi
J Devillé
J Hou
J Hou
J Tzeng
JC Gower
JH Park
JS Surgand
Julien Pelé
JW DeLano
K Palczewski
K Ye
KB Nicholas
KJ Woolley
Kolakowski LF Jr
M Anctil
M Greenacre
M Nei
MA Larkin
Marie Chabbert
Matthieu Moreau
MC Peeters
MW Trosset
P Rousseeuw
P Scheerer
R Fredriksson
R Fredriksson
RA Studer
RP Metpally
S Yohannan
S Yohannan
SC Sealfon
SG Rasmussen
T Haitina
U Gether
V Cherezov
Vladimir N. Uversky
VN Grishin
W Shi
WM Fitch
WS Togerson
WS Valdar
Y Takane
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Class A G-protein-coupled receptors (GPCRs) constitute the largest family of transmembrane receptors in the human genome. Understanding the mechanisms which drove the evolution of such a large family would help understand the specificity of each GPCR sub-family with applications to drug design. To gain evolutionary information on class A GPCRs, we explored their sequence space by metric multidimensional scaling analysis (MDS). Three-dimensional mapping of human sequences shows a non-uniform distribution of GPCRs, organized in clusters that lay along four privileged directions. To interpret these directions, we projected supplementary sequences from different species onto the human space used as a reference. With this technique, we can easily monitor the evolutionary drift of several GPCR sub-families from cnidarians to humans. Results support a model of radiative evolution of class A GPCRs from a central node formed by peptide receptors. The privileged directions obtained from the MDS analysis are interpretable in terms of three main evolutionary pathways related to specific sequence determinants. The first pathway was initiated by a deletion in transmembrane helix 2 (TM2) and led to three sub-families by divergent evolution. The second pathway corresponds to the differentiation of the amine receptors. The third pathway corresponds to parallel evolution of several sub-families in relation with a covarion process involving proline residues in TM2 and TM5. As exemplified with GPCRs, the MDS projection technique is an important tool to compare orthologous sequence sets and to help decipher the mutational events that drove the evolution of protein families

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

HAL-Inserm

PubMed Central

Okina