Search CORE

12,451 research outputs found

Functional discrimination of membrane proteins using machine learning techniques

Author: AG Garrow
B Rost
D Fu
DP Chimento
DP Chimento
EL Borths
G von Heijne
GE Tusnady
IH Witten
J Abramson
M Michael Gromiha
MH Saier Jr
MH Saier Jr
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
NK Natt
PG Bagos
PL Martelli
Q Ren
R Dutzler
S Murakami
SF Altschul
T Hirokawa
T Nogi
Y Huang
YD Cai
YH Taguchi
Yukimitsu Yabuki
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Discriminating membrane proteins based on their functions is an important task in genome annotation. In this work, we have analyzed the characteristic features of amino acid residues in membrane proteins that perform major functions, such as channels/pores, electrochemical potential-driven transporters and primary active transporters. Results We observed that the residues Asp, Asn and Tyr are dominant in channels/pores whereas the composition of hydrophobic residues, Phe, Gly, Ile, Leu and Val is high in electrochemical potential-driven transporters. The composition of all the amino acids in primary active transporters lies in between other two classes of proteins. We have utilized different machine learning algorithms, such as, Bayes rule, Logistic function, Neural network, Support vector machine, Decision tree etc. for discriminating these classes of proteins. We observed that most of the algorithms have discriminated them with similar accuracy. The neural network method discriminated the channels/pores, electrochemical potential-driven transporters and active transporters with the 5-fold cross validation accuracy of 64% in a data set of 1718 membrane proteins. The application of amino acid occurrence improved the overall accuracy to 68%. In addition, we have discriminated transporters from other α-helical and β-barrel membrane proteins with the accuracy of 85% using k-nearest neighbor method. The classification of transporters and all other proteins (globular and membrane) showed the accuracy of 82%. Conclusion The performance of discrimination with amino acid occurrence is better than that with amino acid composition. We suggest that this method could be effectively used to discriminate transporters from all other globular and membrane proteins, and classify them into channels/pores, electrochemical and active transporters.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Knowledge-based energy functions for computational studies of proteins

Author: A. Ben-Naim
A. Godzik
A. Godzik
A. Rossi
A.J. Bordner
A.V. Finkelstein
B. Fain
B. Krishnamoorthy
B. Kuhlman
B. Schölkopf
B.H. Park
B.I. Dahiyat
B.J. McConkey
B.O. Mitchell
C. Anfinsen
C. Carter Jr.
C. Czaplewski
C. Hoppe
C. Hu
C. Micheletti
C. Papadimitriou
C. Zhang
C. Zhang
C. Zhang
C. Zhang
C. Zhang
C.A. Rohl
C.B. Anfinsen
C.M.R Lemer
C.S. Mészáros
D. Gilis
D. Gilis
D. Gilis
D. Tobi
D. Xu
E. Venclovas
E.I. Shakhnovich
E.I. Shakhnovich
F.A. Momany
H. Dobbs
H. Edelsbrunner
H. Gan
H. Li
H. Li
H. Lu
H. Zhou
H.S. Chan
I. Muegge
J. Khatun
J. Liang
J.A. Kocher
J.A. Rank
J.M. Deutsch
J.R. Bienkowska
K. Nishikawa
K. Sale
K.H. Lee
K.K. Koretke
K.K. Koretke
K.T. Simons
L. Adamian
L. Adamian
L. Adamian
L.A. Mirny
L.L. Looger
L.M. Amzel
M. Karplus
M. Levitt
M. Vendruscolo
M. Vendruscolo
M.H. Hao
M.H. Hao
M.J. Sippl
M.J. Sippl
M.J. Sippl
M.P. Eastwood
M.R. Betancourt
M.S. Friedrichs
N. Karmarkar
N.V. Buchete
N.V. Buchete
P. Koehl
P. Koehl
P.D. Thomas
P.D. Thomas
P.G. Wolynes
P.J. Munson
R. Goldstein
R. Guerois
R. Jackups Jr.
R. Janicke
R. Méndez
R. Samudrala
R. Samudrala
R.B. Hill
R.I. Dima
R.J. Vanderbei
R.K. Singh
R.L. Jernigan
R.S. DeWitte
S. Liu
S. Miyazawa
S. Miyazawa
S. Miyazawa
S. Shimizu
S. Shimizu
S. Tanaka
S.J. Wodak
T. Kortemme
T. Kortemme
T. Kortemme
T. Lazaridis
T.L. Chiu
U. Bastolla
U. Bastolla
V. Vapnik
V. Vapnik
V.N. Maiorov
W.P. Russ
X. Li
X. Li
Y. Duan
Y. Park
Y. Xia
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/01/2006
Field of study

This chapter discusses theoretical framework and methods for developing knowledge-based potential functions essential for protein structure prediction, protein-protein interaction, and protein sequence design. We discuss in some details about the Miyazawa-Jernigan contact statistical potential, distance-dependent statistical potentials, as well as geometric statistical potentials. We also describe a geometric model for developing both linear and non-linear potential functions by optimization. Applications of knowledge-based potential functions in protein-decoy discrimination, in protein-protein interactions, and in protein design are then described. Several issues of knowledge-based potential functions are finally discussed.Comment: 57 pages, 6 figures. To be published in a book by Springe

arXiv.org e-Print Archive

Crossref

Functional classification of G-Protein coupled receptors, based on their specific ligand coupling patterns

Author: Bakır Burcu
Sezerman Uğur
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2006
Field of study

Functional identification of G-Protein Coupled Receptors (GPCRs) is one of the current focus areas of pharmaceutical research. Although thousands of GPCR sequences are known, many of them re- main as orphan sequences (the activating ligand is unknown). Therefore, classification methods for automated characterization of orphan GPCRs are imperative. In this study, for predicting Level 2 subfamilies of Amine GPCRs, a novel method for obtaining fixed-length feature vectors, based on the existence of activating ligand specific patterns, has been developed and utilized for a Support Vector Machine (SVM)-based classification. Exploiting the fact that there is a non-promiscuous relationship between the specific binding of GPCRs into their ligands and their functional classification, our method classifies Level 2 subfamilies of Amine GPCRs with a high predictive accuracy of 97.02% in a ten-fold cross validation test. The presented machine learning approach, bridges the gulf between the excess amount of GPCR sequence data and their poor functional characterization

Sabanci University Research Database

Systematic analysis of primary sequence domain segments for the discrimination between class C GPCR subtypes

Author: Alquézar Mancho René
Giraldo Arjonilla Jesús
König Caroline
Vellido Alcacena Alfredo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

G-protein-coupled receptors (GPCRs) are a large and diverse super-family of eukaryotic cell membrane proteins that play an important physiological role as transmitters of extracellular signal. In this paper, we investigate Class C, a member of this super-family that has attracted much attention in pharmacology. The limited knowledge about the complete 3D crystal structure of Class C receptors makes necessary the use of their primary amino acid sequences for analytical purposes. Here, we provide a systematic analysis of distinct receptor sequence segments with regard to their ability to differentiate between seven class C GPCR subtypes according to their topological location in the extracellular, transmembrane, or intracellular domains. We build on the results from the previous research that provided preliminary evidence of the potential use of separated domains of complete class C GPCR sequences as the basis for subtype classification. The use of the extracellular N-terminus domain alone was shown to result in a minor decrease in subtype discrimination in comparison with the complete sequence, despite discarding much of the sequence information. In this paper, we describe the use of Support Vector Machine-based classification models to evaluate the subtype-discriminating capacity of the specific topological sequence segments.Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Predicting the outer membrane proteome of Pasteurella multocida based on consensus prediction enhanced by results integration and manual confirmation

Author: Burchmore R.
Davies R.
E-komon T.
Herzyk P.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Background Outer membrane proteins (OMPs) of Pasteurella multocida have various functions related to virulence and pathogenesis and represent important targets for vaccine development. Various bioinformatic algorithms can predict outer membrane localization and discriminate OMPs by structure or function. The designation of a confident prediction framework by integrating different predictors followed by consensus prediction, results integration and manual confirmation will improve the prediction of the outer membrane proteome. Results In the present study, we used 10 different predictors classified into three groups (subcellular localization, transmembrane β-barrel protein and lipoprotein predictors) to identify putative OMPs from two available P. multocida genomes: those of avian strain Pm70 and porcine non-toxigenic strain 3480. Predicted proteins in each group were filtered by optimized criteria for consensus prediction: at least two positive predictions for the subcellular localization predictors, three for the transmembrane β-barrel protein predictors and one for the lipoprotein predictors. The consensus predicted proteins were integrated from each group into a single list of proteins. We further incorporated a manual confirmation step including a public database search against PubMed and sequence analyses, e.g. sequence and structural homology, conserved motifs/domains, functional prediction, and protein-protein interactions to enhance the confidence of prediction. As a result, we were able to confidently predict 98 putative OMPs from the avian strain genome and 107 OMPs from the porcine strain genome with 83% overlap between the two genomes. Conclusions The bioinformatic framework developed in this study has increased the number of putative OMPs identified in P. multocida and allowed these OMPs to be identified with a higher degree of confidence. Our approach can be applied to investigate the outer membrane proteomes of other Gram-negative bacteria

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Enlighten

Relationship between amino acid properties and functional parameters in olfactory receptors and discrimination of mutants with enhanced specificity

Author: A Kato
A Sali
AF Carey
AM Waterhouse
B Lee
C Jaén
D Kuang
GE Tusndy
H Zhao
IH Witten
J Overington
JF Xia
JF Xia
K Harini
K Mizuguchi
K Palczewski
K Schmiedeberg
K Tomii
Kazuhiko Fukui
L Abuin
L Ezkurdia
LB Buck
LT Huang
LT Huang
M Michael Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
MM Gromiha
O Baud
O Man
O Yanay
P Luu
P Mombaerts
PS Grewal
R Sowdhamini
RA Hall
RA Laskowski
S Katada
T Olender
W Pirovano
WS Leal
Y Ben-Chaim
YY Ou
ZH You
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Finding functional motifs in protein sequences with deep learning and natural language models

Author: Casadio R.
Martelli P. L.
Savojardo C.
Publication venue
Publication date: 01/01/2023
Field of study

Recently, prediction of structural/functional motifs in protein sequences takes advantage of powerful machine learning based approaches. Protein encoding adopts protein language models overpassing standard procedures. Different combinations of machine learning and encoding schemas are available for predicting different structural/functional motifs. Particularly interesting is the adoption of protein language models to encode proteins in addition to evolution information and physicochemical parameters. A thorough analysis of recent predictors developed for annotating transmembrane regions, sorting signals, lipidation and phosphorylation sites allows to investigate the state-of-the-art focusing on the relevance of protein language models for the different tasks. This highlights that more experimental data are necessary to exploit available powerful machine learning methods

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Image informatics strategies for deciphering neuronal network connectivity

Author: A Dani
A Rodriguez
A Rodriguez
A Rodriguez
B Deleglise
B Zhang
BA Wilt
C Sala
CG Dotti
D Cai
D Gerlich
D Popova
D Prodanov
D Smetters
DA Dombeck
DA Dombeck
DW Godwin
E Cohen
E Fluhler
E Meijering
E Meijering
E Meijering
EW Dent
F Cornelissen
F Janoos
G Pani
GG Gurkoff
GP Feng
GS Belinsky
H Bading
H Chen
H Peng
H Peng
HH Piao
IR Wickersham
IY Koh
J Cheng
J Fan
J Fan
J Lammerding
J Liu
J Livet
JA Harrill
JI Luebke
JL Chen
JM Nussbaum
K Chung
K Deisseroth
K Imamura
K Takahashi
KA Al-Kofahi
KD Micheva
KE Binley
KM Brown
KO Lai
KR Kay
L Bretzner
L Feng
L Grosenick
L Jin
L Jin
LD Costa
M Meijer
M Naujock
M Papa
M Pickering
M Pool
M Wittmann
ML Narro
MS Siegel
N Heck
O Sirenko
P Chothani
P Maiti
P Penzes
P Roqué
P Sarder
P Shi
P Verstraelen
Q Li
Q Li
R Nair
R Parekh
RA McKinney
RM Paredes
S Cho
S Schmitt
SK Schmitz
SL Wearne
SY Ho
SY Kim
T Nemoto
T Zhao
TL Fletcher
VA Alvarez
W Bai
WH Vos De
Y Al-Kofahi
Y Zhang
YC Lin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Brain function relies on an intricate network of highly dynamic neuronal connections that rewires dramatically under the impulse of various external cues and pathological conditions. Among the neuronal structures that show morphologi- cal plasticity are neurites, synapses, dendritic spines and even nuclei. This structural remodelling is directly connected with functional changes such as intercellular com- munication and the associated calcium-bursting behaviour. In vitro cultured neu- ronal networks are valuable models for studying these morpho-functional changes. Owing to the automation and standardisation of both image acquisition and image analysis, it has become possible to extract statistically relevant readout from such networks. Here, we focus on the current state-of-the-art in image informatics that enables quantitative microscopic interrogation of neuronal networks. We describe the major correlates of neuronal connectivity and present workflows for analysing them. Finally, we provide an outlook on the challenges that remain to be addressed, and discuss how imaging algorithms can be extended beyond in vitro imaging studies

Crossref

Ghent University Academic Bibliography

Institutional Repository Universiteit Antwerpen

Histopathological image analysis : a review

Author: Boucheron Laura E.
Can Ali
Gurcan Metin N.
Madabhushi Anant
Rajpoot Nasir M.
Yener Bülent
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009
Field of study

Over the past decade, dramatic increases in computational power and improvement in image analysis algorithms have allowed the development of powerful computer-assisted analytical approaches to radiological data. With the recent advent of whole slide digital scanners, tissue histopathology slides can now be digitized and stored in digital image form. Consequently, digitized tissue histopathology has now become amenable to the application of computerized image analysis and machine learning techniques. Analogous to the role of computer-assisted diagnosis (CAD) algorithms in medical imaging to complement the opinion of a radiologist, CAD algorithms have begun to be developed for disease detection, diagnosis, and prognosis prediction to complement the opinion of the pathologist. In this paper, we review the recent state of the art CAD technology for digitized histopathology. This paper also briefly describes the development and application of novel image analysis technology for a few specific histopathology related problems being pursued in the United States and Europe

Crossref

PubMed Central

Warwick Research Archives Portal Repository