Search CORE

3 research outputs found

Improved general regression network for protein domain boundary prediction

Author: A Ceroni
A Vieira
Abdur R Sikder
AK Jain
Albert Y Zomaya
AR Sikder
AR Sikder
Bing Bing Zhou
C Chothia
C Civera
CC Lee
CR Robinson
DB Wetlaufer
FMG Pearl
G Pollastri
G Pollastri
HC Van Leeuwen
HM Berman
J Chen
J Cheng
J Liu
J Sim
JCB Melo
JE Gewehr
JS Richardson
JSR Jang
M Dumontier
M Dumontier
M Suyama
MJ Lehtinen
N Nagarajan
OV Galzitskaya
P Baldi
P Bork
Paul D Yoo
RA George
RE Schapire
RL Marsden
RR Copley
RR Joshi
RS Gokhale
S Prompramote
S Veretnik
SF Altschul
TA Holland
Y Freund
Publication venue: BioMed Central
Publication date: 13/02/2008
Field of study

Background: Protein domains present some of the most useful information that can be used to understand protein structure and functions. Recent research on protein domain boundary prediction has been mainly based on widely known machine learning techniques, such as Artificial Neural Networks and Support Vector Machines. In this study, we propose a new machine learning model (IGRN) that can achieve accurate and reliable classification, with significantly reduced computations. The IGRN was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. Results: The proposed model achieved average prediction accuracy of 67% on the Benchmark_2 dataset for domain boundary identification in multi-domains proteins and showed superior predictive performance and generalisation ability among the most widely used neural network models. With the CASP7 benchmark dataset, it also demonstrated comparable performance to existing domain boundary predictors such as DOMpro, DomPred, DomSSEA, DomCut and DomainDiscovery with 70.10% prediction accuracy. Conclusion: The performance of proposed model has been compared favourably to the performance of other existing machine learning based methods as well as widely known domain boundary predictors on two benchmark datasets and excels in the identification of domain boundaries in terms of model bias, generalisation and computational requirements. © 2008 Yoo et al; licensee BioMed Central Ltd

Crossref

Michigan Technological University

PubMed Central

Bioinformatics research in the Asia Pacific: a 2007 update

Author: A Madhumalar
BC Kim
C Wang
CJO Baker
D Gilbert
DT Singh
GL Zhang
H Sugawara
H Zhao
KH Choo
L Kong
M Ganapathiraju
Michael Gribskov
N Yanamala
O Miotto
O Miotto
PD Yoo
Q Xu
R Ördög
RTH Tsai
S Dastmalchi
S Miyano
S Ranganathan
S Ranganathan
S Ranganathan
SH Chen
SH Nagaraj
Shoba Ranganathan
Tin Wee Tan
U Sangket
V Chelliah
WY Kim
YP Lim
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

We provide a 2007 update on the bioinformatics research in the Asia-Pacific from the Asia Pacific Bioinformatics Network (APBioNet), Asia's oldest bioinformatics organisation set up in 1998. From 2002, APBioNet has organized the first International Conference on Bioinformatics (InCoB) bringing together scientists working in the field of bioinformatics in the region. This year, the InCoB2007 Conference was organized as the 6th annual conference of the Asia-Pacific Bioinformatics Network, on Aug. 27–30, 2007 at Hong Kong, following a series of successful events in Bangkok (Thailand), Penang (Malaysia), Auckland (New Zealand), Busan (South Korea) and New Delhi (India). Besides a scientific meeting at Hong Kong, satellite events organized are a pre-conference training workshop at Hanoi, Vietnam and a post-conference workshop at Nansha, China. This Introduction provides a brief overview of the peer-reviewed manuscripts accepted for publication in this Supplement. We have organized the papers into thematic areas, highlighting the growing contribution of research excellence from this region, to global bioinformatics endeavours

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Purdue E-Pubs

Macquarie University ResearchOnline

ScholarBank@NUS

Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding

Author: Andrew
Anfinsen
Aurora
Bacardit
Bang
Brent Wathen
Broome
Bu
Bédard
Chan
Chiti
Chou
Chou
Cohen
Colloc’h
Cootes
Costantini
Crasto
Daffner
Dasgupta
de Brevern
Dill
Dill
Dill
Doig
Doig
Dong
Dunker
Dunker
Eaton
Edgar
Englander
Ermolenko
Etchebest
Fernández-Recio
Fersht
Fetrow
Fink
Fonseca
Fooks
Galzitskaya
Gruebele
Gu
Gunasekaran
Guruprasad
Hutchinson
Hutchinson
Jiménez
Jones
Kabsch
Kapp
Karplus
Kauzmann
Klingler
Krantz
Kryshtafovych
Levinthal
Levitt
Lifson
Lifson
Liu
Luo
Mahalanobis
Maity
Mandel-Gutfreund
Marqusee
Miyazaki
Murphy
Muñoz
Nakashima
Noguchi
Onuchic
Pal
Penel
Penel
Presta
Richardson
Rigden
Romero
Rose
Rossmann
Sagermann
Santiveri
Schueler-Furman
Schwartz
Schwartz
Serrano
Serrano
Shannon
Shortle
Strait
Suyama
Swanson
Unger
Uversky
Vazquez
Ventura
Viguera
Vincent
von Heijne
Walther
Wang
Wang
Weiss
West
Wetlaufer
Wheelan
White
Wilmot
Wilson
Wolynes
Wouters
Wright
Xiong
Ye
Yon
Yoo
Zhu
Zongchao Jia
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/04/2009
Field of study

The exponential growth over the past several decades in the quantity of both primary sequence data available and the number of protein structures determined has provided a wealth of information describing the relationship between protein primary sequence and tertiary structure. This growing repository of data has served as a prime source for statistical analysis, where underlying relationships between patterns of amino acids and protein structure can be uncovered. Here, we survey the main statistical approaches that have been used for identifying patterns within protein sequences, and discuss sequence pattern research as it relates to both secondary and tertiary protein structure. Limitations to statistical analyses are discussed, and a context for their role within the field of protein folding is given. We conclude by describing a novel statistical study of residue patterning in β-strands, which finds that hydrophobic (i,i+2) pairing in β-strands occurs more often than expected at locations near strand termini. Interpretations involving β-sheet nucleation and growth are discussed

Crossref

Directory of Open Access Journals

PubMed Central