Search CORE

513 research outputs found

Detailed estimation of bioinformatics prediction reliability through the Fragmented Prediction Performance Plots

Author: A Tramontano
B Rost
D Frishman
FC Bernstein
HM Berman
IH Witten
JA Cuff
JA Cuff
O Carugo
Oliviero Carugo
PY Chou
Uniprot Consortium
VA Simossis
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background An important and yet rather neglected question related to bioinformatics predictions is the estimation of the amount of data that is needed to allow reliable predictions. Bioinformatics predictions are usually validated through a series of figures of merit, like for example sensitivity and precision, and little attention is paid to the fact that their performance may depend on the amount of data used to make the predictions themselves. Results Here I describe a tool, named Fragmented Prediction Performance Plot (FPPP), which monitors the relationship between the prediction reliability and the amount of information underling the prediction themselves. Three examples of FPPPs are presented to illustrate their principal features. In one example, the reliability becomes independent, over a certain threshold, of the amount of data used to predict protein features and the intrinsic reliability of the predictor can be estimated. In the other two cases, on the contrary, the reliability strongly depends on the amount of data used to make the predictions and, thus, the intrinsic reliability of the two predictors cannot be determined. Only in the first example it is thus possible to fully quantify the prediction performance. Conclusion It is thus highly advisable to use FPPPs to determine the performance of any new bioinformatics prediction protocol, in order to fully quantify its prediction power and to allow comparisons between two or more predictors based on different types of data.</p

Crossref

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

PSP_MCSVM: brainstorming consensus prediction of protein secondary structures using two-stage multiclass support vector machines

Author: A Kloczkowski
AA Salamov
B Rost
B Rost
B Rost
C Cole
D Frishman
Dariusz Plewczynski
DG Kneller
H Lin
J Garnier
J Garnier
J Guo
JA Cuff
JF Gibrat
K Wu
LM Jonathon
M Ouali
Mahantapas Kundu
Mita Nasipuri
N Qian
P Chatterjee
Piyali Chatterjee
PY Chou
RD King
SF Altschul
Subhadip Basu
TD Jones
Publication venue: Springer-Verlag
Publication date: 01/01/2011
Field of study

Secondary structure prediction is a crucial task for understanding the variety of protein structures and performed biological functions. Prediction of secondary structures for new proteins using their amino acid sequences is of fundamental importance in bioinformatics. We propose a novel technique to predict protein secondary structures based on position-specific scoring matrices (PSSMs) and physico-chemical properties of amino acids. It is a two stage approach involving multiclass support vector machines (SVMs) as classifiers for three different structural conformations, viz., helix, sheet and coil. In the first stage, PSSMs obtained from PSI-BLAST and five specially selected physicochemical properties of amino acids are fed into SVMs as features for sequence-to-structure prediction. Confidence values for forming helix, sheet and coil that are obtained from the first stage SVM are then used in the second stage SVM for performing structure-to-structure prediction. The two-stage cascaded classifiers (PSP_MCSVM) are trained with proteins from RS126 dataset. The classifiers are finally tested on target proteins of critical assessment of protein structure prediction experiment-9 (CASP9). PSP_MCSVM with brainstorming consensus procedure performs better than the prediction servers like Predator, DSC, SIMPA96, for randomly selected proteins from CASP9 targets. The overall performance is found to be comparable with the current state-of-the art. PSP_MCSVM source code, train-test datasets and supplementary files are available freely in public domain at: http://sysbio.icm.edu.pl/secstruct and http://code.google.com/p/cmater-bioinfo

Crossref

Springer - Publisher Connector

PubMed Central

Novel mutations in the VKORC1 gene of wild rats and mice – a response to 50 years of selection pressure by warfarin?

Author: A Zimmermann
AD MacNicoll
AD MacNicoll
AD MacNicoll
Alan D MacNicoll
C Vermeer
Clemens R Müller
CM Boyle
D Cain
DJ Harrington
EC Cranenburg
FP Rowe
Hans-Joachim Pelz
HH Thijssen
HH Thijssen
HJ Pelz
JA Bishop
JA Sadowski
JH Greaves
JH Greaves
JJ Mach
JK Tie
Johannes Oldenburg
Ki-Joon Song
M Lund
MA Hermodson
R Lasseur
R Redfern
RA Johnson
RG Bell
S Rost
S Rost
Sandra Menzel
Simone Rost
T Li
Thomas Jäkel
TM Misenheimer
Vanina León
World Health Organization
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Coumarin derivatives have been in world-wide use for rodent pest control for more than 50 years. Due to their retarded action as inhibitors of blood coagulation by repression of the vitamin K reductase (VKOR) activity, they are the rodenticides of choice against several species. Resistance to these compounds has been reported for rodent populations from many countries around the world and poses a considerable problem for efficacy of pest control. Results In the present study, we have sequenced the <it>VKORC1 </it>genes of more than 250 rats and mice trapped in anticoagulant-exposed areas from four continents, and identified 18 novel and five published missense mutations, as well as eight neutral sequence variants, in a total of 178 animals. Mutagenesis in <it>VKORC1 </it>cDNA constructs and their recombinant expression revealed that these mutations reduced VKOR activities as compared to the wild-type protein. However, the <it>in vitro </it>enzyme assay used was not suited to convincingly demonstrate the warfarin resistance of all mutant proteins Conclusion Our results corroborate the <it>VKORC1 </it>gene as the main target for spontaneous mutations conferring warfarin resistance. The mechanism(s) of how mutations in the <it>VKORC1 </it>gene mediate insensitivity to coumarins <it>in vivo </it>has still to be elucidated.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Holographic Metamagnetism, Quantum Criticality, and Crossover Behavior

Author: A Buchel
A Chamblin
A Parnachev
AJ Millis
AJ Millis
AW Rost
D Anninos
D Mateos
E D'Hoker
E D'Hoker
E Witten
Eric D’Hoker
G Compere
G Lifschytz
GT Horowitz
Hv Lohneysen
JA Hertz
JL Davis
JL Davis
JP Gauntlett
JP Gauntlett
JP Gauntlett
K Jensen
M Cadoni
M Cubrovic
N Evans
Per Kraus
S Nakamura
S Sachdev
S-J Rey
SA Hartnoll
V Oganesyan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/04/2010
Field of study

Using high-precision numerical analysis, we show that 3+1 dimensional gauge theories holographically dual to 4+1 dimensional Einstein-Maxwell-Chern-Simons theory undergo a quantum phase transition in the presence of a finite charge density and magnetic field. The quantum critical theory has dynamical scaling exponent z=3, and is reached by tuning a relevant operator of scaling dimension 2. For magnetic field B above the critical value B_c, the system behaves as a Fermi liquid. As the magnetic field approaches B_c from the high field side, the specific heat coefficient diverges as 1/(B-B_c), and non-Fermi liquid behavior sets in. For B<B_c the entropy density s becomes non-vanishing at zero temperature, and scales according to s \sim \sqrt{B_c - B}. At B=B_c, and for small non-zero temperature T, a new scaling law sets in for which s\sim T^{1/3}. Throughout a small region surrounding the quantum critical point, the ratio s/T^{1/3} is given by a universal scaling function which depends only on the ratio (B-B_c)/T^{2/3}. The quantum phase transition involves non-analytic behavior of the specific heat and magnetization but no change of symmetry. Above the critical field, our numerical results are consistent with those predicted by the Hertz/Millis theory applied to metamagnetic quantum phase transitions, which also describe non-analytic changes in magnetization without change of symmetry. Such transitions have been the subject of much experimental investigation recently, especially in the compound Sr_3 Ru_2 O_7, and we comment on the connections.Comment: 23 pages, 8 figures v2: added ref

arXiv.org e-Print Archive

Crossref

PCI-SS: MISO dynamic nonlinear protein secondary structure prediction

Author: A Zemla
AA Schaffer
B Alberts
B Rost
B Rost
B Rost
BW Matthews
D Przybylski
DT Jones
G Pollastri
H Berman
JA Cuff
James R Green
JJ Ward
JR Green
JR Green
JR Green
JR Green
K Lin
LJ McGuffin
M Korenberg
M Ouali
M Shah
Michael J Korenberg
MJ Korenberg
MJ Korenberg
MJ Korenberg
MJ Korenberg
Mohammed O Aboul-Magd
R Adamczak
R David
RE Dorsey
S Montgomerie
VA Eyrich
W Kabsch
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Since the function of a protein is largely dictated by its three dimensional configuration, determining a protein's structure is of fundamental importance to biology. Here we report on a novel approach to determining the one dimensional secondary structure of proteins (distinguishing α-helices, β-strands, and non-regular structures) from primary sequence data which makes use of Parallel Cascade Identification (PCI), a powerful technique from the field of nonlinear system identification. Results Using PSI-BLAST divergent evolutionary profiles as input data, dynamic nonlinear systems are built through a black-box approach to model the process of protein folding. Genetic algorithms (GAs) are applied in order to optimize the architectural parameters of the PCI models. The three-state prediction problem is broken down into a combination of three binary sub-problems and protein structure classifiers are built using 2 layers of PCI classifiers. Careful construction of the optimization, training, and test datasets ensures that no homology exists between any training and testing data. A detailed comparison between PCI and 9 contemporary methods is provided over a set of 125 new protein chains guaranteed to be dissimilar to all training data. Unlike other secondary structure prediction methods, here a web service is developed to provide both human- and machine-readable interfaces to PCI-based protein secondary structure prediction. This server, called PCI-SS, is available at <url>http://bioinf.sce.carleton.ca/PCISS</url>. In addition to a dynamic PHP-generated web interface for humans, a Simple Object Access Protocol (SOAP) interface is added to permit invocation of the PCI-SS service remotely. This machine-readable interface facilitates incorporation of PCI-SS into multi-faceted systems biology analysis pipelines requiring protein secondary structure information, and greatly simplifies high-throughput analyses. XML is used to represent the input protein sequence data and also to encode the resulting structure prediction in a machine-readable format. To our knowledge, this represents the only publicly available SOAP-interface for a protein secondary structure prediction service with published WSDL interface definition. Conclusion Relative to the 9 contemporary methods included in the comparison cascaded PCI classifiers perform well, however PCI finds greatest application as a consensus classifier. When PCI is used to combine a sequence-to-structure PCI-based classifier with the current leading ANN-based method, PSIPRED, the overall error rate (Q3) is maintained while the rate of occurrence of a particularly detrimental error is reduced by up to 25%. This improvement in BAD score, combined with the machine-readable SOAP web service interface makes PCI-SS particularly useful for inclusion in a tertiary structure prediction pipeline.</p

Crossref

Carleton University's Institutional Repository

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Responses of marine benthic microalgae to elevated CO<inf>2</inf>

Author: A Engel
A Tribollet
AE McNamara
AJ Underwood
AS Hill
AS Hill
B Rost
B Rost
BB Dias
BD Russell
BM Hopkinson
C Lombardi
C. Brownlee
CD Hepburn
CJM Hoppe
CL Hurd
CN Bianchi
DA Hutchins
DA Hutchins
E Lewis
E Vanhaecke
EGJ Edyvean
EH Simpson
F-X Fu
F-X Fu
F-X Fu
FE Round
G Diaz-Pulido
G Langer
HL Wood
IE Hendriks
IM Munda
J Barcelos e Ramos
J Beardall
J Beardall
J Liu
J Thomsen
J-M Kim
J. M. Hall-Spencer
JA Kleypas
JA Raven
JA Raven
JA Raven
JC Orr
JM Hall-Spencer
JP Barry
K Caldeira
KE Fabricius
KJ Kroeker
KR Hinga
L Porzio
M Cigliano
M Giordano
M Hein
M Matsumoto
M. Graziano
M. Milazzo
MD Iglesias-Rodriguez
MJ Anderson
ML Tuchman
MR Badger
N Nakićenović
O Levitan
P Kerrison
PD Tortell
PD Tortell
PD Tortell
PD Tortell
PM Stanley
R Huang
R Rodolfo-Metalpa
R Rodolfo-Metalpa
R Sekar
R Stafford
R. E. M. Rickaby
RC Thompson
RC Thompson
RC Thompson
RH Bustamante
RJ Ritchie
RP Couto
S Burkhardt
S Burkhardt
S Burkhardt
S Martin
S Martin
S Nagarkar
S Trimborn
S Vizzini
SA Kranz
SC Doney
SD Connell
SJ Hawkins
SR Jenkins
SW Chisholm
T Kiørboe
U Riebesell
U Riebesell
U Riebesell
V. R. Johnson
YM Mak
Z Lu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Increasing anthropogenic CO2 emissions to the atmosphere are causing a rise in pCO2 concentrations in the ocean surface and lowering pH. To predict the effects of these changes, we need to improve our understanding of the responses of marine primary producers since these drive biogeochemical cycles and profoundly affect the structure and function of benthic habitats. The effects of increasing CO2 levels on the colonisation of artificial substrata by microalgal assemblages (periphyton) were examined across a CO2 gradient off the volcanic island of Vulcano (NE Sicily). We show that periphyton communities altered significantly as CO2 concentrations increased. CO2 enrichment caused significant increases in chlorophyll a concentrations and in diatom abundance although we did not detect any changes in cyanobacteria. SEM analysis revealed major shifts in diatom assemblage composition as CO2 levels increased. The responses of benthic microalgae to rising anthropogenic CO2 emissions are likely to have significant ecological ramifications for coastal systems. © 2011 Springer-Verlag

Crossref

Plymouth Electronic Archive and Research Library

Oxford University Research Archive

Publishing Network for Geoscientific and Environmental Data

Archivio istituzionale della ricerca - Università di Palermo

Gene Function Classification Using Bayesian Models with Hierarchy-Based Priors

Author: A Clare
A McCallum
AS Weigend
B Rost
B Schoikowski
B Shahbaba
Babak Shahbaba
BE Engelhardt
D Koller
EM Marcotte
FR Blattner
H Blockeel
I Tsochantaridis
IUBMB
J DeRisi
J Fox
J Goodman
J Struyf
J Zhang
JA Eisen
JR Guest
K Sjölander
L Cai
L Dehaspe
M Brown
M Deng
M Deng
M Eisen
M Riley
M Riley
N Cesa-Bianchi
O Dekel
P Pavlidis
R Caruana
R Eisner
Radford M Neal
RD King
RD King
RM Neal
RM Neal
RM Neal
S Rison
S Sattath
S Spiro
SF Altschul
ST Dumais
WR Pearson
Z Barutcuoglu
Publication venue
Publication date: 01/01/2006
Field of study

We investigate the application of hierarchical classification schemes to the annotation of gene function based on several characteristics of protein sequences including phylogenic descriptors, sequence based attributes, and predicted secondary structure. We discuss three Bayesian models and compare their performance in terms of predictive accuracy. These models are the ordinary multinomial logit (MNL) model, a hierarchical model based on a set of nested MNL models, and a MNL model with a prior that introduces correlations between the parameters for classes that are nearby in the hierarchy. We also provide a new scheme for combining different sources of information. We use these models to predict the functional class of Open Reading Frames (ORFs) from the E. coli genome. The results from all three models show substantial improvement over previous methods, which were based on the C5 algorithm. The MNL model using a prior based on the hierarchy outperforms both the non-hierarchical MNL model and the nested MNL model. In contrast to previous attempts at combining these sources of information, our approach results in a higher accuracy rate when compared to models that use each data source alone. Together, these results show that gene function can be predicted with higher accuracy than previously achieved, using Bayesian models that incorporate suitable prior information

arXiv.org e-Print Archive

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Organizational factors and depression management in community-based primary care settings

Author: A Donabedian
A Neumeyer-Gromen
AJ Dietrich
AM Kilbourne
AM Kilbourne
AM Kilbourne
Amy M Kilbourne
BE Landon
BE Landon
BG Druss
BN Doebbeling
Charles F Reynolds
CJ Murray
D Berwick
D Cohen
DA Regier
DE Grembowski
E Badamgarav
E Ferlie
EA Balas
EC Nelson
Edward P Post
EH Wagner
EM Yano
EM Yano
EP Post
Francis X Solano
GL Jackson
HA Pincus
HA Pincus
HA Pincus
Harold Alan Pincus
HC Schulberg
Institute of Medicine
Institute of Medicine
Institute of Medicine
J Rycroft-Malone
J Unutzer
JA Kairys
JC Coyne
JJ Ofman
JP Morrissey
JS Hunt
JS Zinn
K Rost
KA Phillips
KM Miles
KM Rost
L Casalino
LA Cooper
LK Kochevar
LM Soban
LS Meredith
LV Rubenstein
LV Rubenstein
M Horvitz-Lennon
ML Bruce
MS Ridgely
O Grusky
O Grusky
PV Marsden
RG Frank
RM Andersen
Robert W Bremer
S Findlay
S Gilbody
S Shortell
SL Krein
SM Shortell
SM Shortell
SM Shortell
SM Shortell
SM Shortell
SS Lyons
T Bodenheimer
T Bodenheimer
T Scott
U.S. Department of Health and Human Services
U.S. Department of Health and Human Services
W Katon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Abstract Background Evidence-based quality improvement models for depression have not been fully implemented in routine primary care settings. To date, few studies have examined the organizational factors associated with depression management in real-world primary care practice. To successfully implement quality improvement models for depression, there must be a better understanding of the relevant organizational structure and processes of the primary care setting. The objective of this study is to describe these organizational features of routine primary care practice, and the organization of depression care, using survey questions derived from an evidence-based framework. Methods We used this framework to implement a survey of 27 practices comprised of 49 unique offices within a large primary care practice network in western Pennsylvania. Survey questions addressed practice structure (e.g., human resources, leadership, information technology (IT) infrastructure, and external incentives) and process features (e.g., staff performance, degree of integrated depression care, and IT performance). Results The results of our survey demonstrated substantial variation across the practice network of organizational factors pertinent to implementation of evidence-based depression management. Notably, quality improvement capability and IT infrastructure were widespread, but specific application to depression care differed between practices, as did coordination and communication tasks surrounding depression treatment. Conclusions The primary care practices in the network that we surveyed are at differing stages in their organization and implementation of evidence-based depression management. Practical surveys such as this may serve to better direct implementation of these quality improvement strategies for depression by improving understanding of the organizational barriers and facilitators that exist within both practices and practice networks. In addition, survey information can inform efforts of individual primary care practices in customizing intervention strategies to improve depression management.http://deepblue.lib.umich.edu/bitstream/2027.42/78269/1/1748-5908-4-84.xmlhttp://deepblue.lib.umich.edu/bitstream/2027.42/78269/2/1748-5908-4-84-S1.PDFhttp://deepblue.lib.umich.edu/bitstream/2027.42/78269/3/1748-5908-4-84.pdfPeer Reviewe

Crossref

Columbia University Academic Commons

PubMed Central

Deep Blue Documents at the University of Michigan

FLORA: a novel method to predict protein function from structure in diverse superfamilies

Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2–3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (α, β, αβ) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

UCL Discovery

PubMed Central

Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information

Author: A Ceroni
A Lesk
A Salamov
A Vullo
Alberto JM Martin
Alessandro Vullo
B Rost
B Rost
B Rost
C Orengo
Catherine Mooney
D Frishman
D Jones
D Jones
D Przybylski
E Krieger
G Gianese
G Pollastri
G Pollastri
G Pollastri
G Pollastri
Gianluca Pollastri
H Berman
H Naderi-Manesh
J Cheng
J Cheng
J Cuff
J Moult
J Moult
J Sim
JA Cuff
L Fourrier
M Mucchielli-Giorgi
M Nguyen
M Wagner
P Baldi
P Baldi
P Baldi
P Bradley
R Adamczak
R Karchin
S Ahmad
S Altschul
S Montgomerie
S Qin
SK Riis
T Petersen
U Hobohm
V Eyrich
W Kabsch
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background : Structural properties of proteins such as secondary structure and solvent accessibility contribute to three-dimensional structure prediction, not only in the ab initio case but also when homology information to known structures is available. Structural properties are also routinely used in protein analysis even when homology is available, largely because homology modelling is lower throughput than, say, secondary structure prediction. Nonetheless, predictors of secondary structure and solvent accessibility are virtually always ab initio. Results: Here we develop high-throughput machine learning systems for the prediction of protein secondary structure and solvent accessibility that exploit homology to proteins of known structure, where available, in the form of simple structural frequency profiles extracted from sets of PDB templates. We compare these systems to their state-of-the-art ab initio counterparts, and with a number of baselines in which secondary structures and solvent accessibilities are extracted directly from the templates. We show that structural information from templates greatly improves secondary structure and solvent accessibility prediction quality, and that, on average, the systems significantly enrich the information contained in the templates. For sequence similarity exceeding 30%, secondary structure prediction quality is approximately 90%, close to its theoretical maximum, and 2-class solvent accessibility roughly 85%. Gains are robust with respect to template selection noise, and significant for marginal sequence similarity and for short alignments, supporting the claim that these improved predictions may prove beneficial beyond the case in which clear homology is available. Conclusion: The predictive system are publicly available at the address http://distill.ucd.ieScience Foundation IrelandIrish Research Council for Science, Engineering and TechnologyHealth Research BoardUCD President's Award 2004au, da, ke, ab, sp - kpw30/11/1

Crossref

Research Repository UCD

Springer - Publisher Connector

PubMed Central