29,723 research outputs found

Measuring Global Credibility with Application to Local Sequence Alignment

Author: Andrey Rzhetsky
B-JM Webb
Bobbie-Jo M. Webb-Robertson
BP Carlin
C Webber
Charles E. Lawrence
D Naor
DJ Lipman
HS Booth
HT Mevissen
I Holmes
J Zhu
JP Comet
JS Liu
JS Liu
JS Liu
KA Perry
KM Chao
L Yu
LE Carvalho
Lee Ann McCue
M Kendall
M Schlosshauer
M Vingron
M Vingron
M Zuker
ME Dayhoff
ML Tress
MS Waterman
R Durbin
RL Ott
S Henikoff
S Karlin
S Miyazawa
SF Altschul
SF Altschul
TF Smith
W Thompson
WR Pearson
WR Pearson
WR Pearson
Y Ding
YK Yu
Publication venue: Public Library of Science
Publication date: 01/05/2008

Computational biology is replete with high-dimensional (high-D) discrete prediction and inference problems, including sequence alignment, RNA structure prediction, phylogenetic inference, motif finding, prediction of pathways, and model selection problems in statistical genetics. Even though prediction and inference in these settings are uncertain, little attention has been focused on the development of global measures of uncertainty. Regardless of the procedure employed to produce a prediction, when a procedure delivers a single answer, that answer is a point estimate selected from the solution ensemble, the set of all possible solutions. For high-D discrete spaces, these ensembles are immense, and thus there is considerable uncertainty. We recommend the use of Bayesian credibility limits to describe this uncertainty, where a 100(1−α)%, 0≤α≤1, credibility limit is the minimum Hamming distance radius of a hyper-sphere containing 100(1−α)% of the posterior distribution. Because sequence alignment is arguably the most extensively used procedure in computational biology, we employ it here to make these general concepts more concrete. The maximum similarity estimator (i.e., the alignment that maximizes the likelihood) and the centroid estimator (i.e., the alignment that minimizes the mean Hamming distance from the posterior weighted ensemble of alignments) are used to demonstrate the application of Bayesian credibility limits to alignment estimators. Application of Bayesian credibility limits to the alignment of 20 human/rodent orthologous sequence pairs and 125 orthologous sequence pairs from six Shewanella species shows that credibility limits of the alignments of promoter sequences of these species vary widely, and that centroid alignments dependably have tighter credibility limits than traditional maximum similarity alignments.
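The credibility limit and centroid estimator defined in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes each sampled alignment is encoded as a binary vector (1 if a given pair of positions is aligned, 0 otherwise) and that a handful of equally weighted posterior samples is available; the function names and the toy data are hypothetical.

```python
import math

def hamming(a, b):
    """Number of coordinates at which two equal-length binary vectors differ."""
    return sum(x != y for x, y in zip(a, b))

def centroid(samples):
    """Per-coordinate majority vote over the samples.

    For binary vectors this minimizes the mean Hamming distance to the
    sampled ensemble (ties are resolved toward 0 here, an arbitrary choice).
    """
    n = len(samples)
    return tuple(int(2 * sum(column) > n) for column in zip(*samples))

def credibility_limit(estimate, samples, alpha=0.05):
    """Smallest Hamming radius around `estimate` that contains at least a
    (1 - alpha) fraction of the equally weighted posterior samples."""
    distances = sorted(hamming(estimate, s) for s in samples)
    # Small tolerance guards against floating-point error in (1 - alpha) * n.
    k = max(1, math.ceil((1 - alpha) * len(distances) - 1e-9))
    return distances[k - 1]

# Hypothetical posterior samples (assumption: 4 candidate aligned pairs).
samples = [
    (1, 1, 0, 0),
    (1, 1, 0, 0),
    (1, 1, 1, 0),
    (1, 0, 0, 0),
    (0, 0, 1, 1),
]

center = centroid(samples)
other = (1, 1, 1, 0)  # some other candidate point estimate

print(center, credibility_limit(center, samples, alpha=0.2))
print(other, credibility_limit(other, samples, alpha=0.2))
```

In this toy ensemble the centroid has a smaller 80% credibility radius than the other candidate, which mirrors the kind of comparison the abstract reports, though nothing here reproduces the paper's data.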

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Double Bottom Line Progress Report: Assessing Social Impact in Double Bottom Line Ventures, Methods Catalog

Author: Catherine Clark
David Long
Sara Olsen
William Rosenzweig
Publication venue: Rockefeller Foundation
Publication date: 01/01/2004

Outlines methods for social entrepreneurs and their investors to define, measure, and communicate social impact and return in early-stage ventures.

IssueLab

RNAG: a new Gibbs sampler for predicting RNA secondary structure for unaligned sequences

Author: Bernhart
Bindewald
Carvalho
Cary
Charles E. Lawrence
Chenna
Ding
Ding
Do
Do
Do
Donglai Wei
Eddy
Gardner
Geman
Giegerich
Griffiths-Jones
Gutell
Hamada
Hamada
Hofacker
Hofacker
Ji
Kiryu
Kiryu
Knudsen
Lauren V. Alpert
Lindgreen
Liu
Mathews
Mathews
Meyer
Nawrocki
Nawrocki
Newberg
Sakakibara
Sankoff
Seemann
Siebert
Steffen
Tabaska
Torarinsson
Webb
Webb-Robertson
Will
Xing
Yao
Zuker
Publication venue: Oxford University Press
Publication date: 01/01/2011

Motivation: RNA secondary structure plays an important role in the function of many RNAs, and structural features are often key to their interaction with other cellular components. Thus, there has been considerable interest in the prediction of secondary structures for RNA families. In this article, we present a new global structural alignment algorithm, RNAG, to predict consensus secondary structures for unaligned sequences. It uses a blocked Gibbs sampling algorithm, which has a theoretical advantage in convergence time. This algorithm iteratively samples from the conditional probability distributions P(Structure | Alignment) and P(Alignment | Structure). Not surprisingly, there is considerable uncertainty in the high-dimensional space of this difficult problem, which has so far received limited attention in this field. We show how the samples drawn from this algorithm can be used to more fully characterize the posterior space and to assess the uncertainty of predictions.
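The alternation between P(Structure | Alignment) and P(Alignment | Structure) can be sketched on a toy problem. The Python below is not RNAG: it assumes a tiny, made-up joint probability table over two alignment states and three structure states, and it only shows the two-block sampling loop. All names and numbers are hypothetical.

```python
import random
from collections import Counter

# Hypothetical joint distribution P(alignment=a, structure=s); rows are a, columns are s.
JOINT = [
    [0.30, 0.10, 0.05],
    [0.05, 0.15, 0.35],
]
N_ALIGNMENTS = len(JOINT)
N_STRUCTURES = len(JOINT[0])

def sample_structure(a, rng):
    """Draw s from P(s | a), which is proportional to row a of the joint table."""
    return rng.choices(range(N_STRUCTURES), weights=JOINT[a])[0]

def sample_alignment(s, rng):
    """Draw a from P(a | s), which is proportional to column s of the joint table."""
    column = [JOINT[a][s] for a in range(N_ALIGNMENTS)]
    return rng.choices(range(N_ALIGNMENTS), weights=column)[0]

def gibbs(n_samples, burn_in=500, seed=0):
    """Alternate the two conditional draws and keep the post-burn-in states."""
    rng = random.Random(seed)
    a = 0  # arbitrary starting alignment state
    draws = []
    for step in range(burn_in + n_samples):
        s = sample_structure(a, rng)
        a = sample_alignment(s, rng)
        if step >= burn_in:
            draws.append((a, s))
    return draws

draws = gibbs(20000)
freq = Counter(draws)
for a in range(N_ALIGNMENTS):
    for s in range(N_STRUCTURES):
        print(a, s, round(freq[(a, s)] / len(draws), 3), JOINT[a][s])
```

The retained draws approximate the joint distribution, so their spread can be summarized to describe uncertainty rather than reporting a single best state, which is the use of samples the abstract points to.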

CiteSeerX

Crossref

PubMed Central

Development of Bioinformatic and Experimental Technologies for Identification of Prokaryotic Regulatory Networks

Publication venue: Office of Scientific and Technical Information (OSTI)

Crossref

Double Bottom Line Project Report: Assessing Social Impact in Double Bottom Line Ventures

Author: Catherine Clark
David Long
Sara Olsen
William Rosenzweig
Publication venue: Center for Responsible Business
Publication date: 01/01/2004

This tool expresses the costs and social impacts of an investment in monetary terms. Quantification is achieved according to one or more of three measures: NPV (the aggregate discounted value of all costs, revenues and social impacts), benefit-cost ratio (the discounted value of revenues and positive impacts divided by the discounted value of costs and negative impacts) and internal rate of return (the net value of revenues plus impacts expressed as an annual percentage return on the total costs of the investment).
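The three measures can be made concrete with a short Python sketch. It uses the standard finance definitions (in particular, internal rate of return as the discount rate at which NPV equals zero), which is an assumption about how the tool computes them. The yearly figures are hypothetical, with social impacts assumed to be already converted to monetary amounts.

```python
def npv(rate, cash_flows):
    """Net present value of yearly cash flows; cash_flows[0] occurs today."""
    return sum(cf / (1 + rate) ** t for t, cf in enumerate(cash_flows))

def benefit_cost_ratio(rate, benefits, costs):
    """Discounted benefits divided by discounted costs (both given as positive amounts)."""
    return npv(rate, benefits) / npv(rate, costs)

def irr(cash_flows, lo=-0.99, hi=10.0, tol=1e-9):
    """Discount rate at which NPV is zero, found by bisection.

    Assumes NPV changes sign exactly once between `lo` and `hi`.
    """
    f_lo = npv(lo, cash_flows)
    if f_lo * npv(hi, cash_flows) > 0:
        raise ValueError("NPV does not change sign on the search interval")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        f_mid = npv(mid, cash_flows)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return (lo + hi) / 2

# Hypothetical venture: 100 invested today, 60 of revenue plus monetized
# social impact in each of the next two years.
benefits = [0, 60, 60]
costs = [100, 0, 0]
net = [b - c for b, c in zip(benefits, costs)]

print(round(npv(0.10, net), 2))
print(round(benefit_cost_ratio(0.10, benefits, costs), 3))
print(round(irr(net), 4))
```

A positive NPV, a ratio above 1, and an IRR above the chosen discount rate all point the same way in this example; the three measures can diverge when costs and benefits are spread unevenly over time.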

IssueLab

Cooperative Metaheuristics for Exploring Proteomic Data

Author: Appel Ron
Frey Julien
Gras Robin
Hernandez David
Hernandez Patricia
Martin Olivier
Mescam Yoann
Nicolas Jacques
Zangge Nadine
Publication date: 18/06/2018

Most combinatorial optimization problems cannot be solved exactly. A class of methods, called metaheuristics, has proved efficient at giving good approximate solutions in a reasonable time. Cooperative metaheuristics are a sub-set of metaheuristics, which implies a parallel exploration of the search space by several entities with information exchange between them. The importance of information exchange in the optimization process is related to the building block hypothesis of evolutionary algorithms, which is based on these two questions: what is the pertinent information of a given potential solution, and how can this information be shared? A classification of cooperative metaheuristics methods depending on the nature of cooperation involved is presented, and the specific properties of each class, as well as a way to combine them, are discussed. Several improvements in the field of metaheuristics are also given. In particular, a method to regulate the use of classical genetic operators and to define new, more pertinent ones is proposed, taking advantage of a building block structured representation of the explored space. A hierarchical approach resting on multiple levels of cooperative metaheuristics is finally presented, leading to the definition of a complete concerted cooperation strategy. Some applications of these concepts to difficult proteomics problems, including automatic protein identification, biological motif inference and multiple sequence alignment, are presented.
For each application, an innovative method based on the cooperation concept is given and compared with classical approaches. In the protein identification problem, a first level of cooperation using swarm intelligence is applied to the comparison of mass spectrometric data with a biological sequence database, followed by a genetic programming method to discover an optimal scoring function. The multiple sequence alignment problem is decomposed into three steps involving several evolutionary processes to infer different kinds of biological motifs and a concerted cooperation strategy to build the sequence alignment according to their motif content.

RERO DOC Digital Library