
53 research outputs found

Ten Simple Rules for Choosing between Industry and Academia

Author: Searls David B.
Publication venue: Public Library of Science
Publication date: 01/06/2009

Crossref

Directory of Open Access Journals

PubMed Central

PII: S1359-6446(99)01457-9

Author: David B Searls
Publication date: 23/04/2020

CiteSeerX

A View from the Dark Side

Author: Adams
Choucri
David B. Searls
Dong
Eisenberg
Gershon
Hillier
Horton
Houlgatte
Johnson
Kiley
Lehrman
Marshall
Marshall
Menzies
Nilsson
Rawlings
Sayood
Searls
Searls
Searls
Stephan
Strom
Wickware
Publication venue: Public Library of Science
Publication date: 01/01/2007

Crossref

Directory of Open Access Journals

PubMed Central

Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures

Author: B Langmead
Christian Otto
Cynthia M. Sharma
David B. Searls
G Myers
H Li
H Li
H Lin
JC Dohm
JM Rothberg
Jörg Hackermüller
Jörg Vogel
K Prüfer
M Crochemore
MI Abouelhoda
P Ferragina
Peter F. Stadler
Philipp Khaitovich
R Li
S Bennett
S Huse
S Karlin
SM Rumble
Stefan Kurtz
Steve Hoffmann
W Chang
Publication venue: Public Library of Science
Publication date: 01/01/2009

With few exceptions, current methods for short read mapping make use of simple seed heuristics to speed up the search. Most of the underlying matching models neglect the necessity to allow not only mismatches, but also insertions and deletions. Current evaluations indicate, however, that very different error models apply to the novel high-throughput sequencing methods. While the most frequent error type in Illumina reads is the mismatch, reads produced by 454's GS FLX predominantly contain insertions and deletions (indels). Even though 454 sequencers are able to produce longer reads, the method is frequently applied to small RNA (miRNA and siRNA) sequencing. Fast and accurate matching, in particular of short reads with diverse errors, is therefore a pressing practical problem. We introduce a matching model for short reads that can, besides mismatches, also cope with indels. It addresses different error models. For example, it can handle the problem of leading and trailing contaminations caused by primers and poly-A tails in transcriptomics or the length-dependent increase of error rates. In these contexts, it thus simplifies the tedious and error-prone trimming step. For efficient searches, our method utilizes index structures in the form of enhanced suffix arrays. In a comparison with current methods for short read mapping, the presented approach shows significantly increased performance not only for 454 reads, but also for Illumina reads. Our approach is implemented in the software segemehl, available at http://www.bioinf.uni-leipzig.de/Software/segemehl/

Public Library of Science (PLOS)

Crossref

Fraunhofer-ePrints

Directory of Open Access Journals

PubMed Central

Systematic Planning of Genome-Scale Experiments in Poorly Studied Species

Author: Amy Caudy
AP Gasch
B Efron
C Chitikila
C Huttenhower
C Huttenhower
C Shaffer
CA Ball
CL Myers
CL Myers
David B. Searls
DC Hess
G Yvert
H Parkinson
I Lee
J Ihmels
JC Rutherford
K Morik
K Xia
L Pena-Castillo
M Kellis
MA Hibbs
Maitreya Dunham
O Troyanskaya
Olga Troyanskaya
PM Fernandes
PT Spellman
R Edgar
RA Fisher
RB Brem
RB Brem
RD King
RJ Marinelli
S Bandyopadhyay
S Bergmann
S Le Crom
SL Tai
T Joachims
TR Hughes
VM Boer
VR Iyer
WJ Fu
Y Guan
Y Guan
Yuanfang Guan
Publication venue: Public Library of Science
Publication date: 01/03/2010

Genome-scale datasets have been used extensively in model organisms to screen for specific candidates or to predict functions for uncharacterized genes. However, despite the availability of extensive knowledge in model organisms, the planning of genome-scale experiments in poorly studied species is still based on the intuition of experts or heuristic trials. We propose that computational and systematic approaches can be applied to drive the experiment planning process in poorly studied species based on available data and knowledge in closely related model organisms. In this paper, we suggest a computational strategy for recommending genome-scale experiments based on their capability to interrogate diverse biological processes to enable protein function assignment. To this end, we use the data-rich functional genomics compendium of the model organism to quantify the accuracy of each dataset in predicting each specific biological process and the overlap in such coverage between different datasets. Our approach uses an optimized combination of these quantifications to recommend an ordered list of experiments for accurately annotating most proteins in the poorly studied related organisms to most biological processes, as well as a set of experiments that target each specific biological process. The effectiveness of this experiment-planning system is demonstrated for two related yeast species: the model organism Saccharomyces cerevisiae and the comparatively poorly studied Saccharomyces bayanus. Our system recommended a set of S. bayanus experiments based on an S. cerevisiae microarray data compendium. In silico evaluations estimate that less than 10% of the experiments could achieve similar functional coverage to the whole microarray compendium. This estimation was confirmed by performing the recommended experiments in S. bayanus, therefore significantly reducing the labor devoted to characterizing the poorly studied genome. This experiment-planning framework could readily be adapted to the design of other types of large-scale experiments as well as other groups of organisms.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Roots of Bioinformatics in Protein Evolution

Author: AJP Martin
AP Ryle
CA Ouzounis
CB Anfinsen
CB Anfinsen
CB Bridges
CH Li
David B. Searls
E Abderhalden
E Margoliash
E Zuckerkandl
EB Lewis
F Sanger
G Braunitzer
GA Mross
HA Itano
Ingram
JB Hagen
K Brew
KA Walsh
L Pauling
MO Dayhoff
MO Dayhoff
MO Dayhoff
MO Dayhoff
MW Nirenberg
P Edman
P Edman
R Eck
RF Doolittle
RF Doolittle
RF Doolittle
RF Doolittle
RL Hill
Russell F. Doolittle
S Henikoff
S Moore
SB Needleman
SG Stephens
SJ Singer
V du Vigneaud
V Ingram
WA Fitch
WM Fitch
Publication venue: Public Library of Science
Publication date: 01/07/2010

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Category Theoretic Analysis of Hierarchical Protein Materials and Social Networks

Author: A Fritsch
AL Barabasi
AL Barabasi
B Alberts
BC Pierce
CM Schneider
D Eisenberg
D Taylor
DA Fletcher
David I. Spivak
DB Searls
DI Spivak
E Moggi
E Rodriguez
Elizabeth Wood
EM Marcotte
EM Marcotte
FW Lawvere
GB Olson
H Jeong
H Jeong
H Peterlik
I Lee
J Aizenberg
J Verdasca
JD Currey
K Hofstetter
Laurent Kreplak
M Barr
M Moortgat
Markus J. Buehler
MD Hauser
MJ Buehler
MJ Buehler
MS Szalay
N Chomsky
N Huebsch
NM Pugno
O Mason
P Csermely
P Fratzl
P Nurse
P Wadler
R Brown
R Lakes
R Milo
R Paparcone
R Pastor-Satorras
RC Strohman
RT Oehrle
S Awodey
S Eilenberg
S Keten
SM Lane
SW Cranford
T Ackbarow
Tristan Giesa
WW Powell
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2011

Materials in biology span all the scales from Angstroms to meters and typically consist of complex hierarchical assemblies of simple building blocks. Here we describe an application of category theory to describe structural and resulting functional properties of biological protein materials by developing so-called ologs. An olog is like a “concept web” or “semantic network” except that it follows a rigorous mathematical formulation based on category theory. This key difference ensures that an olog is unambiguous, highly adaptable to evolution and change, and suitable for sharing concepts with other ologs. We consider simple cases of beta-helical and amyloid-like protein filaments subjected to axial extension and develop an olog representation of their structural and resulting mechanical properties. We also construct a representation of a social network in which people send text-messages to their nearest neighbors and act as a team to perform a task. We show that the olog for the protein and the olog for the social network feature identical category-theoretic representations, and we proceed to precisely explicate the analogy or isomorphism between them. The examples presented here demonstrate that the intrinsic nature of a complex system, which in particular includes a precise relationship between structure and function at different hierarchical levels, can be effectively represented by an olog. This, in turn, allows for comparative studies between disparate materials or fields of application, and results in novel approaches to derive functionality in the design of de novo hierarchical systems. We discuss opportunities and challenges associated with the description of complex biological materials by using ologs as a powerful tool for analysis and design in the context of materiomics, and we present the potential impact of this approach for engineering, life sciences, and medicine.

Presidential Early Career Award for Scientists and Engineers (N000141010562); United States. Army Research Office. Multidisciplinary University Research Initiative (W911NF0910541); United States. Office of Naval Research (grant N000141010841); Massachusetts Institute of Technology. Dept. of Mathematics; Studienstiftung des deutschen Volkes; Clark Barwick; Jacob Luri

arXiv.org e-Print Archive

Public Library of Science (PLOS)

CiteSeerX

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central

Publikationsserver der RWTH Aachen University

Disease-Aging Network Reveals Significant Roles of Aging Genes in Connecting Genetic Diseases

Author: A Budovsky
A Budovsky
A Friedman
A Kowald
A Kriete
A Ozgur
AL Barabasi
C Soti
D Harman
David B. Searls
DJ Watts
E Ravasz
G Jin
GRG Lanckriet
H Kitano
H Xue
HD Osiewacz
HJ Kiss
I Feldman
J Hasty
JDJ Han
Jiguang Wang
JP de Magalhaes
JP de Magalhaes
JR Managbanag
KI Goh
L Hayflick
Luonan Chen
M Wolfson
MEJ Newman
P Shannon
P Zuppan
PF Jonsson
Q Cui
R Albert
R Bell
RI Kondor
S Karni
S Maere
S Maslov
S Peri
S Vasto
Shihua Zhang
T Ideker
T Ishunina
TBL Kirkwood
U Brandes
U Stelzl
X Jiang
X Wu
Xiang-Sun Zhang
Y Li
Yong Wang
Z Spiro
Z Tu
Publication venue: Public Library of Science
Publication date: 01/09/2009

One of the challenging problems in biology and medicine is exploring the underlying mechanisms of genetic diseases. Recent studies suggest that the relationship between genetic diseases and the aging process is important in understanding the molecular mechanisms of complex diseases. Although some intricate associations have been investigated for a long time, the studies are still in their early stages. In this paper, we construct a human disease-aging network to study the relationship between aging genes and genetic disease genes. Specifically, we integrate human protein-protein interactions (PPIs), disease-gene associations, aging-gene associations, and physiological system–based genetic disease classification information in a single graph-theoretic framework and find that (1) human disease genes are much closer to aging genes than expected by chance; and (2) diseases can be categorized into two types according to their relationships with aging. Type I diseases have their genes significantly close to aging genes, while type II diseases do not. Furthermore, we examine the topological characteristics of the disease-aging network from a systems perspective. Theoretical results reveal that the genes of type I diseases are in a central position of a PPI network while those of type II diseases are not; (3) more importantly, we define an asymmetric closeness based on the PPI network to describe relationships between diseases, and find that aging genes make a significant contribution to associations among diseases, especially among type I diseases. In conclusion, the network-based study provides not only evidence for the intricate relationship between the aging process and genetic diseases, but also biological implications for prying into the nature of human diseases.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Roots of Bioinformatics in Theoretical Biology

Author: A Anderson
A Boiteux
A Crombach
A Crombach
A Lindenmayer
A Lindenmayer
A Marée
A Marée
A Neyfakh
A Turing
A Varma
A Wagner
A Wagner
B Goodwin
B Hesper
B Turner
C Hewitt
C Honk
C Pál
CH Waddington
D Gillespie
D Konings
D Konings
David B. Searls
E Koonin
E van Nimwegen
EP Odum
F Crick
F Graner
F Rosenblatt
FK de Boer
G Lance
G Odell
H Abelson
H Kacser
J Draghi
J Draghi
J Griffith
J Hagen
J Holland
L Hurst
L Segel
L Von Bertalanffy
L Von Bertalanffy
L Wolpert
M Boerlijst
M Covert
M Dayhoff
M Dayhoff
M Huynen
M Huynen
M Huynen
M Kertesz
M Kozak
M Minsky
M Szekely
M Thomson
M Van Hoek
N Batada
N Savill
N Stoletzki
O Mastenbroek
O Soyer
P Hogeweg
P Hogeweg
P Hogeweg
P Hogeweg
P Hogeweg
P Hogeweg
P Hogeweg
P Hogeweg
P Hogeweg
P Hogeweg
P Hogeweg
P Macnaughton-Smith
P Nurse
P Schuster
P Sneath
Paulien Hogeweg
R Goldstein
R May
R Rosen
S Freilich
S Huang
S Huang
S Kauffman
S Papert
S Rafelski
W Gu
Publication venue: Public Library of Science
Publication date: 01/03/2011

From the late 1980s onward, the term “bioinformatics” has mostly been used to refer to computational methods for comparative analysis of genome data. However, the term was originally more widely defined as the study of informatic processes in biotic systems. In this essay, I will trace this early history (from a personal point of view) and I will argue that the original meaning of the term is re-emerging.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

An online bioinformatics curriculum.

Author: David B Searls
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2012

Online learning initiatives over the past decade have become increasingly comprehensive in their selection of courses and sophisticated in their presentation, culminating in the recent announcement of a number of consortium and startup activities that promise to make a university education on the internet, free of charge, a real possibility. At this pivotal moment it is appropriate to explore the potential for obtaining comprehensive bioinformatics training with currently existing free video resources. This article presents such a bioinformatics curriculum in the form of a virtual course catalog, together with editorial commentary, and an assessment of strengths, weaknesses, and likely future directions for open online learning in this field.

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central