Search CORE

17 research outputs found

Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction

Author: AC Darling
AM Phillippy
D Fasulo
D Hernandez
D Zerbino
Daniel Fasulo
DD Sommer
EW Myers
IH Witten
J Butler
JR Miller
KE Holt
KR Rasmussen
Lance E Palmer
M Pop
M Roberts
M Roberts
Mathaeus Dejori
MJ Chaisson
P Havlak
PA Pevzner
Randall Bolanos
RR Quinlan
S Batzoglou
WJ Kent
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Probing a slepton Higgs on all frontiers

Author: A Bolanos
AK Grant
AK Grant
C Biggio
C Csáki
C Frugiuele
Carla Biggio
E Bertuzzo
F Riva
GD Kribs
GD Kribs
J Barranco
J Berger
J Kalinowski
JA Dror
Jeff Asaf Dror
K Benakli
K Benakli
K Benakli
K Benakli
L Randall
LJ Hall
M Heikinheimo
P Dießner
P Kumar
PJ Fox
R Davies
R Fok
R Fok
S Chakraborty
S Chakraborty
SDL Amigo
T Ohlsson
W Loinaz
Wee Hao Ng
Yuval Grossman
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction

Author: Bolanos Randall
Dejori Mathaeus
Fasulo Daniel
Palmer Lance E
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Abstract Background With the rapid expansion of DNA sequencing databases, it is now feasible to identify relevant information from prior sequencing projects and completed genomes and apply it to de novo sequencing of new organisms. As an example, this paper demonstrates how such extra information can be used to improve de novo assemblies by augmenting the overlapping step. Finding all pairs of overlapping reads is a key task in many genome assemblers, and to this end, highly efficient algorithms have been developed to find alignments in large collections of sequences. It is well known that due to repeated sequences, many aligned pairs of reads nevertheless do not overlap. But no overlapping algorithm to date takes a rigorous approach to separating aligned but non-overlapping read pairs from true overlaps. Results We present an approach that extends the Minimus assembler by a data driven step to classify overlaps as true or false prior to contig construction. We trained several different classification models within the Weka framework using various statistics derived from overlaps of reads available from prior sequencing projects. These statistics included percent mismatch and k-mer frequencies within the overlaps as well as a comparative genomics score derived from mapping reads to multiple reference genomes. We show that in real whole-genome sequencing data from the E. coli and S. aureus genomes, by providing a curated set of overlaps to the contigging phase of the assembler, we nearly doubled the median contig length (N50) without sacrificing coverage of the genome or increasing the number of mis-assemblies. Conclusions Machine learning methods that use comparative and non-comparative features to classify overlaps as true or false can be used to improve the quality of a sequence assembly.</p

Directory of Open Access Journals

Hepatocyte growth factor and c-Met promote dendritic maturation during hippocampal neuron differentiation via the Akt pathway

Author: Berling
Birchmeier
Birchmeier
Bolanos-Garcia
Campbell
Chao
Chol Seung Lim
Christensen
Craig
Cui
Dotti
Ebens
Fan
Forte
Furlong
Gong
Goold
Grimes
Gutierrez
Hamanoue
Honda
Jaworski
Jung
Kermorgant
Lim
Ma
Maina
Maina
Maina
Naska
Niimura
Powell
Powell
Powell
Randall S. Walikonis
Sanchez
Sanchez
Segarra
Smolen
Sobeih
State
Thewke
Tyndall
Tyndall
Tyndall
Walikonis
Yoshimura
Yu
Zhou
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Age-dependent differences in the strength and persistence of psychostimulant-induced conditioned activity in rats

Author: Adriani
Ahmed
Ahmed
Anderson
Bolanos
Bronstein
Bronstein
Campbell
Campbell
Campbell
Damianopoulos
Damianopoulos
Diehl
Franklin
Geisser
Gold
Guanowsky
Herbert
Holson
Johnson
Kalivas
Koek
Lanier
Laviola
Leith
McDougall
McDougall
McDougall
McDougall
McDougall
McDougall
Michel
Michel
Michel
Paulson
Pickens
Pruitt
Randall
Robinson
Shalaby
Siegel
Spear
Tilson
Tirelli
Tirelli
Valjent
Weiss
White
Wood
Zavala
Zombeck
Zorrilla
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date
Field of study

Crossref

Plasma and brain concentrations of oral therapeutic doses of methylphenidate and their impact on brain monoamine content in mice

Author: Andersen
Aoyama
Aoyama
Augustyniak
Aygul Balcioglu
Berridge
Biederman
Biederman
Bolanos
Brandon
Brandon
Carlezon
Chase
Davids
Davids
Deirdre McCarthy
Ding
Ding
Gerasimov
Jia-Qian Ren
Joseph Biederman
Klein-Schwartz
Kotaki
Kuczenski
Kuczenski
Kuczenski
McCabe
Meririnne
Olfson
Patrick
Patrick
Pradeep G. Bhide
Randall
Robbins
Russell
Sagvolden
Schweri
Sprague
Sulzer
Swanson
Swanson
Teter
Thai
Thanos
Thomas J. Spencer
Volkow
Volkow
Volkow
Wargin
Wolf
Zuvekas
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Whole-genome shotgun assembly and comparison of human genome assemblies

We report a whole-genome shotgun assembly (called WGSA) of the human genome generated at Celera in 2001. The Celera-generated shotgun data set consisted of 27 million sequencing reads organized in pairs by virtue of end-sequencing 2-kbp, 10-kbp, and 50-kbp inserts from shotgun clone libraries. The quality-trimmed reads covered the genome 5.3 times, and the inserts from which pairs of reads were obtained covered the genome 39 times. With the nearly complete human DNA sequence [National Center for Biotechnology Information (NCBI) Build 34] now available, it is possible to directly assess the quality, accuracy, and completeness of WGSA and of the first reconstructions of the human genome reported in two landmark papers in February 2001 [Venter, J. C., Adams, M. D., Myers, E. W., Li, P. W., Mural, R. J., Sutton, G. G., Smith, H. O., Yandell, M., Evans, C. A., Holt, R. A., et al. (2001) Science 291, 1304–1351; International Human Genome Sequencing Consortium (2001) Nature 409, 860–921]. The analysis of WGSA shows 97% order and orientation agreement with NCBI Build 34, where most of the 3% of sequence out of order is due to scaffold placement problems as opposed to assembly errors within the scaffolds themselves. In addition, WGSA fills some of the remaining gaps in NCBI Build 34. The early genome sequences all covered about the same amount of the genome, but they did so in different ways. The Celera results provide more order and orientation, and the consortium sequence provides better coverage of exact and nearly exact repeats

Crossref

Repository: Freie Universität Berlin (FU), Math Department (fu_mi_publications)

PubMed Central

MPG.PuRe

A review of the use of modafinil for attention-deficit hyperactivity disorder

Author: Adler
Akaoka
Aron
Barbaresi
Biederman
Biederman
Biederman
Biederman
Biederman
Bolanos
Dackis
Danielle Turner
DeBattista
Dinn
DuPaul
Duteil
Ernst
Faraone
Faraone
Faraone
Faraone
Fava
Ferraro
Ferraro
Freeman
Hogl
Hurst
Ishizuka
Jasinski
Kessler
Levine
Lin
Makela
Mariani
Mehta
Muller
Murphy
Myrick
Nishino
Norton
Ossmann
Pliszka
Randall
Rasmussen
Rosack
Rosenberg
Rugino
Rugino
Saletu
Scammell
Sevy
Shelton
Simon
Spence
Stahl
Stahl
Stoops
Swanson
Swanson
Swanson
Talbot
Tanganelli
Turner
Turner
Turner
Turner
Volkow
Wilens
Willoughby
Wisor
Wisor
Xiong
Yu
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Alteration of cytosolic free calcium homeostasis by SIN-1: high sensitivity of L-type Ca2+ channels to extracellular oxidative/nitrosative stress in cerebellar granule cells

Crossref

Importance of environmental context for one- and three-trial cocaine-induced behavioral sensitization in preweanling rats

Crossref