Search CORE

4 research outputs found

Linkage disequilibrium based genotype calling from low-coverage shotgun sequencing reads

Author: Dinakar Sanjiv
Duitama Jorge
Hernández Yözen
Kennedy Justin
Măndoiu Ion I
Wu Yufeng
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Background Recent technology advances have enabled sequencing of individual genomes, promising to revolutionize biomedical research. However, deep sequencing remains more expensive than microarrays for performing whole-genome SNP genotyping. Results In this paper we introduce a new multi-locus statistical model and computationally efficient genotype calling algorithms that integrate shotgun sequencing data with linkage disequilibrium (LD) information extracted from reference population panels such as Hapmap or the 1000 genomes project. Experiments on publicly available 454, Illumina, and ABI SOLiD sequencing datasets suggest that integration of LD information results in genotype calling accuracy comparable to that of microarray platforms from sequencing data of low-coverage. A software package implementing our algorithm, released under the GNU General Public License, is available at http://dna.engr.uconn.edu/software/GeneSeq/. Conclusions Integration of LD information leads to significant improvements in genotype calling accuracy compared to prior LD-oblivious methods, rendering low-coverage sequencing as a viable alternative to microarrays for conducting large-scale genome-wide association studies

Lirias

City University of New York

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines

Author: Amanda Larracuente
Filipe G. Vieira
Girish Ramrattan
Levy Vargas
Lia Di
Pedro Pagan
Rocky Bernstein
Saymon Akther
Wei-Gang Qiu
William McCaig
Yözen Hernández
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Abstract Background Automated bioinformatics workflows are more robust, easier to maintain, and results more reproducible when built with command-line utilities than with custom-coded scripts. Command-line utilities further benefit by relieving bioinformatics developers to learn the use of, or to interact directly with, biological software libraries. There is however a lack of command-line utilities that leverage popular Open Source biological software toolkits such as BioPerl (http://bioperl.org) to make many of the well-designed, robust, and routinely used biological classes available for a wider base of end users. Results Designed as standard utilities for UNIX-family operating systems, BpWrapper makes functionality of some of the most popular BioPerl modules readily accessible on the command line to novice as well as to experienced bioinformatics practitioners. The initial release of BpWrapper includes four utilities with concise command-line user interfaces, bioseq, bioaln, biotree, and biopop, specialized for manipulation of molecular sequences, sequence alignments, phylogenetic trees, and DNA polymorphisms, respectively. Over a hundred methods are currently available as command-line options and new methods are easily incorporated. Performance of BpWrapper utilities lags that of precompiled utilities while equivalent to that of other utilities based on BioPerl. BpWrapper has been tested on BioPerl Release 1.6, Perl versions 5.10.1 to 5.25.10, and operating systems including Apple macOS, Microsoft Windows, and GNU/Linux. Release code is available from the Comprehensive Perl Archive Network (CPAN) at https://metacpan.org/pod/Bio::BPWrapper. Source code is available on GitHub at https://github.com/bioperl/p5-bpwrapper. Conclusions BpWrapper improves on existing sequence utilities by following the design principles of Unix text utilities such including a concise user interface, extensive command-line options, and standard input/output for serialized operations. Further, dozens of novel methods for manipulation of sequences, alignments, and phylogenetic trees, unavailable in existing utilities (e.g., EMBOSS, Newick Utilities, and FAST), are provided. Bioinformaticians should find BpWrapper useful for rapid prototyping of workflows on the command-line without creating custom scripts for comparative genomics and other bioinformatics applications

City University of New York

Crossref

Directory of Open Access Journals

Copenhagen University Research Information System

BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines

Author: Amanda Larracuente
D Mcllroy
E Afgan
Filipe G. Vieira
Girish Ramrattan
JE Stajich
JE Stajich
JT Dudley
Levy Vargas
Lia Di
MN Price
P Rice
Pedro Pagan
PJA Cock
RC Edgar
Rocky Bernstein
Saymon Akther
SR Casjens
T Junier
TJ Lawrence
Wei-Gang Qiu
William McCaig
Yözen Hernández
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref