Search CORE

68,937 research outputs found

Tumor diversity and evolution revealed through RADseq

Author: Altshuler
Beal
Begum
Blaxter
Bos
Broaddus
Burdett
Butte
Caccamo
Cai
Caldas
Choy
Cresko
Cresko
Cuppen
da Silveira
De
DePristo
Durbin
Felsenstein
Gatenby
Getz
Goldberg
Greaves
Hoekstra
Hohenlohe
Hynes
Iacobuzio-Donahue
Johnson
Johnson
Katzen
Kawakami
Kernytsky
Kramer
Leder
Look
Lupski
Magi
Maley
Martincorena
Mesirov
Miller
Mullins
Nowak
Nowak
Nowak
Nowell
Parmigiani
Pawlik
Polyak
Polyak
Postlethwait
Pyeritz
Quake
Raible
Scholl
Schultz
Somers
Swanton
Townsend
Vogelstein
Weinberg
Weir
Westerfield
Yaeger
Zon
Zon
Publication venue: Digital Commons@Becker
Publication date: 01/01/2017
Field of study

Crossref

Digital Commons@Becker

Special features of RAD Sequencing data:implications for genotyping

Author: Blaxter Mark L
Cezard Timothée
Davey John W
Eland Cathlene
Fuentes-Utrilla Pablo
Gharbi Karim
Publication venue: 'Wiley'
Publication date: 01/01/2012
Field of study

Restriction site-associated DNA Sequencing (RAD-Seq) is an economical and efficient method for SNP discovery and genotyping. As with other sequencing-by-synthesis methods, RAD-Seq produces stochastic count data and requires sensitive analysis to develop or genotype markers accurately. We show that there are several sources of bias specific to RAD-Seq that are not explicitly addressed by current genotyping tools, namely restriction fragment bias, restriction site heterozygosity and PCR GC content bias. We explore the performance of existing analysis tools given these biases and discuss approaches to limiting or handling biases in RAD-Seq data. While these biases need to be taken seriously, we believe RAD loci affected by them can be excluded or processed with relative ease in most cases and that most RAD loci will be accurately genotyped by existing tools

CiteSeerX

Crossref

PubMed Central

Edinburgh Research Explorer

A clone-free, single molecule map of the domestic cow (Bos taurus) genome.

Author: Bechner Michael
Goldstein Steve
Hernandez-Ortiz Juan
Medrano Juan F
Pape Louise
Patino Diego
Place Michael
Potamousis Konstantinos
Ravindran Prabu
Rincon Gonzalo
Schwartz David C
Zhou Shiguo
Publication venue: eScholarship, University of California
Publication date: 28/08/2015
Field of study

BackgroundThe cattle (Bos taurus) genome was originally selected for sequencing due to its economic importance and unique biology as a model organism for understanding other ruminants, or mammals. Currently, there are two cattle genome sequence assemblies (UMD3.1 and Btau4.6) from groups using dissimilar assembly algorithms, which were complemented by genetic and physical map resources. However, past comparisons between these assemblies revealed substantial differences. Consequently, such discordances have engendered ambiguities when using reference sequence data, impacting genomic studies in cattle and motivating construction of a new optical map resource--BtOM1.0--to guide comparisons and improvements to the current sequence builds. Accordingly, our comprehensive comparisons of BtOM1.0 against the UMD3.1 and Btau4.6 sequence builds tabulate large-to-immediate scale discordances requiring mediation.ResultsThe optical map, BtOM1.0, spanning the B. taurus genome (Hereford breed, L1 Dominette 01449) was assembled from an optical map dataset consisting of 2,973,315 (439 X; raw dataset size before assembly) single molecule optical maps (Rmaps; 1 Rmap = 1 restriction mapped DNA molecule) generated by the Optical Mapping System. The BamHI map spans 2,575.30 Mb and comprises 78 optical contigs assembled by a combination of iterative (using the reference sequence: UMD3.1) and de novo assembly techniques. BtOM1.0 is a high-resolution physical map featuring an average restriction fragment size of 8.91 Kb. Comparisons of BtOM1.0 vs. UMD3.1, or Btau4.6, revealed that Btau4.6 presented far more discordances (7,463) vs. UMD3.1 (4,754). Overall, we found that Btau4.6 presented almost double the number of discordances than UMD3.1 across most of the 6 categories of sequence vs. map discrepancies, which are: COMPLEX (misassembly), DELs (extraneous sequences), INSs (missing sequences), ITs (Inverted/Translocated sequences), ECs (extra restriction cuts) and MCs (missing restriction cuts).ConclusionAlignments of UMD3.1 and Btau4.6 to BtOM1.0 reveal discordances commensurate with previous reports, and affirm the NCBI's current designation of UMD3.1 sequence assembly as the "reference assembly" and the Btau4.6 as the "alternate assembly." The cattle genome optical map, BtOM1.0, when used as a comprehensive and largely independent guide, will greatly assist improvements to existing sequence builds, and later serve as an accurate physical scaffold for studies concerning the comparative genomics of cattle breeds

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Intramolecular integration within Moloney murine leukemia virus DNA

Author: Baltimore David
Goff Stephen P.
Hoffmann Joseph
Shoemaker Charles
Publication venue
Publication date: 01/10/1981
Field of study

By screening a library of unintegrated, circular Moloney murine leukemia virus (M-MuLV) DNA cloned in lambda phage, we found that approximately 20% of the M-MuLV DNA inserts contained internal sequence deletions or inversions. Restriction enzyme mapping demonstrated tht the deleted segments frequently abutted a long terminal repeat (LTR) sequence, whereas the inverted segments were usually flanked by LTR sequences, suggesting that many of the variants arose as a consequence of M-MuLV DNA molecules integrating within their own DNA. Nucleotide sequencing also suggested that most of the variant inserts were generated by autointegration. One of the recombinant M-MuLV DNA inserts contained a large inverted repeat of a unique M-MuLV sequence abutting an LTR. This molecule was shown by nucleotide sequencing to have arisen by an M-MuLV DNA Molecule integrating within a second M-MuLV DNA molecule before cloning. The autointegrated M-MuLV DNA had generally lost two base pairs from the LTR sequence at each junction with target site DNA, whereas a four-base-pair direct repeat of target site DNA flanked the integrated viral DNA. Nucleotide sequencing of preintegration target site DNA showed that this four-base-pair direct repeat was present only once before integration and was thus reiterated by the integration event. The results obtained from the autointegrated clones were supported by nucleotide sequencing of the host-virus junction of two cloned M-MuLV integrated proviruses obtained from infected rat cells. Detailed analysis of the different unique target site sequences revealed no obvious common features

Caltech Authors

2b-RAD genotyping for population genomic studies of Chagas disease vectors: Rhodnius ecuadoriensis in Ecuador

Author: Andersson Björn
Costales Jaime A.
De Noia Michele
Grijalva Mario J.
Hernandez Castro Luis Enrique
Hernandez-Castro Luis E.
Llewellyn Martin S.
Ocaña-Mayorga Sofía
Paterno Marta
Villacís Anita G.
Yumiseva Cesar A.
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/07/2017
Field of study

Background: Rhodnius ecuadoriensis is the main triatomine vector of Chagas disease, American trypanosomiasis, in Southern Ecuador and Northern Peru. Genomic approaches and next generation sequencing technologies have become powerful tools for investigating population diversity and structure which is a key consideration for vector control. Here we assess the effectiveness of three different 2b restriction site-associated DNA (2b-RAD) genotyping strategies in R. ecuadoriensis to provide sufficient genomic resolution to tease apart microevolutionary processes and undertake some pilot population genomic analyses. Methodology/Principal findings: The 2b-RAD protocol was carried out in-house at a non-specialized laboratory using 20 R. ecuadoriensis adults collected from the central coast and southern Andean region of Ecuador, from June 2006 to July 2013. 2b-RAD sequencing data was performed on an Illumina MiSeq instrument and analyzed with the STACKS de novo pipeline for loci assembly and Single Nucleotide Polymorphism (SNP) discovery. Preliminary population genomic analyses (global AMOVA and Bayesian clustering) were implemented. Our results showed that the 2b-RAD genotyping protocol is effective for R. ecuadoriensis and likely for other triatomine species. However, only BcgI and CspCI restriction enzymes provided a number of markers suitable for population genomic analysis at the read depth we generated. Our preliminary genomic analyses detected a signal of genetic structuring across the study area. Conclusions/Significance: Our findings suggest that 2b-RAD genotyping is both a cost effective and methodologically simple approach for generating high resolution genomic data for Chagas disease vectors with the power to distinguish between different vector populations at epidemiologically relevant scales. As such, 2b-RAD represents a powerful tool in the hands of medical entomologists with limited access to specialized molecular biological equipment. Author summary: Understanding Chagas disease vector (triatomine) population dispersal is key for the design of control measures tailored for the epidemiological situation of a particular region. In Ecuador, Rhodnius ecuadoriensis is a cause of concern for Chagas disease transmission, since it is widely distributed from the central coast to southern Ecuador. Here, a genome-wide sequencing (2b-RAD) approach was performed in 20 specimens from four communities from Manabí (central coast) and Loja (southern) provinces of Ecuador, and the effectiveness of three type IIB restriction enzymes was assessed. The findings of this study show that this genotyping methodology is cost effective in R. ecuadoriensis and likely in other triatomine species. In addition, preliminary population genomic analysis results detected a signal of population structure among geographically distinct communities and genetic variability within communities. As such, 2b-RAD shows significant promise as a relatively low-tech solution for determination of vector population genomics, dynamics, and spread

ZENODO

Directory of Open Access Journals

Electronic Archiving System

Enlighten

Recommended from our members

Double-digest RADseq loci using standard Illumina indexes improve deep and shallow phylogenetic resolution of Lophodermium, a widespread fungal endophyte of pine needles.

Author: Oono Ryoko
Salas-Lizana Rodolfo
Publication venue: eScholarship, University of California
Publication date: 01/07/2018
Field of study

The phylogenetic and population genetic structure of symbiotic microorganisms may correlate with important ecological traits that can be difficult to directly measure, such as host preferences or dispersal rates. This study develops and tests a low-cost double-digest restriction site-associated DNA sequencing (ddRADseq) protocol to reveal among- and within-species genetic structure for Lophodermium, a genus of fungal endophytes whose evolutionary analyses have been limited by the scarcity of informative markers. The protocol avoids expensive barcoded adapters and incorporates universal indexes for multiplexing. We tested for reproducibility and functionality by comparing shared loci from sample replicates and assessed the effects of numbers of ambiguous sites and clustering thresholds on coverage depths, number of shared loci among samples, and phylogenetic reconstruction. Errors between technical replicates were minimal. Relaxing the quality-filtering criteria increased the mean coverage depth per locus and the number of loci recovered within a sample, but had little effect on the number of shared loci across samples. Increasing clustering threshold decreased the mean coverage depth per cluster and increased the number of loci recovered within a sample but also decreased the number of shared loci across samples, especially among distantly related species. The combination of low similarity clustering (70%) and relaxed quality-filtering (allowing up to 30 ambiguous sites per read) performed the best in phylogenetic analyses at both recent and deep genetic divergences. Hence, this method generated sufficient number of shared homologous loci to investigate the evolutionary relationships among divergent fungal lineages with small haploid genomes. The greater genetic resolution also revealed new structure within species that correlated with ecological traits, providing valuable insights into their cryptic life histories

eScholarship - University of California

Rigorous Design of Fault-Tolerant Transactions for Replicated Database Systems using Event B

Author: Butler Michael
Yadav Divakar
Publication venue: Lecture Notes in Computer Science, Springer , 2006
Publication date: 01/01/2006
Field of study

System availability is improved by the replication of data objects in a distributed database system. However, during updates, the complexity of keeping replicas identical arises due to failures of sites and race conditions among conflicting transactions. Fault tolerance and reliability are key issues to be addressed in the design and architecture of these systems. Event B is a formal technique which provides a framework for developing mathematical models of distributed systems by rigorous description of the problem, gradually introducing solutions in refinement steps, and verification of solutions by discharge of proof obligations. In this paper, we present a formal development of a distributed system using Event B that ensures atomic commitment of distributed transactions consisting of communicating transaction components at participating sites. This formal approach carries the development of the system from an initial abstract specification of transactional updates on a one copy database to a detailed design containing replicated databases in refinement. Through refinement we verify that the design of the replicated database confirms to the one copy database abstraction

Southampton (e-Prints Soton)

Characterization of Palmitoyltransferase Proteins in Arabidopsis thaliana

Author: McGinty Danielle
Publication venue: University of New Hampshire Scholars\u27 Repository
Publication date: 01/01/2018
Field of study

UNH Scholars' Repository

Models for transcript quantification from RNA-Seq

Author: Pachter Lior
Publication venue
Publication date: 12/05/2011
Field of study

RNA-Seq is rapidly becoming the standard technology for transcriptome analysis. Fundamental to many of the applications of RNA-Seq is the quantification problem, which is the accurate measurement of relative transcript abundances from the sequenced reads. We focus on this problem, and review many recently published models that are used to estimate the relative abundances. In addition to describing the models and the different approaches to inference, we also explain how methods are related to each other. A key result is that we show how inference with many of the models results in identical estimates of relative abundances, even though model formulations can be very different. In fact, we are able to show how a single general model captures many of the elements of previously published methods. We also review the applications of RNA-Seq models to differential analysis, and explain why accurate relative transcript abundance estimates are crucial for downstream analyses

arXiv.org e-Print Archive

CiteSeerX