291,046 research outputs found
Evolution of genome sequencing techniques
The quality and the speed for genome sequencing has advanced at the same time that technology boundaries are
stretched. This advancement has been divided so far in three generations. The first-generation methods enabled
sequencing of clonal DNA populations. The second-generation massively increased throughput by parallelizing many
reactions while the third-generation methods allow direct sequencing of single DNA molecules.
The first techniques to sequence DNA were not developed until the mid-1970s, when two distinct sequencing methods
were developed almost simultaneously, one by Alan Maxam and Walter Gilbert, and the other one by Frederick Sanger.
The first one is a chemical method to cleave DNA at specific points and the second one uses ddNTPs, which synthesizes
a copy from the DNA chain template. Nevertheless, both methods generate fragments of varying lengths that are further
electrophoresed.
Moreover, it is important to say that until the 1990s, the sequencing of DNA was relatively expensive and it was seen as
a long process. Besides, using radiolabeled nucleotides also compounded the problem through safety concerns and
prevented the automation. Some advancements within the first generation include the replacement of radioactive labels
by fluorescent labeled ddNTPs and cycle sequencing with thermostable DNA polymerase, which allows automation and
signal amplification, making the process cheaper, safer and faster. Another method is Pyrosequencing, which is based on
the “sequencing by synthesis” principle. It differs from Sanger sequencing, in that it relies on the detection of
pyrophosphate release on nucleotide incorporation.
By the end of the last millennia, parallelization of this method started the Next Generation Sequencing (NGS) with 454 as
the first of many methods that can process multiple samples, calling it the 2º generation sequencing. Here
electrophoresis was completely eliminated. One of the methods that is sometimes used is SOLiD, based on sequencing
by ligation of fluorescently dye-labeled di-base probes which competes to ligate to the sequencing primer. Specificity of
the di-base probe is achieved by interrogating every 1st and 2nd base in each ligation reaction. The widely used
Solexa/Illumina method uses modified dNTPs containing so called “reversible terminators” which blocks further
polymerization. The terminator also contains a fluorescent label, which can be detected by a camera.
Now, the previous step towards the third generation was in charge of Ion Torrent, who developed a technique that is
based in a method of “sequencing-by-synthesis”. Its main feature is the detection of hydrogen ions that are released
during base incorporation.
Likewise, the third generation takes into account nanotechnology advancements for the processing of unique DNA
molecules to a real time synthesis sequencing system like PacBio; and finally, the NANOPORE, projected since 1995,
also uses Nano-sensors forming channels obtained from bacteria that conducts the sample to a sensor that allows the
detection of each nucleotide residue in the DNA strand.
The advancements in terms of technology that we have nowadays have been so quick, that it makes wonder: ¿How do
we imagine the next generation
ParMap, an Algorithm for the Identification of Complex Genomic Variations in Nextgen Sequencing Data
Next-generation sequencing produces high-throughput data, albeit with greater error and shorter reads than traditional Sanger sequencing methods. This complicates the detection of genomic variations, especially, small insertions and deletions. Here we describe ParMap, a statistical algorithm for the identification of complex genetic variants using partially mapped reads in nextgen sequencing data. We also report ParMap’s successful application to the mutation analysis of chromosome X exome-captured leukemia DNA samples
Optimization of DNA extraction from human urinary samples for mycobiome community profiling.
IntroductionRecent data suggest the urinary tract hosts a microbial community of varying composition, even in the absence of infection. Culture-independent methodologies, such as next-generation sequencing of conserved ribosomal DNA sequences, provide an expansive look at these communities, identifying both common commensals and fastidious organisms. A fundamental challenge has been the isolation of DNA representative of the entire resident microbial community, including fungi.Materials and methodsWe evaluated multiple modifications of commonly-used DNA extraction procedures using standardized male and female urine samples, comparing resulting overall, fungal and bacterial DNA yields by quantitative PCR. After identifying protocol modifications that increased DNA yields (lyticase/lysozyme digestion, bead beating, boil/freeze cycles, proteinase K treatment, and carrier DNA use), all modifications were combined for systematic confirmation of optimal protocol conditions. This optimized protocol was tested against commercially available methodologies to compare overall and microbial DNA yields, community representation and diversity by next-generation sequencing (NGS).ResultsOverall and fungal-specific DNA yields from standardized urine samples demonstrated that microbial abundances differed significantly among the eight methods used. Methodologies that included multiple disruption steps, including enzymatic, mechanical, and thermal disruption and proteinase digestion, particularly in combination with small volume processing and pooling steps, provided more comprehensive representation of the range of bacterial and fungal species. Concentration of larger volume urine specimens at low speed centrifugation proved highly effective, increasing resulting DNA levels and providing greater microbial representation and diversity.ConclusionsAlterations in the methodology of urine storage, preparation, and DNA processing improve microbial community profiling using culture-independent sequencing methods. Our optimized protocol for DNA extraction from urine samples provided improved fungal community representation. Use of this technique resulted in equivalent representation of the bacterial populations as well, making this a useful technique for the concurrent evaluation of bacterial and fungal populations by NGS
Change-point model on nonhomogeneous Poisson processes with application in copy number profiling by next-generation DNA sequencing
We propose a flexible change-point model for inhomogeneous Poisson Processes,
which arise naturally from next-generation DNA sequencing, and derive score and
generalized likelihood statistics for shifts in intensity functions. We
construct a modified Bayesian information criterion (mBIC) to guide model
selection, and point-wise approximate Bayesian confidence intervals for
assessing the confidence in the segmentation. The model is applied to DNA Copy
Number profiling with sequencing data and evaluated on simulated spike-in and
real data sets.Comment: Published in at http://dx.doi.org/10.1214/11-AOAS517 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Length distribution of sequencing by synthesis: fixed flow cycle model
Sequencing by synthesis is the underlying technology for many next-generation
DNA sequencing platforms. We developed a new model, the fixed flow cycle model,
to derive the distributions of sequence length for a given number of flow
cycles under the general conditions where the nucleotide incorporation is
probabilistic and may be incomplete, as in some single-molecule sequencing
technologies. Unlike the previous model, the new model yields the probability
distribution for the sequence length. Explicit closed form formulas are derived
for the mean and variance of the distribution.Comment: 27 pages, 5 figure
A Pilot Study Comparing HPV-Positive and HPV-Negative Head and Neck Squamous Cell Carcinomas by Whole Exome Sequencing.
Background. Next-generation sequencing of cancers has identified important therapeutic targets and biomarkers. The goal of this pilot study was to compare the genetic changes in a human papillomavirus- (HPV-)positive and an HPV-negative head and neck tumor. Methods. DNA was extracted from the blood and primary tumor of a patient with an HPV-positive tonsillar cancer and those of a patient with an HPV-negative oral tongue tumor. Exome enrichment was performed using the Agilent SureSelect All Exon Kit, followed by sequencing on the ABI SOLiD platform. Results. Exome sequencing revealed slightly more mutations in the HPV-negative tumor (73) in contrast to the HPV-positive tumor (58). Multiple mutations were noted in zinc finger genes (ZNF3, 10, 229, 470, 543, 616, 664, 638, 716, and 799) and mucin genes (MUC4, 6, 12, and 16). Mutations were noted in MUC12 in both tumors. Conclusions. HPV-positive HNSCC is distinct from HPV-negative disease in terms of evidence of viral infection, p16 status, and frequency of mutations. Next-generation sequencing has the potential to identify novel therapeutic targets and biomarkers in HNSCC
Next-generation sequencing: applications beyond genomes.
The development of DNA sequencing more than 30 years ago has profoundly impacted biological research. In the last couple of years, remarkable technological innovations have emerged that allow the direct and cost-effective sequencing of complex samples at unprecedented scale and speed. These next-generation technologies make it feasible to sequence not only static genomes, but also entire transcriptomes expressed under different conditions. These and other powerful applications of next-generation sequencing are rapidly revolutionizing the way genomic studies are carried out. Below, we provide a snapshot of these exciting new approaches to understanding the properties and functions of genomes. Given that sequencing-based assays may increasingly supersede microarray-based assays, we also compare and contrast data obtained from these distinct approaches
MISSEL: a method to identify a large number of small species-specific genomic subsequences and its application to viruses classification
Continuous improvements in next generation sequencing technologies led to ever-increasing collections of genomic sequences, which have not been easily characterized by biologists, and whose analysis requires huge computational effort. The classification of species emerged as one of the main applications of DNA analysis and has been addressed with several approaches, e.g., multiple alignments-, phylogenetic trees-, statistical- and character-based methods
VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research
Accurate variant calling in next generation sequencing (NGS) is critical to understand cancer genomes better. Here we present VarDict, a novel and versatile variant caller for both DNA- and RNA-sequencing data. VarDict simultaneously calls SNV, MNV, InDels, complex and structural variants, expanding the detected genetic driver landscape of tumors. It performs local realignments on the fly for more accurate allele frequency estimation. VarDict performance scales linearly to sequencing depth, enabling ultra-deep sequencing used to explore tumor evolution or detect tumor DNA circulating in blood. In addition, VarDict performs amplicon aware variant calling for polymerase chain reaction (PCR)-based targeted sequencing often used in diagnostic settings, and is able to detect PCR artifacts. Finally, VarDict also detects differences in somatic and loss of heterozygosity variants between paired samples. VarDict reprocessing of The Cancer Genome Atlas (TCGA) Lung Adenocarcinoma dataset called known driver mutations in KRAS, EGFR, BRAF, PIK3CA and MET in 16% more patients than previously published variant calls. We believe VarDict will greatly facilitate application of NGS in clinical cancer research
- …
