33 research outputs found

    Accurate long read mapping using enhanced suffix arrays

    Get PDF
    With the rise of high throughput sequencing, new programs have been developed for dealing with the alignment of a huge amount of short read data to reference genomes. Recent developments in sequencing technology allow longer reads, but the mappers for short reads are not suited for reads of several hundreds of base pairs. We propose an algorithm for mapping longer reads, which is based on chaining maximal exact matches and uses heuristics and the Needleman-Wunsch algorithm to bridge the gaps. To compute maximal exact matches we use a specialized index structure, called enhanced suffix array. The proposed algorithm is very accurate and can handle large reads with mutations and long insertions and deletions

    Illumina sequencing of 15 deafness genes using fragmented amplicons

    Get PDF
    BACKGROUND: Resequencing of deafness related genes using GS FLX massive parallel sequencing of PCR amplicons spanning selected genes has previously been reported as a successful strategy to discover causal variants. The amplicon lengths were designed to be smaller than the sequencing read length of GS FLX technology, but are longer than Illumina sequencing technology read lengths. Fragmentation is thus required to sequence these amplicons using high throughput Illumina technology. METHODS: We performed Illumina sequencing in 4 patients on 563 multiplexed amplicons covering the exons of 15 genes involved in the hearing process. After exploring several fragmentation strategies, the amplicons were fragmented using Covaris sonication prior to library preparation. CLC genomic workbench was used to analyze the data. RESULTS: We achieve an excellent coverage with more than 99% of the amplicons bases covered. All variants that were previously validated using Sanger sequencing, were also called in this study. Variant calling revealed less false positive and false negative results compared to the previous study. For each patient, several variants were found that are reported by ClinVar as possible hearing loss variants. CONCLUSION: Migration from GS FLX amplicon sequencing to Illumina amplicon sequencing is straightforward and leads to more accurate results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1756-0500-7-509) contains supplementary material, which is available to authorized users

    GLUCAN SYNTHASE-LIKE8 and STEROL METHYLTRANSFERASE2 are required for ploidy consistency of the sexual reproduction system in Arabidopsis

    Get PDF
    In sexually reproducing plants, the meiocyte-producing archesporal cell lineage is maintained at the diploid state to consolidate the formation of haploid gametes. In search of molecular factors that regulate this ploidy consistency, we isolated an Arabidopsis thaliana mutant, called enlarged tetrad2 (et2), which produces tetraploid meiocytes through the stochastic occurrence of premeiotic endomitosis. Endomitotic polyploidization events were induced by alterations in cell wall formation, and similar cytokinetic defects were sporadically observed in other tissues, including cotyledons and leaves. ET2 encodes GLUCAN SYNTHASE-LIKE8 (GSL8), a callose synthase that mediates the deposition of callose at developing cell plates, root hairs, and plasmodesmata. Unlike other gsl8 mutants, in which defects in cell plate formation are seedling lethal, cytokinetic defects in et2 predominantly occur in flowers and have little effect on vegetative growth and development. Similarly, mutations in STEROL METHYLTRANSFERASE2 (SMT2), a major sterol biosynthesis enzyme, also lead to weak cytokinetic defects, primarily in the flowers. In addition, SMT2 allelic mutants also generate tetraploid meiocytes through the ectopic induction of premeiotic endomitosis. These observations demonstrate that appropriate callose and sterol biosynthesis are required for maintaining the ploidy level of the premeiotic germ lineage and that subtle defects in cytokinesis may lead to diploid gametes and polyploid offspring

    Molecular diagnostics for congenital hearing loss including 15 deafness genes using a next generation sequencing platform

    Get PDF
    Background: Hereditary hearing loss (HL) can originate from mutations in one of many genes involved in the complex process of hearing. Identification of the genetic defects in patients is currently labor intensive and expensive. While screening with Sanger sequencing for GJB2 mutations is common, this is not the case for the other known deafness genes (> 60). Next generation sequencing technology (NGS) has the potential to be much more cost efficient. Published methods mainly use hybridization based target enrichment procedures that are time saving and efficient, but lead to loss in sensitivity. In this study we used a semi-automated PCR amplification and NGS in order to combine high sensitivity, speed and cost efficiency. Results: In this proof of concept study, we screened 15 autosomal recessive deafness genes in 5 patients with congenital genetic deafness. 646 specific primer pairs for all exons and most of the UTR of the 15 selected genes were designed using primerXL. Using patient specific identifiers, all amplicons were pooled and analyzed using the Roche 454 NGS technology. Three of these patients are members of families in which a region of interest has previously been characterized by linkage studies. In these, we were able to identify two new mutations in CDH23 and OTOF. For another patient, the etiology of deafness was unclear, and no causal mutation was found. In a fifth patient, included as a positive control, we could confirm a known mutation in TMC1. Conclusions: We have developed an assay that holds great promise as a tool for screening patients with familial autosomal recessive nonsyndromal hearing loss (ARNSHL). For the first time, an efficient, reliable and cost effective genetic test, based on PCR enrichment, for newborns with undiagnosed deafness is available

    Practical Tools to Implement Massive Parallel Pyrosequencing of PCR Products in Next Generation Molecular Diagnostics

    Get PDF
    Despite improvements in terms of sequence quality and price per basepair, Sanger sequencing remains restricted to screening of individual disease genes. The development of massively parallel sequencing (MPS) technologies heralded an era in which molecular diagnostics for multigenic disorders becomes reality. Here, we outline different PCR amplification based strategies for the screening of a multitude of genes in a patient cohort. We performed a thorough evaluation in terms of set-up, coverage and sequencing variants on the data of 10 GS-FLX experiments (over 200 patients). Crucially, we determined the actual coverage that is required for reliable diagnostic results using MPS, and provide a tool to calculate the number of patients that can be screened in a single run. Finally, we provide an overview of factors contributing to false negative or false positive mutation calls and suggest ways to maximize sensitivity and specificity, both important in a routine setting. By describing practical strategies for screening of multigenic disorders in a multitude of samples and providing answers to questions about minimum required coverage, the number of patients that can be screened in a single run and the factors that may affect sensitivity and specificity we hope to facilitate the implementation of MPS technology in molecular diagnostics

    Analysing 454 amplicon resequencing experiments using the modular and database oriented Variant Identification Pipeline

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Next-generation amplicon sequencing enables high-throughput genetic diagnostics, sequencing multiple genes in several patients together in one sequencing run. Currently, no open-source out-of-the-box software solution exists that reliably reports detected genetic variations and that can be used to improve future sequencing effectiveness by analyzing the PCR reactions.</p> <p>Results</p> <p>We developed an integrated database oriented software pipeline for analysis of 454/Roche GS-FLX amplicon resequencing experiments using Perl and a relational database. The pipeline enables variation detection, variation detection validation, and advanced data analysis, which provides information that can be used to optimize PCR efficiency using traditional means. The modular approach enables customization of the pipeline where needed and allows researchers to adopt their analysis pipeline to their experiments. Clear documentation and training data is available to test and validate the pipeline prior to using it on real sequencing data.</p> <p>Conclusions</p> <p>We designed an open-source database oriented pipeline that enables advanced analysis of 454/Roche GS-FLX amplicon resequencing experiments using SQL-statements. This modular database approach allows easy coupling with other pipeline modules such as variant interpretation or a LIMS system. There is also a set of standard reporting scripts available.</p

    Identification of new molecular species using second-generation sequencing

    No full text
    corecore