21 research outputs found

    Optimization Techniques For Next-Generation Sequencing Data Analysis

    Get PDF
    High-throughput RNA sequencing (RNA-Seq) is a popular cost-efficient technology with many medical and biological applications. This technology, however, presents a number of computational challenges in reconstructing full-length transcripts and accurately estimate their abundances across all cell types. Our contributions include (1) transcript and gene expression level estimation methods, (2) methods for genome-guided and annotation-guided transcriptome reconstruction, and (3) de novo assembly and annotation of real data sets. Transcript expression level estimation, also referred to as transcriptome quantification, tackle the problem of estimating the expression level of each transcript. Transcriptome quantification analysis is crucial to determine similar transcripts or unraveling gene functions and transcription regulation mechanisms. We propose a novel simulated regression based method for transcriptome frequency estimation from RNA-Seq reads. Transcriptome reconstruction refers to the problem of reconstructing the transcript sequences from the RNA-Seq data. We present genome-guided and annotation-guided transcriptome reconstruction methods. Empirical results on both synthetic and real RNA-seq datasets show that the proposed methods improve transcriptome quantification and reconstruction accuracy compared to currently state of the art methods. We further present the assembly and annotation of Bugula neritina transcriptome (a marine colonial animal), and Tallapoosa darter genome (a species-rich radiation freshwater fish)

    TRIP: A method for novel transcript reconstruction from paired-end RNA-seq reads

    Get PDF
    Preliminary experimental results on synthetic datasets generated with various sequencing parameters and distribution assumptions show that TRIP has increased transcriptome reconstruction accuracy compared to previous methods that ignore fragment length distribution information

    Une approche parallèle et distribuée pour la complétion d'automates d'arbre

    Get PDF
    National audienceLa technique dite de complétion sur les automates d'arbre permet, à partir d'un automate d'arbre et d'un système de réécriture, de calculer une sur-approximation de l'ensemble des termes accessibles. Cette technique a été utilisée avec succès pour la vérification de protocoles de sécurité et, plus récemment, pour l'analyse des programmes Java. Comme dans toute approche de vérification par model-cheking, nous sommes confrontés à des problèmes d'explosion combinatoire lorsque les exemples deviennent plus conséquents. Dans cet article nous montrons comment faire face à cette situation en parallélisant et distribuant les calculs. Nous présentons aussi quelques résultats expérimentaux qui montrent que l'approche permet d'obtenir de meilleures performances en temps d'exécution

    Antibodies to Enteroviruses in Cerebrospinal Fluid of Patients with Acute Flaccid Myelitis.

    Get PDF
    Acute flaccid myelitis (AFM) has caused motor paralysis in >560 children in the United States since 2014. The temporal association of enterovirus (EV) outbreaks with increases in AFM cases and reports of fever, respiratory, or gastrointestinal illness prior to AFM in >90% of cases suggest a role for infectious agents. Cerebrospinal fluid (CSF) from 14 AFM and 5 non-AFM patients with central nervous system (CNS) diseases in 2018 were investigated by viral-capture high-throughput sequencing (VirCapSeq-VERT system). These CSF and serum samples, as well as multiple controls, were tested for antibodies to human EVs using peptide microarrays. EV RNA was confirmed in CSF from only 1 adult AFM case and 1 non-AFM case. In contrast, antibodies to EV peptides were present in CSF of 11 of 14 AFM patients (79%), significantly higher than controls, including non-AFM patients (1/5 [20%]), children with Kawasaki disease (0/10), and adults with non-AFM CNS diseases (2/11 [18%]) (P = 0.023, 0.0001, and 0.0028, respectively). Six of 14 CSF samples (43%) and 8 of 11 sera (73%) from AFM patients were immunoreactive to an EV-D68-specific peptide, whereas the three control groups were not immunoreactive in either CSF (0/5, 0/10, and 0/11; P = 0.008, 0.0003, and 0.035, respectively) or sera (0/2, 0/8, and 0/5; P = 0.139, 0.002, and 0.009, respectively).IMPORTANCE The presence in cerebrospinal fluid of antibodies to EV peptides at higher levels than non-AFM controls supports the plausibility of a link between EV infection and AFM that warrants further investigation and has the potential to lead to strategies for diagnosis and prevention of disease

    Exclusion and Extraction: Rural Resistance to Gold Mining in Northeast Thailand

    Get PDF
    BackgroundExisting tools for the aggregation and visualization of differential expression data have discrete functionality and require that end-users rely on multiple software packages with complex dependencies or manually manipulate data for analysis and interpretation. Furthermore, at present, data aggregation and visualization are laborious, time consuming, and subject to human error. This is a serious limitation on the current state of differential transcriptomic analysis, which makes it necessary to expend extensive time and resources to reach the point where biological meaning can be interpreted. Such an approach for analysis also leads to scattered and non-standardized code, unsystematic project management and non-reproducible result sets.ResultsHere, we present a differential expression analysis toolkit, DEvis, that provides a powerful, integrated solution for the analysis of differential expression data with a rapid turnaround time. DEvis has simple installation requirements and provides a convenient, user-friendly R package that addresses the issues inherent to complex multi-factor experiments, such as multiple contrast aggregation and integration, result sorting and selection, visualization, project management, and reproducibility. This tool increases the capabilities of differential expression analysis while reducing workload and the potential for manual error. Furthermore, it provides a much-needed encapsulation of scattered functionality, making large and complex analysis more efficient and reproducible.ConclusionDEvis provides a wide range of powerful visualization, data aggregation, and project management tools that provide flexibility and speed in analysis. The functionality provided by DEVis increases efficiency of analysis and supplies researchers with new and relevant means for the analysis of large and complicated transcriptomic experiments. DEvis furthermore incorporates automatic project management capabilities, which standardizes analysis and ensures the reproducibility of results. After the establishment of statistical frameworks that identify differentially expressed genes, this package is the next logical step for differential transcriptomic analysis, establishing the critical framework necessary to manipulate, explore, and extract biologically relevant meaning from differential expression data
    corecore