719 research outputs found

    FusionFinder: A Software Tool to Identify Expressed Gene Fusion Candidates from RNA-Seq Data

    Get PDF
    The hallmarks of many haematological malignancies and solid tumours are chromosomal translocations, which may lead to gene fusions. Recently, next-generation sequencing techniques at the transcriptome level (RNA-Seq) have been used to verify known and discover novel transcribed gene fusions. We present FusionFinder, a Perl-based software designed to automate the discovery of candidate gene fusion partners from single-end (SE) or paired-end (PE) RNA-Seq read data. FusionFinder was applied to data from a previously published analysis of the K562 chronic myeloid leukaemia (CML) cell line. Using FusionFinder we successfully replicated the findings of this study and detected additional previously unreported fusion genes in their dataset, which were confirmed experimentally. These included two isoforms of a fusion involving the genes BRK1 and VHL, whose co-deletion has previously been associated with the prevalence and severity of renal-cell carcinoma. FusionFinder is made freely available for non-commercial use and can be downloaded from the project website (http://bioinformatics.childhealthresearch.org.au/software/fusionfinder/)

    High-throughput sequence alignment using Graphics Processing Units

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The recent availability of new, less expensive high-throughput DNA sequencing technologies has yielded a dramatic increase in the volume of sequence data that must be analyzed. These data are being generated for several purposes, including genotyping, genome resequencing, metagenomics, and <it>de novo </it>genome assembly projects. Sequence alignment programs such as MUMmer have proven essential for analysis of these data, but researchers will need ever faster, high-throughput alignment tools running on inexpensive hardware to keep up with new sequence technologies.</p> <p>Results</p> <p>This paper describes MUMmerGPU, an open-source high-throughput parallel pairwise local sequence alignment program that runs on commodity Graphics Processing Units (GPUs) in common workstations. MUMmerGPU uses the new Compute Unified Device Architecture (CUDA) from nVidia to align multiple query sequences against a single reference sequence stored as a suffix tree. By processing the queries in parallel on the highly parallel graphics card, MUMmerGPU achieves more than a 10-fold speedup over a serial CPU version of the sequence alignment kernel, and outperforms the exact alignment component of MUMmer on a high end CPU by 3.5-fold in total application time when aligning reads from recent sequencing projects using Solexa/Illumina, 454, and Sanger sequencing technologies.</p> <p>Conclusion</p> <p>MUMmerGPU is a low cost, ultra-fast sequence alignment program designed to handle the increasing volume of data produced by new, high-throughput sequencing technologies. MUMmerGPU demonstrates that even memory-intensive applications can run significantly faster on the relatively low-cost GPU than on the CPU.</p

    Gene co-regulation by Fezf2 selects neurotransmitter identity and connectivity of corticospinal neurons

    Get PDF
    The neocortex contains an unparalleled diversity of neuronal subtypes, each defined by distinct traits that are developmentally acquired under the control of subtype-specific and pan-neuronal genes. The regulatory logic that orchestrates the expression of these unique combinations of genes is unknown for any class of cortical neuron. Here, we report that Fezf2 is a selector gene able to regulate the expression of gene sets that collectively define mouse corticospinal motor neurons (CSMN). We find that Fezf2 directly induces the glutamatergic identity of CSMN via activation of Vglut1 (Slc17a7) and inhibits a GABAergic fate by repressing transcription of Gad1. In addition, we identify the axon guidance receptor EphB1 as a target of Fezf2 necessary to execute the ipsilateral extension of the corticospinal tract. Our data indicate that co-regulated expression of neuron subtype–specific and pan-neuronal gene batteries by a single transcription factor is one component of the regulatory logic responsible for the establishment of CSMN identity

    Comparative analysis of RNA sequencing methods for degraded or low-input samples

    Get PDF
    available in PMC 2014 January 01RNA-seq is an effective method for studying the transcriptome, but it can be difficult to apply to scarce or degraded RNA from fixed clinical samples, rare cell populations or cadavers. Recent studies have proposed several methods for RNA-seq of low-quality and/or low-quantity samples, but the relative merits of these methods have not been systematically analyzed. Here we compare five such methods using metrics relevant to transcriptome annotation, transcript discovery and gene expression. Using a single human RNA sample, we constructed and sequenced ten libraries with these methods and compared them against two control libraries. We found that the RNase H method performed best for chemically fragmented, low-quality RNA, and we confirmed this through analysis of actual degraded samples. RNase H can even effectively replace oligo(dT)-based methods for standard RNA-seq. SMART and NuGEN had distinct strengths for measuring low-quantity RNA. Our analysis allows biologists to select the most suitable methods and provides a benchmark for future method development.National Institutes of Health (U.S.) (Pioneer Award DP1-OD003958-01)National Human Genome Research Institute (U.S.) (NHGRI) 1P01HG005062-01)National Human Genome Research Institute (U.S.) (NHGRI Center of Excellence in Genome Science Award 1P50HG006193-01)Howard Hughes Medical Institute (Investigator)Merkin Family Foundation for Stem Cell ResearchBroad Institute of MIT and Harvard (Klarman Cell Observatory)National Human Genome Research Institute (U.S.) (NHGRI grant HG03067)Fonds voor Wetenschappelijk Onderzoek--Vlaandere

    Transcriptional and Linkage Analyses Identify Loci that Mediate the Differential Macrophage Response to Inflammatory Stimuli and Infection

    Get PDF
    Macrophages display flexible activation states that range between pro-inflammatory (classical activation) and anti-inflammatory (alternative activation). These macrophage polarization states contribute to a variety of organismal phenotypes such as tissue remodeling and susceptibility to infectious and inflammatory diseases. Several macrophage- or immune-related genes have been shown to modulate infectious and inflammatory disease pathogenesis. However, the potential role that differences in macrophage activation phenotypes play in modulating differences in susceptibility to infectious and inflammatory disease is just emerging. We integrated transcriptional profiling and linkage analyses to determine the genetic basis for the differential murine macrophage response to inflammatory stimuli and to infection with the obligate intracellular parasite Toxoplasma gondii. We show that specific transcriptional programs, defined by distinct genomic loci, modulate macrophage activation phenotypes. In addition, we show that the difference between AJ and C57BL/6J macrophages in controlling Toxoplasma growth after stimulation with interferon gamma and tumor necrosis factor alpha mapped to chromosome 3, proximal to the Guanylate binding protein (Gbp) locus that is known to modulate the murine macrophage response to Toxoplasma. Using an shRNA-knockdown strategy, we show that the transcript levels of an RNA helicase, Ddx1, regulates strain differences in the amount of nitric oxide produced by macrophage after stimulation with interferon gamma and tumor necrosis factor. Our results provide a template for discovering candidate genes that modulate macrophage-mediated complex traits

    Transcriptomic landscape of breast cancers through mRNA sequencing

    Get PDF
    Breast cancer is a heterogeneous disease with a poorly defined genetic landscape, which poses a major challenge in diagnosis and treatment. By massively parallel mRNA sequencing, we obtained 1.2 billion reads from 17 individual human tissues belonging to TNBC, Non-TNBC, and HER2-positive breast cancers and defined their comprehensive digital transcriptome for the first time. Surprisingly, we identified a high number of novel and unannotated transcripts, revealing the global breast cancer transcriptomic adaptations. Comparative transcriptomic analyses elucidated differentially expressed transcripts between the three breast cancer groups, identifying several new modulators of breast cancer. Our study also identified common transcriptional regulatory elements, such as highly abundant primary transcripts, including osteonectin, RACK1, calnexin, calreticulin, FTL, and B2M, and “genomic hotspots” enriched in primary transcripts between the three groups. Thus, our study opens previously unexplored niches that could enable a better understanding of the disease and the development of potential intervention strategies

    Chd8 mediates cortical neurogenesis via transcriptional regulation of cell cycle and Wnt signaling

    Get PDF
    De novo mutations in CHD8 are strongly associated with autism spectrum disorder, but the basic biology of CHD8 remains poorly understood. Here we report that Chd8 knockdown during cortical development results in defective neural progenitor proliferation and differentiation that ultimately manifests in abnormal neuronal morphology and behaviors in adult mice. Transcriptome analysis revealed that while Chd8 stimulates the transcription of cell cycle genes, it also precludes the induction of neural-specific genes by regulating the expression of PRC2 complex components. Furthermore, knockdown of Chd8 disrupts the expression of key transducers of Wnt signaling, and enhancing Wnt signaling rescues the transcriptional and behavioral deficits caused by Chd8 knockdown. We propose that these roles of Chd8 and the dynamics of Chd8 expression during development help negotiate the fine balance between neural progenitor proliferation and differentiation. Together, these observations provide new insights into the neurodevelopmental role of Chd8.National Institutes of Health (U.S.) (Grant UH1-MH106018-03

    BRIE: transcriptome-wide splicing quantication in single cells

    Get PDF
    Abstract Single-cell RNA-seq (scRNA-seq) provides a comprehensive measurement of stochasticity in transcription, but the limitations of the technology have prevented its application to dissect variability in RNA processing events such as splicing. Here, we present BRIE (Bayesian regression for isoform estimation), a Bayesian hierarchical model that resolves these problems by learning an informative prior distribution from sequence features. We show that BRIE yields reproducible estimates of exon inclusion ratios in single cells and provides an effective tool for differential isoform quantification between scRNA-seq data sets. BRIE, therefore, expands the scope of scRNA-seq experiments to probe the stochasticity of RNA processing
    corecore