
    Genomic Profiling of Collaborative Cross Founder Mice Infected with Respiratory Viruses Reveals Novel Transcripts and Infection-Related Strain-Specific Gene and Isoform Expression

    Genetic variation between diverse mouse species is well-characterized, yet existing knowledge of the mouse transcriptome comes largely from one mouse strain (C57BL/6J). As such, it is unlikely to reflect the transcriptional complexity of the mouse species. Gene transcription is dynamic and condition-specific; therefore, to better understand the mouse transcriptional response to respiratory virus infection, we infected the eight founder strains of the Collaborative Cross with either influenza A virus or severe acute respiratory syndrome coronavirus and sequenced lung RNA samples at 2 and 4 days after infection. We found numerous instances of transcripts that were not present in the C57BL/6J reference annotation, indicating that a nontrivial proportion of the mouse genome is transcribed but poorly annotated. Of these novel transcripts, 2150 could be aligned to human or rat genomes, but not to existing mouse genomes, suggesting functionally conserved sequences not yet recorded in mouse genomes. We also found that respiratory virus infection induced differential expression of 4287 splicing junctions, resulting in strain-specific isoform expression. Of these, 59 were influenced by strain-specific mutations within 2 base pairs of key intron–exon boundaries, suggesting cis-regulated expression. Our results reveal the complexity of the transcriptional response to viral infection, previously undocumented genomic elements, and extensive diversity in the response across mouse strains. These findings identify hitherto unexplored transcriptional patterns and undocumented transcripts in genetically diverse mice. Host genetic variation drives the complexity and diversity of the host response by eliciting starkly different transcriptional profiles in response to a viral infection

    Annotation of long non-coding RNAs expressed in Collaborative Cross founder mice in response to respiratory virus infection reveals a new class of interferon-stimulated transcripts

    The outcome of respiratory virus infection is determined by a complex interplay of viral and host factors. Some potentially important host factors for the antiviral response, whose functions remain largely unexplored, are long non-coding RNAs (lncRNAs). Here we systematically inferred the regulatory functions of host lncRNAs in response to influenza A virus and severe acute respiratory syndrome coronavirus (SARS-CoV) based on their similarity in expression with genes of known function. We performed total RNA-Seq on viral-infected lungs from eight mouse strains, yielding a large data set of transcriptional responses. Overall 5,329 lncRNAs were differentially expressed after infection. Most of the lncRNAs were co-expressed with coding genes in modules enriched in genes associated with lung homeostasis pathways or immune response processes. Each lncRNA was further individually annotated using a rank-based method, enabling us to associate 5,295 lncRNAs to at least one gene set and to predict their potential cis effects. We validated the lncRNAs predicted to be interferon-stimulated by profiling mouse responses after interferon-α treatment. Altogether, these results provide a broad categorization of potential lncRNA functions and identify subsets of lncRNAs with likely key roles in respiratory virus pathogenesis. These data are fully accessible through the MOuse NOn-Code Lung interactive database (MONOCLdb)

    The distribution of inverted repeat sequences in the Saccharomyces cerevisiae genome

    Although a variety of possible functions have been proposed for inverted repeat sequences (IRs), it is not known which of them might occur in vivo. We investigate this question by assessing the distributions and properties of IRs in the Saccharomyces cerevisiae (SC) genome. Using the IRFinder algorithm we detect 100,514 IRs having copy length greater than 6 bp and spacer length less than 77 bp. To assess statistical significance we also determine the IR distributions in two types of randomization of the S. cerevisiae genome. We find that the S. cerevisiae genome is significantly enriched in IRs relative to random. The S. cerevisiae IRs are significantly longer and contain fewer imperfections than those from the randomized genomes, suggesting that processes to lengthen and/or correct errors in IRs may be operative in vivo. The S. cerevisiae IRs are highly clustered in intergenic regions, while their occurrence in coding sequences is consistent with random. Clustering is stronger in the 3′ flanks of genes than in their 5′ flanks. However, the S. cerevisiae genome is not enriched in those IRs that would extrude cruciforms, suggesting that this is not a common event. Various explanations for these results are considered
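    The search criteria quoted in this abstract (copy length greater than 6 bp, spacer length less than 77 bp) can be illustrated with a minimal sketch. This is not the IRFinder algorithm: it is a brute-force scan that only reports perfect inverted repeats with a fixed arm length, and the function names and parameters (`find_inverted_repeats`, `arm_len`, `max_spacer`) are hypothetical choices made for illustration.

```python
# Hypothetical illustration only; NOT the IRFinder algorithm from the abstract.
# Finds perfect inverted repeats: an arm, a spacer, then the arm's reverse complement.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_inverted_repeats(seq, arm_len=7, max_spacer=76):
    """Return (start, spacer_length) for every perfect inverted repeat.

    Defaults mirror the thresholds in the abstract: arm longer than 6 bp
    (so 7) and spacer shorter than 77 bp (so at most 76). Imperfect
    repeats and arm extension are deliberately not handled.
    """
    seq = seq.upper()
    n = len(seq)
    hits = []
    for i in range(n - 2 * arm_len + 1):
        left = seq[i:i + arm_len]
        if set(left) - set("ACGT"):
            continue  # skip windows containing ambiguous bases such as N
        target = reverse_complement(left)
        for spacer in range(max_spacer + 1):
            j = i + arm_len + spacer  # start of the candidate right arm
            if j + arm_len > n:
                break
            if seq[j:j + arm_len] == target:
                hits.append((i, spacer))
    return hits


if __name__ == "__main__":
    # GATTACA, a 3 bp spacer, then its reverse complement TGTAATC
    print(find_inverted_repeats("GATTACATTTTGTAATC"))
```

    Comparing hit counts against shuffled versions of the same sequence would be one simple way to approximate the enrichment test the abstract describes, although the authors' two randomization schemes are not specified here.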

    Global analysis of estrogen receptor beta binding to breast cancer cell genome reveals an extensive interplay with estrogen receptor alpha for target gene regulation

    Background: Estrogen receptors alpha (ERa) and beta (ERb) are transcription factors (TFs) that mediate estrogen signaling and define the hormone-responsive phenotype of breast cancer (BC). The two receptors can be found co-expressed and play specific, often opposite, roles, with ERb being able to modulate the effects of ERa on gene transcription and cell proliferation. ERb is frequently lost in BC, where its presence generally correlates with a better prognosis of the disease. The identification of the genomic targets of ERb in hormone-responsive BC cells is thus a critical step to elucidate the roles of this receptor in estrogen signaling and tumor cell biology. Results: Expression of full-length ERb in hormone-responsive, ERa-positive MCF-7 cells resulted in a marked reduction in cell proliferation in response to estrogen and marked effects on the cell transcriptome. By ChIP-Seq we identified 9702 ERb and 6024 ERa binding sites in estrogen-stimulated cells, comprising sites occupied by either ERb, ERa or both ER subtypes. A search for TF binding matrices revealed that the majority of the binding sites identified comprise one or more Estrogen Response Elements and the remaining sites show binding matrices for other TFs known to mediate ER interaction with chromatin by tethering, including AP2, E2F and SP1. Of 921 genes differentially regulated by estrogen in ERb+ vs ERb- cells, 424 showed one or more ERb sites within 10 kb. These putative primary ERb target genes control cell proliferation, death, differentiation, motility and adhesion, signal transduction and transcription, key cellular processes that might explain the biological and clinical phenotype of tumors expressing this ER subtype. ERb binding in close proximity to several miRNA genes and in the mitochondrial genome suggests the possible involvement of this receptor in small non-coding RNA biogenesis and mitochondrial genome functions.
Conclusions: Results indicate that the vast majority of the genomic targets of ERb can also bind ERa, suggesting that the overall action of ERb on the genome of hormone-responsive BC cells depends mainly on the relative concentration of both ERs in the cell
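    The "site within 10 kb" criterion in this abstract can be sketched as a simple interval check. The abstract does not say whether distance is measured from the gene body or from the transcription start site; the sketch below assumes the gene body extended by the window on each side. The function name, data layout, and example coordinates are all hypothetical.

```python
# Hypothetical sketch: flag genes that have at least one binding site
# within `window` bp of the gene body. Measuring from the gene body is an
# assumption; the abstract does not state the reference point.


def genes_near_sites(genes, sites, window=10_000):
    """genes: dict name -> (chrom, start, end); sites: list of (chrom, pos)."""
    by_chrom = {}
    for chrom, pos in sites:
        by_chrom.setdefault(chrom, []).append(pos)

    near = []
    for name, (chrom, start, end) in genes.items():
        for pos in by_chrom.get(chrom, []):
            if start - window <= pos <= end + window:
                near.append(name)
                break  # one qualifying site is enough
    return near


if __name__ == "__main__":
    # Made-up coordinates for illustration only
    genes = {
        "geneA": ("chr1", 50_000, 60_000),
        "geneB": ("chr1", 200_000, 210_000),
        "geneC": ("chr2", 50_000, 60_000),
    }
    sites = [("chr1", 41_000), ("chr1", 150_000)]
    print(genes_near_sites(genes, sites))
```

    For genome-scale data an interval tree or a sorted-position bisect would scale better than this nested loop; the linear scan is kept here for readability.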

    Base-Pair Resolution DNA Methylation Sequencing Reveals Profoundly Divergent Epigenetic Landscapes in Acute Myeloid Leukemia

    We have developed an enhanced form of reduced representation bisulfite sequencing with extended genomic coverage, which resulted in greater capture of DNA methylation information of regions lying outside of traditional CpG islands. Applying this method to primary human bone marrow specimens from patients with acute myelogenous leukemia (AML), we demonstrated that genetically distinct AML subtypes display diametrically opposed DNA methylation patterns. As compared to normal controls, we observed widespread hypermethylation in IDH mutant AMLs, preferentially targeting promoter regions and CpG islands neighboring the transcription start sites of genes. In contrast, AMLs harboring translocations affecting the MLL gene displayed extensive loss of methylation of an almost mutually exclusive set of CpGs, which instead affected introns and distal intergenic CpG islands and shores. When analyzed in conjunction with gene expression profiles, it became apparent that these specific patterns of DNA methylation result in differing roles in gene expression regulation. However, despite this subtype-specific DNA methylation patterning, a much smaller set of CpG sites is consistently affected in both AML subtypes. Most CpG sites in this common core of aberrantly methylated CpGs were hypermethylated in both AML subtypes. Therefore, aberrant DNA methylation patterns in AML do not occur in a stereotypical manner but rather are highly specific and associated with specific driving genetic lesions

    Enrichment post-library preparation enhances the sensitivity of high-throughput sequencing-based detection and characterization of viruses from complex samples

    Background: Sequencing-based detection and characterization of viruses in complex samples can suffer from lack of sensitivity due to a variety of factors including, but not limited to, low titer, small genome size, and contribution of host or environmental nucleic acids. Hybridization-based target enrichment is one potential method for increasing the sensitivity of viral detection via high-throughput sequencing. Results: This study expands upon two previously developed panels of virus enrichment probes (for filoviruses and for respiratory viruses) to include other viruses of biodefense and/or biosurveillance concern to the U.S. Department of Defense and various international public health agencies. The newly expanded and combined panel is tested using carefully constructed synthetic metagenomic samples that contain clinically relevant amounts of viral genetic material. Target enrichment results in a dramatic increase in sensitivity for virus detection as compared to shotgun sequencing, yielding full, deeply covered viral genomes from materials with Ct values suggesting that amplicon sequencing would be likely to fail. Increased pooling to improve cost- and time-effectiveness does not negatively affect the ability to obtain full-length viral genomes, even in the case of co-infections, although as expected, it does decrease depth of coverage. Conclusions: Hybridization-based target enrichment is an effective solution to obtain full-length viral genomes for samples from which virus detection would fail via unbiased, shotgun sequencing or even via amplicon sequencing. As the development and testing of probe sets for viral target enrichment expands and continues, the application of this technique, in conjunction with deeper pooling strategies, could make high-throughput sequencing more economical for routine use in biosurveillance, biodefense and outbreak investigations

    Bead-linked transposomes enable a normalization-free workflow for NGS library preparation

    Background: Transposome-based technologies have enabled the streamlined production of sequencer-ready DNA libraries; however, current methods are highly sensitive to the amount and quality of input nucleic acid. Results: We describe a new library preparation technology (Nextera DNA Flex) that utilizes a known concentration of transposomes conjugated directly to beads to bind a fixed amount of DNA, and enables direct input of blood and saliva using an integrated extraction protocol. We further report results from libraries generated outside the standard parameters of the workflow, highlighting novel applications for Nextera DNA Flex, including human genome builds and variant calling from below 1 ng DNA input, customization of insert size, and preparation of libraries from short fragments and severely degraded FFPE samples. Using this bead-linked library preparation method, library yield saturation was observed at an input amount of 100 ng. Preparation of libraries from a range of species with varying GC levels demonstrated uniform coverage of small genomes. For large and complex genomes, coverage across the genome, including difficult regions, was improved compared with other library preparation methods. Libraries were successfully generated from amplicons of varying sizes (from 50 bp to 11 kb); however, a decrease in efficiency was observed for amplicons smaller than 250 bp. This library preparation method was also compatible with poor-quality DNA samples, with sequenceable libraries prepared from formalin-fixed paraffin-embedded samples with varying levels of degradation. Conclusions: In contrast to solution-based library preparation, this bead-based technology produces a normalized, sequencing-ready library for a wide range of DNA input types and amounts, largely obviating the need for DNA quantitation. The robustness of this bead-based library preparation kit and the flexibility of input DNA facilitate application across a wide range of fields

    mRNA-Seq of Single Prostate Cancer Circulating Tumor Cells Reveals Recapitulation of Gene Expression and Pathways Found in Prostate Cancer

    Circulating tumor cells (CTC) mediate metastatic spread of many solid tumors and enumeration of CTCs is currently used as a prognostic indicator of survival in metastatic prostate cancer patients. Some evidence suggests that it is possible to derive additional information about tumors from expression analysis of CTCs, but the technical difficulty of isolating and analyzing individual CTCs has limited progress in this area. To assess the ability of a new generation of MagSweeper to isolate intact CTCs for downstream analysis, we performed mRNA-Seq on single CTCs isolated from the blood of patients with metastatic prostate cancer and on single prostate cancer cell line LNCaP cells spiked into the blood of healthy donors. We found that the MagSweeper effectively isolated CTCs with a capture efficiency that matched the CellSearch platform. However, unlike CellSearch, the MagSweeper facilitates isolation of individual live CTCs without contaminating leukocytes. Importantly, mRNA-Seq analysis showed that the MagSweeper isolation process did not have a discernible impact on the transcriptional profile of single LNCaPs isolated from spiked human blood, suggesting that any perturbations caused by the MagSweeper process on the transcriptional signature of isolated cells are modest. Although the RNA from patient CTCs showed signs of significant degradation, consistent with reports of short half-lives and apoptosis amongst CTCs, transcriptional signatures of prostate tissue and of cancer were readily detectable with single CTC mRNA-Seq. These results demonstrate that the MagSweeper provides access to intact CTCs and that these CTCs can potentially supply clinically relevant information.
