196 research outputs found

    CAGE-TSSchip: promoter-based expression profiling using the 5'-leading label of capped transcripts

    Get PDF
    A novel approach that combines CAGE expression analysis with oligonucleotide array technology allows for the accurate and sensitive detection of promoter-based transcriptional activity

    Embryonic LTR retrotransposons supply promoter modules to somatic tissues

    Get PDF
    Long terminal repeat (LTR) retrotransposons are widely distributed across the human genome. They have accumulated through retroviral integration into germline DNA and are latent genetic modules. Active LTR promoters are observed in germline cells; however, little is known about the mechanisms underlying their active transcription in somatic tissues. Here, by integrating our previous transcriptome data set with publicly available data sets, we show that the LTR families MLT2A1 and MLT2A2 are primarily expressed in human four-cell and eight-cell embryos and are also activated in some adult somatic tissues, particularly pineal gland. Three MLT2A elements function as the promoters and first exons of the protein-coding genes ABCE1, COL5A1, and GALNT13 specifically in the pineal gland of humans but not in that of macaques, suggesting that the exaptation of these LTRs as promoters occurred during recent primate evolution. This analysis provides insight into the possible transition from germline insertion to somatic expression of LTR retrotransposons.Peer reviewe

    CAGE Basic/Analysis Databases: the CAGE resource for comprehensive promoter analysis

    Get PDF
    Cap-analysis gene expression (CAGE) Basic and Analysis Databases store an original resource produced by CAGE, which measures expression levels of transcription starting sites by sequencing large amounts of transcript 5′ ends, termed CAGE tags. Millions of human and mouse high-quality CAGE tags derived from different conditions in >20 tissues consisting of >250 RNA samples are essential for identification of novel promoters and promoter characterization in the aspect of expression profile. CAGE Basic Database is a primary database of the CAGE resource, RNA samples, CAGE libraries, CAGE clone and tag sequences and so on. CAGE Analysis Database stores promoter related information, such as counts of related transcripts, CpG islands and conserved genome region. It also provides expression profiles at base pair and promoter levels. Both databases are based on the same framework, CAGE tag starting sites, tag clusters for defining promoters and transcriptional units (TUs). Their associations and TU attributes are available to find promoters of interest. These databases were provided for Functional Annotation Of Mouse 3 (FANTOM3), an international collaboration research project focusing on expanding the transcriptome and subsequent analyses. Now access is free for all users through the World Wide Web at

    Dynamic usage of transcription start sites within core promoters

    Get PDF
    BACKGROUND: Mammalian promoters do not initiate transcription at single, well defined base pairs, but rather at multiple, alternative start sites spread across a region. We previously characterized the static structures of transcription start site usage within promoters at the base pair level, based on large-scale sequencing of transcript 5' ends. RESULTS: In the present study we begin to explore the internal dynamics of mammalian promoters, and demonstrate that start site selection within many mouse core promoters varies among tissues. We also show that this dynamic usage of start sites is associated with CpG islands, broad and multimodal promoter structures, and imprinting. CONCLUSION: Our results reveal a new level of biologic complexity within promoters - fine-scale regulation of transcription starting events at the base pair level. These events are likely to be related to epigenetic transcriptional regulation

    The Continued Absence of Functional Germline Stem Cells in Adult Ovaries

    Get PDF
    Ovaries are central to development, fertility, and reproduction of women. A particularly interesting feature of ovaries is their accelerated aging compared to other tissues, leading to loss of function far before other organs senesce. The limited pool of ovarian follicles is generated before birth and once exhausted, menopause will inevitably commence around the age of 50 years marking the end of fertility. Yet, there are reports suggesting the presence of germline stem cells and neo-oogenesis in adult human ovaries. These observations have fueled a long debate, created experimental fertility treatments, and opened business opportunities. Our recent analysis of cell types in the ovarian cortex of women of fertile age could not find evidence of germline stem cells. Like before, our work has been met with critique suggesting methodological shortcomings. We agree that excellence starts with methods and welcome discussion on the pros and cons of different protocols. In this commentary, we discuss the recent re-interpretation of our work.Peer reviewe

    Guide for library design and bias correction for large-scale transcriptome studies using highly multiplexed RNAseq methods

    Get PDF
    Background Standard RNAseq methods using bulk RNA and recent single-cell RNAseq methods use DNA barcodes to identify samples and cells, and the barcoded cDNAs are pooled into a library pool before high throughput sequencing. In cases of single-cell and low-input RNAseq methods, the library is further amplified by PCR after the pooling. Preparation of hundreds or more samples for a large study often requires multiple library pools. However, sometimes correlation between expression profiles among the libraries is low and batch effect biases make integration of data between library pools difficult. Results We investigated 166 technical replicates in 14 RNAseq libraries made using the STRT method. The patterns of the library biases differed by genes, and uneven library yields were associated with library biases. The former bias was corrected using the NBGLM-LBC algorithm, which we present in the current study. The latter bias could not be corrected directly, but could be solved by omitting libraries with particularly low yields. A simulation experiment suggested that the library bias correction using NBGLM-LBC requires a consistent sample layout. The NBGLM-LBC correction method was applied to an expression profile for a cohort study of childhood acute respiratory illness, and the library biases were resolved. Conclusions The R source code for the library bias correction named NBGLM-LBC is available at and . This method is applicable to correct the library biases in various studies that use highly multiplexed sequencing-based profiling methods with a consistent sample layout with samples to be compared (e.g., "cases" and "controls") equally distributed in each library.Peer reviewe
    • …
    corecore