26 research outputs found

    Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human

    Get PDF
    BACKGROUND: Widespread transcription activities in the human genome were recently observed in high-resolution tiling array experiments, which revealed many novel transcripts that are outside of the boundaries of known protein or RNA genes. Termed as "TARs" (Transcriptionally Active Regions), these novel transcribed regions represent "dark matter" in the genome, and their origin and functionality need to be explained. Many of these transcripts are thought to code for novel proteins or non-protein-coding RNAs. We have applied an integrated bioinformatics approach to investigate the properties of these TARs, including cross-species conservation, and the ability to form stable secondary structures. The goal of this study is to identify a list of potential candidate sequences that are likely to code for functional non-protein-coding RNAs. We are particularly interested in the discovery of those functional RNA candidates that are primate-specific, i.e. those that do not have homologs in the mouse or dog genomes but in rhesus. RESULTS: Using sequence conservation and the probability of forming stable secondary structures, we have identified ~300 possible candidates for primate-specific noncoding RNAs. We are currently in the process of sequencing the orthologous regions of these candidate sequences in several other primate species. We will then be able to apply a "phylogenetic shadowing" approach to analyze the functionality of these ncRNA candidates. CONCLUSION: The existence of potential primate-specific functional transcripts has demonstrated the limitation of previous genome comparison studies, which put too much emphasis on conservation between human and rodents. It also argues for the necessity of sequencing additional primate species to gain a better and more comprehensive understanding of the human genome

    Secondary structure prediction for aligned RNA sequences.

    Get PDF
    Most functional RNA molecules have characteristic secondary structures that are highly conserved in evolution. Here we present a method for computing the consensus structure of a set aligned RNA sequences taking into account both thermodynamic stability and sequence covariation. Comparison with phylogenetic structures of rRNAs shows that a reliability of prediction of more than 80% is achieved for only five related sequences. As an application we show that the Early Noduline mRNA contains significant secondary structure that is supported by sequence covariation

    Identification of long non-protein coding RNAs in chicken skeletal muscle using next generation sequencing

    Get PDF
    AbstractVertebrate genomes encode thousands of non-coding RNAs including short non-coding RNAs (such as microRNAs) and long non-coding RNAs (lncRNAs). Chicken (Gallus gallus) is an important model organism for developmental biology, and the recently assembled genome sequences for chicken will facilitate the understanding of the functional roles of non-coding RNA genes during development. The present study concerns the first systematic identification of lncRNAs using RNA-Seq to sample the transcriptome during chicken muscle development. A computational approach was used to identify 281 new intergenic lncRNAs in the chicken genome. Novel lncRNAs in general are less conserved than protein-coding genes and slightly more conserved than random non-coding sequences. The present study has provided an initial chicken lncRNA catalog and greatly increased the number of chicken ncRNAs in the non-protein coding RNA database. Furthermore, the computational pipeline presented in the current work will be useful for characterizing lncRNAs obtained from deep sequencing data

    Function of lncRNAs and approaches to lncRNA-protein interactions

    Get PDF

    Allele-Specific Gene Expression Is Widespread Across the Genome and Biological Processes

    Get PDF
    Allelic specific gene expression (ASGE) appears to be an important factor in human phenotypic variability and as a consequence, for the development of complex traits and diseases. In order to study ASGE across the human genome, we have performed a study in which genotyping was coupled with an analysis of ASGE by screening 11,500 SNPs using the Mapping 10 K Array to identify differential allelic expression. We found that from the 5,133 SNPs that were suitable for analysis (heterozygous in our sample and expressed in peripheral blood mononuclear cells), 2,934 (57%) SNPs had differential allelic expression. Such SNPs were equally distributed along human chromosomes and biological processes. We validated the presence or absence of ASGE in 18 out 20 SNPs (90%) randomly selected by real time PCR in 48 human subjects. In addition, we observed that SNPs close to -but not included in- segmental duplications had increased levels of ASGE. Finally, we found that transcripts of unknown function or non-coding RNAs, also display ASGE: from a total of 2,308 intronic SNPs, 1510 (65%) SNPs underwent differential allelic expression. In summary, ASGE is a widespread mechanism in the human genome whose regulation seems to be far more complex than expected

    Mammalian microRNAs: a small world for fine-tuning gene expression

    Get PDF
    The basis of eukaryotic complexity is an intricate genetic architecture where parallel systems are involved in tuning gene expression, via RNA-DNA, RNA-RNA, RNA-protein, and DNA-protein interactions. In higher organisms, about 97% of the transcriptional output is represented by noncoding RNA (ncRNA) encompassing not only rRNA, tRNA, introns, 5′ and 3′ untranslated regions, transposable elements, and intergenic regions, but also a large, rapidly emerging family named microRNAs. MicroRNAs are short 20-22-nucleotide RNA molecules that have been shown to regulate the expression of other genes in a variety of eukaryotic systems. MicroRNAs are formed from larger transcripts that fold to produce hairpin structures and serve as substrates for the cytoplasmic Dicer, a member of the RNase III enzyme family. A recent analysis of the genomic location of human microRNA genes suggested that 50% of microRNA genes are located in cancer-associated genomic regions or in fragile sites. This review focuses on the possible implications of microRNAs in post-transcriptional gene regulation in mammalian diseases, with particular focus on cancer. We argue that developing mouse models for deleted and/or overexpressed microRNAs will be of invaluable interest to decipher the regulatory networks where microRNAs are involved

    Allele-specific gene expression is widespread across the genome and biological processes. PLoS One 4

    Get PDF
    Abstract Allelic specific gene expression (ASGE) appears to be an important factor in human phenotypic variability and as a consequence, for the development of complex traits and diseases. In order to study ASGE across the human genome, we have performed a study in which genotyping was coupled with an analysis of ASGE by screening 11,500 SNPs using the Mapping 10 K Array to identify differential allelic expression. We found that from the 5,133 SNPs that were suitable for analysis (heterozygous in our sample and expressed in peripheral blood mononuclear cells), 2,934 (57%) SNPs had differential allelic expression. Such SNPs were equally distributed along human chromosomes and biological processes. We validated the presence or absence of ASGE in 18 out 20 SNPs (90%) randomly selected by real time PCR in 48 human subjects. In addition, we observed that SNPs close to -but not included in-segmental duplications had increased levels of ASGE. Finally, we found that transcripts of unknown function or non-coding RNAs, also display ASGE: from a total of 2,308 intronic SNPs, 1510 (65%) SNPs underwent differential allelic expression. In summary, ASGE is a widespread mechanism in the human genome whose regulation seems to be far more complex than expected
    corecore