Ultraconserved cDNA segments in the human transcriptome exhibit resistance to folding and implicate function in translation and alternative splicing

By J. Fah Sathirapongsasuti, Nuankanya Sathira, Yutaka Suzuki, Curtis Huttenhower and Sumio Sugano


Ultraconservation, defined as perfect human-to-rodent sequence identity over at least 200 bp, is a strong indicator of evolutionary and functional importance and has been explored extensively at the genome level. However, it has not been investigated at the transcript level, where such extreme conservation might highlight loci with important post-transcriptional regulatory roles. We present 96 ultraconserved cDNA segments (UCSs), stretches of human mature mRNAs that match identically with orthologous regions in the mouse and rat genomes. UCSs can span multiple exons, a feature we leverage here to elucidate the role of ultraconservation in post-transcriptional regulation. UCS sites are implicated in functions at essentially every post-transcriptional stage: pre-mRNA splicing and degradation through alternative splicing and nonsense-mediated decay (AS-NMD), mature mRNA silencing by miRNA, fast mRNA decay rate and translational repression by upstream AUGs. We also found UCSs to exhibit resistance to formation of RNA secondary structure. These multiple layers of regulation underscore the importance of the UCS-containing genes as key global RNA processing regulators, including members of the serine/arginine-rich protein and heterogeneous nuclear ribonucleoprotein (hnRNP) families of essential splicing regulators. The discovery of UCSs sheds new light on the multifaceted, fine-tuned and tight post-transcriptional regulation of gene families as conserved through the majority of the mammalian lineage.
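The 200-bp perfect-identity criterion in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes the human, mouse and rat sequences have already been aligned column by column (with `-` marking gaps), and the function name `ultraconserved_runs` and its `min_len` parameter are hypothetical names chosen for illustration.

```python
def ultraconserved_runs(human, mouse, rat, min_len=200):
    """Return half-open (start, end) column intervals in which all three
    aligned sequences are identical and gap-free for >= min_len columns.

    Assumption: the inputs are rows of one pre-computed alignment, so they
    have equal length and '-' marks a gap.
    """
    if not (len(human) == len(mouse) == len(rat)):
        raise ValueError("aligned sequences must have equal length")

    # Compare case-insensitively so soft-masked (lowercase) bases still match.
    human, mouse, rat = human.upper(), mouse.upper(), rat.upper()

    runs = []
    start = None  # start column of the current identical run, if any
    for i, (h, m, r) in enumerate(zip(human, mouse, rat)):
        identical = h == m == r and h != "-"
        if identical and start is None:
            start = i
        elif not identical and start is not None:
            if i - start >= min_len:
                runs.append((start, i))
            start = None

    # Close a run that extends to the end of the alignment.
    if start is not None and len(human) - start >= min_len:
        runs.append((start, len(human)))
    return runs


if __name__ == "__main__":
    # Toy example with a small threshold; real use would keep min_len=200.
    print(ultraconserved_runs("ACGTACGTAC", "ACGTTCGTAC", "ACGTACGTAC", min_len=4))
```

Because the scan runs over alignment columns rather than genomic coordinates, a run found in a cDNA-based alignment may cross exon boundaries, which is the property the abstract highlights for UCSs.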

Topics: Computational Biology
Publisher: Oxford University Press
OAI identifier:
Provided by: PubMed Central
