Article thumbnail

Characterization of Transcription Start Sites of Putative Non-coding RNAs by Multifaceted Use of Massively Paralleled Sequencer

By Nuankanya Sathira, Riu Yamashita, Kousuke Tanimoto, Akinori Kanai, Takako Arauchi, Soutaro Kanematsu, Kenta Nakai, Yutaka Suzuki and Sumio Sugano


On the basis of integrated transcriptome analysis, we show that not all transcriptional start site clusters (TSCs) in the intergenic regions (iTSCs) have the same properties; thus, it is possible to discriminate the iTSCs that are likely to have biological relevance from the other noise-level iTSCs. We used a total of 251 933 381 short-read sequence tags generated from various types of transcriptome analyses in order to characterize 6039 iTSCs, which have significant expression levels. We analyzed and found that 23% of these iTSCs were located in the proximal regions of the RefSeq genes. These RefSeq-linked iTSCs showed similar expression patterns with the neighboring RefSeq genes, had widely fluctuating transcription start sites and lacked ordered nucleosome positioning. These iTSCs seemed not to form independent transcriptional units, simply representing the by-products of the neighboring RefSeq genes, in spite of their significant expression levels. Similar features were also observed for the TSCs located in the antisense regions of the RefSeq genes. Furthermore, for the remaining iTSCs that were not associated with any RefSeq genes, we demonstrate that integrative interpretation of the transcriptome data provides essential information to specify their biological functions in the hypoxic responses of the cells

Topics: Full Papers
Publisher: Oxford University Press
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. (2004). 50-end SAGE for the analysis of transcriptional start sites,
  2. (2005). A systematic search for new mammalian noncoding RNAs indicates little conserved intergenic transcription,
  3. (2004). Characterization of evolutionary rates and constraints in three mammalian genomes,
  4. (2009). Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals,
  5. (2004). Complete sequencing and characterization of 21,243 full-length human cDNAs,
  6. (2003). Construction of a fulllength enriched and a 50-end enriched cDNA library using the oligo-capping method,
  7. (2008). DBTSS: database of transcription start sites progress report,
  8. (2005). Distribution and intensity of constraint in mammalian genomic sequence,
  9. (2008). Divergent transcription from active promoters, Science, 322, 1849–51. 182 Characterization of TSSs of Putative ncRNAs [Vol.
  10. (2001). Diverse transcriptional initiation revealed by fine, largescale mapping of mRNA start sites,
  11. (2008). Dynamic regulation of nucleosome positioning in the human genome,
  12. (2009). Evolution and functions of long noncoding RNAs,
  13. (2006). Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome,
  14. (2006). Filtering transcriptional noise during development: concepts and mechanisms,
  15. (2006). Functionality of intergenic transcription: an evolutionary comparison,
  16. (2007). Genome mapping and expression analyses of human intronic noncoding RNAs reveal tissue-specific patterns and enrichment in genes related to regulation of transcription,
  17. (2007). Genome-wide mapping of in vivo protein-DNA interactions,
  18. (2004). Global identification of human transcribed sequences with genome tiling arrays,
  19. (2007). High-throughput mapping of the chromatin structure of human promoters,
  20. (2007). Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project,
  21. (2003). Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection,
  22. (2008). Long, abundantly expressed non-coding transcripts are altered in cancer,
  23. (2009). Massive transcriptional start site analysis of human genes in hypoxia cells,
  24. (2008). miRBase: tools for microRNA genomics,
  25. (2004). Mouse transcriptome: neutral evolution of ‘non-coding’ complementary DNAs,
  26. (2005). Non-coding RNAs: hope or hype?,
  27. (2009). Nucleosome positioning and gene regulation: advances through genomics,
  28. Ponjavic,J.,Ponting,C.P.andLunter,G.2007,Functionality ortranscriptionalnoise?Evidenceforselectionwithinlong noncoding RNAs,
  29. (2008). Predicting human nucleosome occupancy from primary sequence,
  30. (2008). Ripples from neighbouring transcription,
  31. (2007). Sno/scaRNAbase: a curated database for small nucleolar RNAs and cajal body-specific RNAs,
  32. (2008). Specific expression of long noncoding RNAs in the mouse brain,
  33. (2004). Systematic identification of sense-antisense transcripts in mammalian cells,
  34. (2008). The enigmatic world of mRNA-like ncRNAs: their role in human evolution and in human diseases,
  35. (2006). The gene ontology (GO) project in 2006,
  36. (2007). The H19 non-coding RNA is essential for human tumor growth,
  37. (2007). The imprinted H19 noncoding RNA is a primary microRNA precursor,
  38. (2003). The transcriptional activity of human chromosome 22,
  39. (2005). The transcriptional landscape of the mammalian genome,
  40. (2007). Transcriptional noise and the fidelity of initiation by RNA polymerase II,
  41. (2007). Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome,
  42. (2005). Waste not, want not–transcript excess in multicellular eukaryotes,
  43. (2006). Whole-genome re-sequencing,
  44. (2003). Widespread occurrence of antisense transcription in the human genome,