Search CORE

Public Library of Science (PLOS)

High Sensitivity TSS Prediction: Estimates of Locations Where TSS Cannot Occur

Author: A Kanhere
C Wei
Chikatoshi Kai
GJ McLachlan
H Kawaji
H Wakaguri
I Korf
I Ovcharenko
JL Rinn
Jun Kawai
K Maruyama
L Ponger
MC Frith
MG Reese
N Cohen
P Carninci
P Carninci
P Carninci
P Carninci
P Kapranov
P Kapranov
P Ng
Piero Carninci
Rimantas Kodzius
RV Davuluri
S Hashimoto
S Knudsen
T Shiraki
TA Down
Timothy Ravasi
U Ohler
Ulf Schaefer
VB Bajic
VB Bajic
VB Bajic
VB Bajic
VB Bajic
VB Bajic
VB Bajic
VB Bajic
Vladimir B. Bajic
VV Solovyev
Y Sugahara
Yoshihide Hayashizaki
Publication venue: Public Library of Science
Publication date: 15/11/2010
Field of study

Although transcription in mammalian genomes can initiate from various genomic positions (e.g., 3′UTR, coding exons, etc.), most locations on genomes are not prone to transcription initiation. It is of practical and theoretical interest to be able to estimate such collections of non-TSS locations (NTLs). The identification of large portions of NTLs can contribute to better focusing the search for TSS locations and thus contribute to promoter and gene finding. It can help in the assessment of 5′ completeness of expressed sequences, contribute to more successful experimental designs, as well as more accurate gene annotation.Using comprehensive collections of Cap Analysis of Gene Expression (CAGE) and other transcript data from mouse and human genomes, we developed a methodology that allows us, by performing computational TSS prediction with very high sensitivity, to annotate, with a high accuracy in a strand specific manner, locations of mammalian genomes that are highly unlikely to harbor transcription start sites (TSSs). The properties of the immediate genomic neighborhood of 98,682 accurately determined mouse and 113,814 human TSSs are used to determine features that distinguish genomic transcription initiation locations from those that are not likely to initiate transcription. In our algorithm we utilize various constraining properties of features identified in the upstream and downstream regions around TSSs, as well as statistical analyses of these surrounding regions.

Mapping the strand-specific transcriptome of fission yeast

Author: E Birney
JR Manak
N Dutrow
P Carninci
P Kapranov
S Efroni
S Katayama
Thomas R Gingeras
TR Gingeras
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Pervasive genome-wide transcription is widespread in eukaryotic cells, but key features of the transcriptome have yet to be fully characterized. a new study using antibody-based detection of RNA-DNA duplexes on tiling arrays now reveals a complex, strand-specific transcriptional world in fission yeast

Cold Spring Harbor Laboratory Institutional Repository

The effect of genetic variation on promoter usage and enhancer activity.

Author: Antonarakis S.E.
Carninci P.
Delaneau O.
Dermitzakis E.T.
Fish R.J.
Fort A.
Garieri M.
Mull D.
Santoni F.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

The identification of genetic variants affecting gene expression, namely expression quantitative trait loci (eQTLs), has contributed to the understanding of mechanisms underlying human traits and diseases. The majority of these variants map in non-coding regulatory regions of the genome and their identification remains challenging. Here, we use natural genetic variation and CAGE transcriptomes from 154 EBV-transformed lymphoblastoid cell lines, derived from unrelated individuals, to map 5376 and 110 regulatory variants associated with promoter usage (puQTLs) and enhancer activity (eaQTLs), respectively. We characterize five categories of genes associated with puQTLs, distinguishing single from multi-promoter genes. Among multi-promoter genes, we find puQTL effects either specific to a single promoter or to multiple promoters with variable effect orientations. Regulatory variants associated with opposite effects on different mRNA isoforms suggest compensatory mechanisms occurring between alternative promoters. Our analyses identify differential promoter usage and modulation of enhancer activity as molecular mechanisms underlying eQTLs related to regulatory elements

Serveur académique lausannois

Cold Spring Harbor Laboratory Institutional Repository

Archive ouverte UNIGE

Multiplicity of 5' Cap Structures Present on Short RNAs

Author: Abdelhamid R. F.
Carninci P.
de Hoon M.
Gingeras T. R.
Isobe T.
Plessy C.
Taoka M.
Yamauchi Y.
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 31/07/2014
Field of study

Most RNA molecules are co- or post-transcriptionally modified to alter their chemical and functional properties to assist in their ultimate biological function. Among these modifications, the addition of 5' cap structure has been found to regulate turnover and localization. Here we report a study of the cap structure of human short (<200 nt) RNAs (sRNAs), using sequencing of cDNA libraries prepared by enzymatic pretreatment of the sRNAs with cap sensitive-specificity, thin layer chromatographic (TLC) analyses of isolated cap structures and mass spectrometric analyses for validation of TLC analyses. Processed versions of snoRNAs and tRNAs sequences of less than 50 nt were observed in capped sRNA libraries, indicating additional processing and recapping of these annotated sRNAs biotypes. We report for the first time 2,7 dimethylguanosine in human sRNAs cap structures and surprisingly we find multiple type 0 cap structures (mGpppC, 7mGpppG, GpppG, GpppA, and 7mGpppA) in RNA length fractions shorter than 50 nt. Finally, we find the presence of additional uncharacterized cap structures that wait determination by the creation of needed reference compounds to be used in TLC analyses. These studies suggest the existence of novel biochemical pathways leading to the processing of primary and sRNAs and the modifications of their RNA 5' ends with a spectrum of chemical modifications

Suppression of artifacts and barcode bias in high-throughput transcriptome analyses utilizing template switching

Author: Alon
Ana Maria Suzuki
Baltimore
Batut
Benjamini
Bourgon
Carninci
Carninci
Charles Plessy
Cloonan
Dave T. P. Tang
Fan
Goetz
Guttman
Hirzmann
Islam
Islam
Jayaprakash
Kapteyn
Kawano
Kivioja
Ko
Konig
Lassmann
Li
Li
Maeda
Marioni
Matsumura
Matz
Md Salimullah
Needleman
Ohtake
Piero Carninci
Plessy
Raffaella Calligaris
Ramskold
Robinson
Salimullah
Schmidt
Schneider
Shiroguchi
Stefano Gustincich
Takahashi
Temin
Trapnell
Wang
Zhu
Publication venue: 'Oxford University Press (OUP)'
Publication date: 23/11/2012
Field of study

Template switching (TS) has been an inherent mechanism of reverse transcriptase, which has been exploited in several transcriptome analysis methods, such as CAGE, RNA-Seq and short RNA sequencing. TS is an attractive option, given the simplicity of the protocol, which does not require an adaptor mediated step and thus minimizes sample loss. As such, it has been used in several studies that deal with limited amounts of RNA, such as in single cell studies. Additionally, TS has also been used to introduce DNA barcodes or indexes into different samples, cells or molecules. This labeling allows one to pool several samples into one sequencing flow cell, increasing the data throughput of sequencing and takes advantage of the increasing throughput of current sequences. Here, we report TS artifacts that form owing to a process called strand invasion. Due to the way in which barcodes/indexes are introduced by TS, strand invasion becomes more problematic by introducing unsystematic biases. We describe a strategy that eliminates these artifacts in silico and propose an experimental solution that suppresses biases from TS

Archivio istituzionale della ricerca - Università di Trieste

University of Missouri: MOspace

Sissa Digital Library

Opening the black box of outer space: the case of Jason-3

Author: Antonarakis S. E.
Borel C.
Carninci P.
Cobellis G.
Falconnet E.
Fort A.
Garieri M.
Guipponi M.
Letourneau A.
Ribaux P.
Santoni F.
Vannier A.
Publication venue: 'University of Missouri Libraries'
Publication date: 01/01/2015
Field of study

If you look at a rendering of planet Earth from a bird's eye view, you will see satellites orbiting the planet like electrons, each one a testament to humanity's expansion beyond Earth's atmosphere. It begs the question: what is this new humanized landscape? The dominant voice that has attempted to answer this question is the realist one, which has led the charge of academic inquiry into outer space since the fateful launch of the Sputnik in 1957. Though enlightening in some respects, the realist perspective oftentimes obscures the heterogeneous complexity of the actors, actions, limits and possibilities that have constructed this very humanized outer space. This paper looks at the humanization of outer space through the lens of JASON-3, an internationally collaborative satellite designed primarily to measure the topography of the Earth's oceans. A vast number of actors collaborated to enact the network that created JASON-3, including bureaucratic agencies, academics, private contractors, political bodies, other satellites, the sun and even gravity. This paper will focus on these actors and the work that they did to form the network, showing a glimpse of the entangled connections that eventually produced JASON-3. Through telling this story, I argue: (1) outer space is more complex than state-level relations and (2) critical geography -- with its insight into relational spaces and deconstructing power structures -- has a unique place to fill in outer space literature

Serveur académique lausannois

Archivio Istituzionale della Ricerca - Università degli Studi della Campania "Luigi Vanvitelli"

The Francis Crick Institute

Archive ouverte UNIGE

Dual-initiation promoters with intertwined canonical and TCT/TOP transcription start sites diversify transcript processing

Author: Andersen JB
Balwierz P
Cardenas R
Carninci P
Hadzhiev Y
Lenhard B
Müller F
Nepal C
Peers B
Suzuki A-M
Tarifeño-Saldivia E
Wragg JW
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/11/2019
Field of study

Variations in transcription start site (TSS) selection reflect diversity of preinitiation complexes and can impact on post-transcriptional RNA fates. Most metazoan polymerase II-transcribed genes carry canonical initiation with pyrimidine/purine (YR) dinucleotide, while translation machinery-associated genes carry polypyrimidine initiator (5'-TOP or TCT). By addressing the developmental regulation of TSS selection in zebrafish we uncovered a class of dual-initiation promoters in thousands of genes, including snoRNA host genes. 5'-TOP/TCT initiation is intertwined with canonical initiation and used divergently in hundreds of dual-initiation promoters during maternal to zygotic transition. Dual-initiation in snoRNA host genes selectively generates host and snoRNA with often different spatio-temporal expression. Dual-initiation promoters are pervasive in human and fruit fly, reflecting evolutionary conservation. We propose that dual-initiation on shared promoters represents a composite promoter architecture, which can function both coordinately and divergently to diversify RNAs

Spiral - Imperial College Digital Repository

Automated Workflow for Preparation of cDNA for Cap Analysis of Gene Expression on a Single Molecule Sequencer

Author: A Barski
A Goren
A Mortazavi
Ai Kaiho
Alistair R. R. Forrest
BT Wilhelm
C Bock
C Hart
E Farias-Hesson
E Valen
Eri Saijo
G Robertson
GJ Faulkner
GKCo Scientists
H Suzuki
Hideya Kawaji
J Kaiser
JC Marioni
M Frommer
M Kanamori-Katayama
M Maekawa
M Sultan
M Vitezic
Marina Lizio
Masayoshi Itoh
Miki Kojima
MM DeAngelis
MS Hestand
Mutsumi Kanamori-Katayama
N Cloonan
NJ Lennon
P Carninci
P Carninci
P Carninci
Piero Carninci
PJ Park
S Tsuchiya
Sayaka Nagao-Sato
T Shiraki
TEP Consortium
Timo Lassmann
TL Hawkins
Toshi Shioda
TS Mikkelsen
U Nagalakshmi
WF Scherer
Yoshihide Hayashizaki
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Background: Cap analysis of gene expression (CAGE) is a 59 sequence tag technology to globally determine transcriptional starting sites in the genome and their expression levels and has most recently been adapted to the HeliScope single molecule sequencer. Despite significant simplifications in the CAGE protocol, it has until now been a labour intensive protocol. Methodology: In this study we set out to adapt the protocol to a robotic workflow, which would increase throughput and reduce handling. The automated CAGE cDNA preparation system we present here can prepare 96 ‘HeliScope ready ’ CAGE cDNA libraries in 8 days, as opposed to 6 weeks by a manual operator.We compare the results obtained using the same RNA in manual libraries and across multiple automation batches to assess reproducibility. Conclusions: We show that the sequencing was highly reproducible and comparable to manual libraries with an 8 fold increase in productivity. The automated CAGE cDNA preparation system can prepare 96 CAGE sequencing samples simultaneously. Finally we discuss how the system could be used for CAGE on Illumina/SOLiD platforms, RNA-seq and fulllengt

CiteSeerX

Public Library of Science (PLOS)

The Francis Crick Institute

Large-scale clustering of CAGE tag expression data

Author: A Clare
DB Kell
DJ Lockhart
FJ Stott
GC Tseng
H Akaike
J Rissanen
JH Badger
JM Berg
Jun Kawai
Kazuro Shimokawa
KY Yeung
L Cai
M Dash
M Furuno
M Schena
Martin C Frith
MB Eisen
MLJ de Hoon
MR Anderberg
P Carninci
P Carninci
P Kapranov
Piero Carninci
R Kodzius
R Miki
S Saha
S Tavazoie
T Kasukawa
T Kurita
T Shiraki
Takio Kurita
VE Velculescu
Y Okazaki
Y Zhang
Yoshihide Hayashizaki
Yuko Okamura-Oho
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background: Recent analyses have suggested that many genes possess multiple transcription start sites (TSSs) that are differentially utilized in different tissues and cell lines. We have identified a huge number of TSSs mapped onto the mouse genome using the cap analysis of gene expression (CAGE) method. The standard hierarchical clustering algorithm, which gives us easily understandable graphical tree images, has difficulties in processing such huge amounts of TSS data and a better method to calculate and display the results is needed. Results: We use a combination of hierarchical and non-hierarchical clustering to cluster expression profiles of TSSs based on a large amount of CAGE data to profit from the best of both methods. We processed the genome-wide expression data, including 159,075 TSSs derived from 127 RNA samples of various organs of mouse, and succeeded in categorizing them into 70-100 clusters. The clusters exhibited intriguing biological features: a cluster supergroup with a ubiquitous expression profile, tissue-specific patterns, a distinct distribution of non-coding RNA and functional TSS groups. Conclusion: Our approach succeeded in greatly reducing the calculation cost, and is an appropriate solution for analyzing large-scale TSS usage data

Springer - Publisher Connector