Search CORE

53 research outputs found

Interoperability and FAIRness through a novel combination of Web technologies

Author: Bolleman Jerven T.
Bonino da Silva Santos Luiz Olavo
Ciccarese Paolo
Clark Tim
Dumontier Michel
Gavai Anand
Gray Alasdair J. G.
Kaliyaperumal Rajaram
Kelpin Fleur D. L.
Kuzniar Arnold
Schultes Erik A.
Swertz Morris A.
Thompson Mark
van Mulligen Erik M.
Verborgh Ruben
Wilkinson Mark D.
Publication venue: 'PeerJ'
Publication date: 01/01/2017
Field of study

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT). These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not. The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability. Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings. We show that by using off-the-shelf technologies, interoperability can be achieved atthe level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles. The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs

Maastricht University Research Portal

Heriot Watt Pure

Proceedings - University of Groningen

Crossref

University of Groningen

ARTS repository - University of Groningen

Ghent University Academic Bibliography

Directory of Open Access Journals

Dissertations of the University of Groningen

The implicitome: A resource for rationalizing gene-disease associations

Author: Bruskiewich R. (Richard)
Dunnen J.T. (Johan) den
Emmelien A. (Aten)
Good B.M. (Benjamin M.)
Haagen H.H.H.B.M. (Herman) van
Hettne K.M. (Kristina)
Hoen P.A.C. (Peter) 't
Kaliyaperumal R. (Rajaram)
Kors J.A. (Jan)
Laros J.F.J. (Jeroen F.)
Li T.S. (Tong Shu)
Mina E. (Eleni)
Mons B. (Barend)
Roos M. (Marco)
Schuemie M.J. (Martijn)
Schultes E. (Erik)
Su A.I. (Andrew I.)
Tatum Z. (Zuotian)
Thompson M. (Mark)
Van Der Horst E. (Eelke)
Van Mulligen E.M. (Erik M.)
Van Ommen G.-J.B. (Gert-Jan B.)
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/02/2016
Field of study

High-throughput experimental methods such as medical sequencing and genome-wide association studies (GWAS) identify increasingly large numbers of potential relations between genetic variants and diseases. Both biological complexity (millions of potential gene-disease associations) and the accelerating rate of data production necessitate computational approaches to prioritize and rationalize potential gene-disease relations. Here, we use concept profile technology to expose from the biomedical literature both explicitly stated gene-disease relations (the explicitome) and a much larger set of implied gene-disease associations (the implicitome). Implicit relations are largely unknown to, or are even unintended by the original authors, but they vastly extend the reach of existing

Erasmus University Digital Repository

The FAIR Guiding Principles for scientific data management and stewardship

Author: Aalbersberg I.J. (Ijsbrand Jan)
Appleton G. (Gabrielle)
Axton M. (Myles)
Baak A. (Arie)
Blomberg N. (Niklas)
Boiten J.W. (Jan-Willem)
Bourne P.E. (Philip)
Bouwman J. (Jildau)
Brookes A.J. (Anthony)
Clark T. (Tim)
Crosas M. (Mercè)
Dillo I. (Ingrid)
Dumon O. (Olivier)
Dumontier M. (Michel)
Edmunts S. (Scott)
Evelo C.T. (Chris)
Finkers R. (Richard)
Goble C.A. (Carole Ann)
Gonzalez-Beltran A. (Alejandra)
Gray A. (Alastair)
Grethe S. (Jeffrey)
Groth P. (Paul)
Heringa J. (Jaap)
Hoen P.A.C. (Peter) 't
Hooft R. (Rob)
Kok J. (Joost)
Kok R. (Ruben)
Kuhn T. (Tobias)
Lei J. (Johan) van der
Lusher S.J. (Scott)
Martone M.E. (Maryann)
Mons A. (Albert)
Mons B. (Barend)
Mulligen E.M. (Erik) van
Packer A. (Abel)
Persson B. (Bengt)
Roca-Serra P. (Philippe)
Roos M. (Marco)
Sansone S.A. (Susanna-Assunta)
Schaik R. (Rene) van
Schultes E. (Erik)
Sengstag T. (Thierry)
Silva Santos L.B. (Luiz Bonino) da
Slater T. (Ted)
Strawn G. (George)
Swertz M. (Morris)
Thompson M. (Mark)
Velterop J. (Jan)
Waagmeester A. (Andra)
Wilkinson J.M. (Mark)
Wittenburg P. (Peter)
Wolstencroft K. (Katherine)
Zhao J. (Jun)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/03/2016
Field of study

There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community

Erasmus University Digital Repository

Gateways to the FANTOM5 promoter level mammalian expression atlas

Author: A Kruger
A Subramanian
AB Chetverin
AI Su
AJ Enright
Albin Sandelin
Alexander D Diehl
Alistair RR Forrest
AR Quinlan
ARR Forrest
B Mons
B Smith
BE Bernstein
C Bizer
C Rosse
C Wu
Carsten O Daub
Christopher J Mungall
CJ Mungall
D Shalon
Derek W Wright
Emmanuel Dimont
Erik A Schultes
Erik Arner
Fumi Hori
G Rustici
GA Churchill
GP Patrinos
H Kawaji
H Kawaji
H Kawaji
H Kawaji
H Li
H Soejima
H Suzuki
H Takahashi
Hidemasa Bono
Hideya Kawaji
Hiromasa Ono
Hisashi Shimoji
Imad Abugessaisa
J Kawai
J Kenneth Baillie
J Severin
J Takeda
J Wang
Jayson Harshbarger
JC Bryne
Jessica Severin
K Ikeo
Kaori Fujieda
Koro Nishikata
LR Meyer
M Ashburner
M Itoh
M Kanamori-Katayama
M Kapushesky
Marina Lizio
Mark Thompson
Masayoshi Itoh
MD Robinson
Michael Rehli
Michiel de Hoon
N Mitsuhashi
NCBI
Nicolas Bertin
P Carninci
P Flicek
Peter AC ‘t Hoen
Piero Carninci
PJ Cock
PL Whetzel
R Andersson
R Edgar
R Yamashita
Rajaram Kaliyaperumal
S Anders
S Djebali
S Dongen van
S Povey
SA Sansone
Sachi Ishikawa-Kato
Serkan Sahin
Shiro Fukuda
T Beissbarth
T Kasukawa
T Lassmann
T Shiraki
T Suzuki
Takeya Kasukawa
TC Freeman
Terrence F Meehan
Tetsuro Toyoda
TF Rayner
The ENCODE Project Consortium
Timo Lassmann
Tom C Freeman
Toshiaki Katayama
W Fujibuchi
Winston Hide
Y Okazaki
Yoshihide Hayashizaki
YX Qi
Z Tatum
Zuotian Tatum
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

The FANTOM5 project investigates transcription initiation activities in more than 1,000 human and mouse primary cells, cell lines and tissues using CAGE. Based on manual curation of sample information and development of an ontology for sample classification, we assemble the resulting data into a centralized data resource (http://fantom.gsc.riken.jp/5/). This resource contains web-based tools and data-access points for the research community to search and extract data related to samples, genes, promoter activities, transcription factors and enhancers across the FANTOM5 atlas. Electronic supplementary material The online version of this article (doi:10.1186/s13059-014-0560-6) contains supplementary material, which is available to authorized users

Crossref

Harvard University - DASH

Springer - Publisher Connector

PubMed Central

Copenhagen University Research Information System

Edinburgh Research Explorer

Leiden University Scholary Publications

Enlighten

White Rose Research Online

The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome.

Author: 't Hoen Peter A C
Alam Intikhab
Albanese Davide
Altschuler Gabriel M.
Andersson Robin
Arakawa Takahiro
Archer John A C
Arner Erik
Arner Peter
Babina Magda
Bajic Vladimir B.
Baker Sarah
Balwierz Piotr J.
Beckhouse Anthony G.
Bertin Nicolas
Blake Judith A.
Blumenthal Antje
Bodega Beatrice
Bonetti Alessandro
Briggs James
Brombacher Frank
Califano Andrea
Cannistraci Carlo V.
Carbajo Daniel
Carninci Piero
Chen Yun
Chierici Marco
Ciani Yari
Clevers Hans C.
Dalla Emiliano
Daub Carsten O.
Davis Carrie A.
de Hoon Michiel J L
de Lima Morais David A.
Detmar Michael
Diehl Alexander D.
Dimont Emmanuel
Dohi Taeko
Drabløs Finn
Edge Albert S B
Edinger Matthias
Ekwall Karl
Endoh Mitsuhiro
Enomoto Hideki
Fagiolini Michela
Fairbairn Lynsey
Fang Hai
Farach-Carson Mary C.
Faulkner Geoffrey J.
Favorov Alexander V.
Fisher Malcolm E.
Forrest Alistair R R
Francescatto Margherita
Freeman Tom C.
Frith Martin C.
Fujita Rie
Fukuda Shiro
Furlanello Cesare
Furuno Masaaki
Furusawa Jun ichi
Geijtenbeek Teunis B.
Ghanbarian Avazeh T.
Gibson Andrew
Gingeras Thomas
Goldowitz Daniel
Gough Julian
Guhl Sven
Guler Reto
Gustincich Stefano
Ha Thomas J.
Haberle Vanja
Hamaguchi Masahide
Hara Mitsuko
Harbers Matthias
Harshbarger Jayson
Hasegawa Akira
Hasegawa Yuki
Hashimoto Takehiro
Hayashizaki Yoshihide
Herlyn Meenhard
Heutink Peter
Hide Winston
Hitchens Kelly J.
Ho Sui Shannan J.
Hofmann Oliver M.
Hoof Ilka
Hori Fumi
Hume David A.
Huminiecki Lukasz
Huminiecki Lukasz
Hurst Laurence D.
Iida Kei
Ikawa Tomokatsu
Ishizu Yuri
Itoh Masayoshi
Jankovic Boris R.
Jia Hui
Joshi Anagha
Jurman Giuseppe
Jørgensen Mette
Kaczkowski Bogumil
Kai Chieko
Kaida Kaoru
Kaiho Ai
Kajiyama Kazuhiro
Kanamori Mutsumi Katayama
Kasianov Artem S.
Kasukawa Takeya
Katayama Shintaro
Kato Sachi
Kawaguchi Shuji
Kawai Jun
Kawamoto Hiroshi
Kawamura Yuki I.
Kawashima Tsugumi
Kempfle Judith S.
Kenna Tony J.
Kenneth Baillie J.
Kere Juha
Khachigian Levon M.
Kitamura Toshio
Knox Alan J.
Kojima Miki
Kojima Soichi
Kondo Naoto
Koseki Haruhiko
Koyasu Shigeo
Krampitz Sarah
Kubosaki Atsutaka
Kulakovskiy Ivan V.
Kwon Andrew T.
Laros Jeroen F J
Lassmann Timo
Lee Weonju
Lenhard Boris
Lennartsson Andreas
Li Kang
Lilje Berit
Lipovich Leonard
Lizio Marina
Mackay Alan sim
Makeev Vsevolod J.
Manabe Riichiroh
Mar Jessica C.
Marchand Benoit
Mathelier Anthony
Maxwell Burroughs A.
Medvedeva Yulia A.
Meehan Terrence F.
Mejhert Niklas
Meynert Alison
Mizuno Yosuke
Morikawa Hiromasa
Morimoto Mitsuru
Moro Kazuyo
Motakis Efthymios
Motohashi Hozumi
Mummery Christine L.
Mungall Christopher J.
Murata Mitsuyoshi
Nagao Sayaka Sato
Nakachi Yutaka
Nakahara Fumio
Nakamura Toshiyuki
Nakamura Yukio
Nakazato Kenichi
Ninomiya Noriko
Nishiyori Hiromi
Noma Shohei
Nozaki Tadasuke
Ogishima Soichi
Ohkura Naganari
Ohmiya Hiroko
Ohno Hiroshi
Ohshima Mitsuhiro
Okada Mariko Hatakeyama
Okazaki Yasushi
Orlando Valerio
Ovchinnikov Dmitry A.
Pain Arnab
Passier Robert
Patrikakis Margaret
Persson Helena
Peter Klinken S.
Piazza Silvano
Plessy Charles
Pradhan Swati Bhatt
Prendergast James G D
Rackham Owen J L
Ramilowski Jordan A.
Rashid Mamoon
Ravasi Timothy
Rehli Michael
Rizzu Patrizia
Roncador Marco
Roy Sugata
Rye Morten B.
Saijyo Eri
Sajantila Antti
Saka Akiko
Sakaguchi Shimon
Sakai Mizuho
Sandelin Albin Gustav
Sato Hiroki
Satoh Hironori
Savvi Suzana
Saxena Alka
Schaefer Ulf
Schmeier Sebastian
Schmidl Christian
Schneider Claudio
Schultes Erik A.
Schulze-Tanzil Gundula G.
Schwegmann Anita
Semple Colin A.
Sengstag Thierry
Severin Jessica
Sheng Guojun
Shimoji Hisashi
Shimoni Yishai
Shin Jay W.
Simon Christophe
Sugiyama Daisuke
Sugiyama Takaaki
Summers Kim M.
Suzuki Harukazu
Suzuki Masanori
Suzuki Naoko
Swoboda Rolf K.
Tagami Michihira
Takahashi Naoko
Takai Jun
Tanaka Hiroshi
Tatsukawa Hideki
Tatum Zuotian
Taylor Martin S.
Thompson Mark
Toyoda Hiroo
Toyoda Tetsuro
Valen Eivind
van de Wetering Marc
van den Berg Linda M.
van Nimwegen Erik
Verardo Roberto
Vijayan Dipti
Vitezic Morana
Vorontsov Ilya E.
Wasserman Wyeth W.
Watanabe Shoko
Wells Christine A.
Winteringham Louise N.
Wolvetang Ernst
Wood Emily J.
Yamaguchi Yoko
Yamamoto Masayuki
Yoneda Misako
Yonekura Yohei
Yoshida Shigehiro
Young Robert S.
Zabierowski Suzan E.
Zhang Peter G.
Zhao Xiaobei
Zucchelli Silvia
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2015
Field of study

X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression profiles of X-linked genes. Tissues whose tissue-specific genes are very highly expressed (e.g., secretory tissues, tissues abundant in structural proteins) are also tissues in which gene expression is relatively rare on the X chromosome. These trends cannot be fully accounted for in terms of alternative models of biased expression. In conclusion, the notion that it is hard for genes on the Therian X to be highly expressed, owing to transcriptional traffic jams, provides a simple yet robustly supported rationale of many peculiar features of X's gene content, gene expression, and evolution

Cold Spring Harbor Laboratory Institutional Repository

ZENODO

Directory of Open Access Journals

Edinburgh Research Explorer

Electronic Archiving System

FigShare

Public Library of Science (PLOS)

Repository for Publications and Research Data

Crossref

edoc

Dryad Digital Repository (Duke University)

PubMed Central

Copenhagen University Research Information System

eScholarship - University of California

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Utrecht University Repository

DSpace at Rice University

University of Melbourne Institutional Repository

ScholarBank@NUS

Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network

Author: Abugessaisa Imad
Aitken Stuart
Aken Bronwen L.
Alam Intikhab
Alam Tanvir
Alasiri Rami
Alhendi Ahmad M. N.
Alinejad-Rokny Hamid
Alvarez Mariano J.
Andersson Robin
Arakawa Takahiro
Araki Marito
Arbel Taly
Archer John
Archibald Alan L.
Arner Erik
Arner Peter
Asai Kiyoshi
Ashoor Haitham
Astrom Gaby
Babina M.
Baillie J.K.
Bajic V.B.
Bajpai A.
Baker S.
Baldarelli R.M.
Balic A.
Bansal M.
Batagov A.O.
Batzoglou S.
Beckhouse A.G.
Beltrami A.P.
Beltrami C.A.
Bertin Nicolas
Bessière Chloé
Bhattacharya S.
Bickel P.J.
Blake J.A.
Blanchette M.
Bodega B.
Bonetti A.
Bono H.
Bornholdt J.
Bougouffa S.
Boyd M.
Breda J.
Brombacher F.
Brown J.B.
Bréhélin L.
Bttcher M.
Bult C.J.
Burroughs A.M.
Burt D.W.
Busch A.
Caglio G.
Califano A.
Cameron C.J.
Cannistraci C.V.
Carbone A.
Carlisle A.J.
Carninci Piero
Carninci Piero
Carter K.W.
Cesselli D.
Chang J.-C.
Chatelain Clement
Chen J.C.
Chen Y.
Chierici M.
Christodoulou J.
Ciani Y.
Clark E.L.
Coskun M.
Dalby M.
Dalla E.
Daub C.O.
Davis C.A.
de Hoom Michiel J. L.
de Hoom Michiel J. L.
de Rie D.
Denisenko E.
Deplancke B.
Detmar M.
Deviatiiarov R.
Di Bernardo D.
Diehl A.D.
Dieterich L.C.
Dimont E.
Djebali S.
Dohi T.
Dostie J.
Drablos F.
Edge A.S.B.
Edinger M.
Ehrlund A.
Ekwall K.
Elofsson A.
Endoh M.
Enomoto H.
Enomoto S.
Faghihi M.
Fagiolini M.
FANTOM consortium.
Farach-Carson M.C.
Faulkner G.J.
Favorov A.
Fernandes A.M.
Ferrai C.
Forrest A.R.R.
Forrester L.M.
Forsberg M.
Fort A.
Francescatto M.
Freeman T.C.
Frith Martin C.
Frith Martin C.
Fukuda S.
Funayama M.
Furlanello C.
Furuno M.
Furusawa C.
Gao H.
Gazova I.
Gebhard C.
Geier F.
Geijtenbeek T.B.H.
Ghosh S.
Ghosheh Y.
Gingeras T.R.
Gojobori T.
Goldberg T.
Goldowitz D.
Gough J.
Grapotte Mathys
Greco D.
Gruber A.J.
Guhl S.
Guigo R.
Guler R.
Gusev O.
Gustincich S.
Ha T.J.
Haberle V.
Hale P.
Hallstrom B.M.
Hamada M.
Handoko L.
Hara M.
Harbers M.
Harrow J.
Harshbarger J.
Hase T.
Hasegawa Akira
Hasegawa Akira
Hashimoto K.
Hatano T.
Hattori N.
Hayashi R.
Hayashizaki Yoshihide
Hayashizaki Yoshihide
Herlyn M.
Hettne K.
Heutink P.
Hide W.
Hitchens K.J.
Hon C.C.
Hori F.
Horie M.
Horimoto K.
Horton P.
Hou R.
Huang E.
Huang Y.
Hugues R.
Hume D.
Ienasescu H.
Iida K.
Ikawa T.
Ikemura T.
Ikeo K.
Inoue N.
Ishizu Y.
Ito Y.
Itoh Masayoshi
Itoh Masayoshi
Ivshina A.V.
Jankovic B.R.
Jenjaroenpun P.
Johnson R.
Jorgensen M.
Jorjani H.
Joshi A.
Jurman G.
Kaczkowski B.
Kai C.
Kaida K.
Kajiyama K.
Kaliyaperumal R.
Kaminuma E.
Kanaya T.
Kaneda H.
Kapranov P.
Kasianov A.S.
Kasukawa Takeya
Kasukawa Takeya
Katayama T.
Kato S.
Kawaguchi S.
Kawai J.
Kawaji H.
Kawamoto H.
Kawamura Y.I.
Kawasaki S.
Kawashima T.
Kempfle J.S.
Kenna T.J.
Kere J.
Khachigian L.
Kiryu H.
Kishima M.
Kitajima H.
Kitamura T.
Kitano H.
Klaric E.
Klepper K.
Klinken S.P.
Kloppmann E.
Knox A.J.
Kodama Y.
Kogo Y.
Kojima M.
Kojima S.
Kojima-Ishiyama Miki
Komatsu N.
Komiyama H.
Kono T.
Koseki H.
Koyasu S.
Kratz A.
Kukalev A.
Kulakovskiy I.
Kundaje A.
Kunikata H.
Kuo R.
Kuo T.
Kuraku S.
Kuznetsov V.A.
Kwon T.J.
Larouche M.
Lassmann T.
Laurent G.S.
Law A.
Le-Cao K.-A.
Lecellier C.-H.
Lecellier C.-H.
Lee W.
Lenhard B.
Lennartsson A.
Li K.
Li R.
Lilje B.
Lipovich L.
Lizio M.
Lopez G.
Magi S.
Mak G.K.
Makeev V.
Manabe R.
Mandai M.
Mar J.
Maruyama K.
Maruyama T.
Mason E.
Mathelier A.
Matsuda H.
Medvedeva Y.A.
Meehan T.F.
Mejhert N.
Menichelli Christophe
Meynert A.
Mikami N.
Minoda A.
Miura H.
Miyagi Y.
Miyawaki A.
Mizuno Y.
Morikawa H.
Morimoto M.
Morioka M.
Morishita S.
Moro K.
Motakis E.
Motohashi H.
Mukarram A.K.
Mummery C.L.
Mungall C.J.
Murakawa Y.
Muramatsu M.
Murata Mitsuyoshi
Murata Mitsuyoshi
Nagasaka K.
Nagase T.
Nakachi Y.
Nakahara F.
Nakai K.
Nakamura K.
Nakamura Y.
Nakamura Y.
Nakazawa T.
Nason G.P.
Nepal C.
Nguyen Q.H.
Nielsen L.K.
Nishida K.
Nishiguchi K.M.
Nishiyori H.
Nishiyori-Sueki Hiromi
Nitta K.
Noguchi Shuhei
Noguchi Shuhei
Noma Shohei
Noma Shohei
Notredame C.
Ogishima S.
Ohkura N.
Ohno H.
Ohshima M.
Ohtsu T.
Okada Y.
Okada-Hatakeyama M.
Okazaki Y.
Oksvold P.
Orlando V.
Ow G.S.
Ozturk M.
Pachkov M.
Paparountas T.
Parihar S.P.
Park S.-J.
Pascarella G.
Passier R.
Persson H.
Philippens I.H.
Piazza S.
Plessy C.
Pombo A.
Ponten F.
Poulain S.
Poulsen T.M.
Pradhan S.
Prezioso C.
Pridans C.
Qin X.-Y.
Quackenbush J.
Rackham O.
Ramilowski Jordan A.
Ramilowski Jordan A.
Ravasi T.
Rehli M.
Rennie S.
Rito T.
Rizzu P.
Robert C.
Roos M.
Rost B.
Roudnicky F.
Roy R.
Rye M.B.
Sachenkova O.
Saetrom P.
Sai H.
Saiki S.
Saito A.
Saito M.
Sakaguchi S.
Sakai M.
Sakaue S.
Sakaue-Sawano A.
Sandelin A.
Sano H.
Saraswat Manu
Sasamoto Y.
Sato H.
Saxena A.
Saya H.
Schafferhans A.
Schmeier S.
Schmidl C.
Schmocker D.
Schneider C.
Schueler M.
Schultes E.A.
Schulze-Tanzil G.
Semple C.A.
Seno S.
Seo W.
Sese J.
Severin Jessica
Severin Jessica
Sheng G.
Shi J.
Shimoni Y.
Shin J.W.
SimonSanchez J.
Sivertsson A.
Sjostedt E.
Soderhall C.
Stoiber M.H.
Sugiyama D.
Sui S.H.
Summers K.M.
Suzuki A.M.
Suzuki Harukazu
Suzuki Harukazu
Suzuki K.
Suzuki M.
Suzuki N.
Suzuki T.
Swanson D.J.
Swoboda R.K.
Tagami Michihira
Tagami Michihira
Taguchi A.
Takahashi H.
Takahashi M.
Takamochi K.
Takeda S.
Takenaka Y.
Tam K.T.
Tanaka H.
Tanaka R.
Tanaka Y.
Tang D.
Taniuchi I.
Tanzer A.
Tarui H.
Taylor M.S.
Terada A.
Terao Y.
Testa A.C.
Thomas M.
Thongjuea S.
Tomii K.
Toyoda H.
Triglia E.T.
Tsang H.G.
Tsujikawa M.
Uhlén M.
Valen E.
van de Wetering M.
van Nimwegen E.
Velmeshev D.
Verardo R.
Vitezic M.
Vitting-Seerup K.
von Feilitzen K.
Voolstra C.R.
Vorontsov I.E.
Wahlestedt C.
Wasserman Wyeth W.
Wasserman Wyeth W.
Watanabe K.
Watanabe S.
Wells C.A.
Winteringham L.N.
Wolvetang E.
Yabukami H.
Yagi K.
Yamada T.
Yamaguchi Y.
Yamamoto M.
Yamamoto Y.
Yamamoto Y.
Yamanaka Y.
Yano K.
Yasuzawa K.
Yatsuka Y.
Yo M.
Yokokura S.
Yoneda M.
Yoshida E.
Yoshida Y.
Yoshihara M.
Young R.
Young R.S.
Yu N.Y.
Yumoto N.
Zabierowski S.E.
Zhang P.G.
Zucchelli S.
Zwahlen M.
’t Hoen P.A.C.
Publication venue: Nature Publishing Group
Publication date: 15/12/2020
Field of study

Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism