Search CORE

102 research outputs found

Personalized Proteome: Comparing Proteogenomics and Open Variant Search Approaches for Single Amino Acid Variant Detection

Author: Bouwmeester Robbin
Degroeve Sven
Gabriels Ralf
Martens Lennart
Salz Renee
Volders Pieter-Jan
’t Hoen Peter A.C.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2021
Field of study

Item does not contain fulltex

Ghent University Academic Bibliography

Radboud Repository

Clinical improvement of DM1 patients reflected by reversal of disease-induced gene expression in blood

Author: Glennon Jeffrey C.
van As Daniël
van Cruchten Remco T.P.
van Engelen Baziel G.M.
‘t Hoen Peter A.C.
Publication venue
Publication date: 01/01/2022
Field of study

Background: Myotonic dystrophy type 1 (DM1) is an incurable multisystem disease caused by a CTG-repeat expansion in the DM1 protein kinase (DMPK) gene. The OPTIMISTIC clinical trial demonstrated positive and heterogenous effects of cognitive behavioral therapy (CBT) on the capacity for activity and social participations in DM1 patients. Through a process of reverse engineering, this study aims to identify druggable molecular biomarkers associated with the clinical improvement in the OPTIMISTIC cohort. Methods: Based on full blood samples collected during OPTIMISTIC, we performed paired mRNA sequencing for 27 patients before and after the CBT intervention. Linear mixed effect models were used to identify biomarkers associated with the disease-causing CTG expansion and the mean clinical improvement across all clinical outcome measures. Results: We identified 608 genes for which their expression was significantly associated with the CTG-repeat expansion, as well as 1176 genes significantly associated with the average clinical response towards the intervention. Remarkably, all 97 genes associated with both returned to more normal levels in patients who benefited the most from CBT. This main finding has been replicated based on an external dataset of mRNA data of DM1 patients and controls, singling these genes out as candidate biomarkers for therapy response. Among these candidate genes were DNAJB12, HDAC5, and TRIM8, each belonging to a protein family that is being studied in the context of neurological disorders or muscular dystrophies. Across the different gene sets, gene pathway enrichment analysis revealed disease-relevant impaired signaling in, among others, insulin-, metabolism-, and immune-related pathways. Furthermore, evidence for shared dysregulations with another neuromuscular disease, Duchenne muscular dystrophy, was found, suggesting a partial overlap in blood-based gene dysregulation. Conclusions: DM1-relevant disease signatures can be identified on a molecular level in peripheral blood, opening new avenues for drug discovery and therapy efficacy assessments.</p

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

University of Dundee Online Publications

Towards FAIRification of sensitive and fragmented rare disease patient data:challenges and solutions in European reference network registries

Author: Abaza Haitham
Benis Nirupama
Bernabé César H.
Cornet Ronald
Cámara Alberto
dos Santos Vieira Bruna
Jacobsen Annika
Le Cornec Clémence M.A.
Roos Marco
Schaefer Franz
Swertz Morris A.
van der Velde K. Joeri
Wilkinson Mark D.
Zhang Shuxin
’t Hoen Peter A.C.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

INTRODUCTION: Rare disease patient data are typically sensitive, present in multiple registries controlled by different custodians, and non-interoperable. Making these data Findable, Accessible, Interoperable, and Reusable (FAIR) for humans and machines at source enables federated discovery and analysis across data custodians. This facilitates accurate diagnosis, optimal clinical management, and personalised treatments. In Europe, twenty-four European Reference Networks (ERNs) work on rare disease registries in different clinical domains. The process and the implementation choices for making data FAIR (‘FAIRification’) differ among ERN registries. For example, registries use different software systems and are subject to different legal regulations. To support the ERNs in making informed decisions and to harmonise FAIRification, the FAIRification steward team was established to work as liaisons between ERNs and researchers from the European Joint Programme on Rare Diseases. RESULTS: The FAIRification steward team inventoried the FAIRification challenges of the ERN registries and proposed solutions collectively with involved stakeholders to address them. Ninety-eight FAIRification challenges from 24 ERNs’ registries were collected and categorised into “training” (31), “community” (9), “modelling” (12), “implementation” (26), and “legal” (20). After curating and aggregating highly similar challenges, 41 unique FAIRification challenges remained. The two categories with the most challenges were “training” (15) and “implementation” (9), followed by “community” (7), and then “modelling” (5) and “legal” (5). To address all challenges, eleven types of solutions were proposed. Among them, the provision of guidelines and the organisation of training activities resolved the “training” challenges, which ranged from less-technical “coffee-rounds” to technical workshops, from informal FAIR Games to formal hackathons. Obtaining implementation support from technical experts was the solution type for tackling the “implementation” challenges. CONCLUSION: This work shows that a dedicated team of FAIR data stewards is an asset for harmonising the various processes of making data FAIR in a large organisation with multiple stakeholders. Additionally, multi-levelled training activities are required to accommodate the diverse needs of the ERNs. Finally, the lessons learned from the experience of the FAIRification steward team described in this paper may help to increase FAIR awareness and provide insights into FAIRification challenges and solutions of rare disease registries. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13023-022-02558-5

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

PubMed Central

Digital.CSIC

Dissertations of the University of Groningen

Correction to: Solve-RD: systematic pan-European data sharing and collaborative analysis to solve rare diseases (European journal of human genetics : EJHG (2021) 29 9 (1325-1331))

Author: 't Hoen Peter A.C.
Beltran Sergi
Bonne Gisèle
Brookes Anthony J.
Brunner Han G.
de Voer Richarda M.
Ellwanger Kornelia
Evangelista Teresinha
Gilissen Christian
Graessner Holm
Gumus Gulcin
Harmuth Tina
Hoischen Alexander
Hoogerbrugge Nicoline
Laurie Steven
Matalonga Leslie
Ossowski Stephan
Rath Ana
Riess Olaf
Schulze-Hentrich Julia M.
Schüle Rebecca
Spalding Dylan
Swertz Morris
Synofzik Matthis
Töpf Ana
Verloes Alain
Vissers Lisenka E.L.M.
Vitobello Antonio
Zurek Birte
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2021
Field of study

Maastricht University Research Portal

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

PubMed Central

Dissertations of the University of Groningen

A Resource for Guiding Data Stewards to Make European Rare Disease Patient Registries FAIR

Author: Alberto Cámara Ballesteros
Annika Jacobsen
Bruna Dos Santos Vieira
Claudio Carta
Clémence M. A. Le Cornec
César H. Bernabé
K. Joeri van der Velde
Marco Roos
Morris A. Swertz
Nirupama Benis
Pablo Alarcón Moreno
Peter A.C. ’t Hoen
Philip van Damme
Ronald Cornet
Shuxin Zhang
Publication venue: 'Ubiquity Press, Ltd.'
Publication date: 01/06/2023
Field of study

Objective: This paper reports on the development of a dynamic data management planning questionnaire to guide data stewards of the European Reference Network (ERN) rare disease patient registries to make their data findable, accessible, interoperable, and reusable (FAIR). As part of this work, the questionnaire was validated through expert review and aligned with existing resources on rare diseases and FAIR data management. Materials and Methods: The questionnaire was developed for the Data Stewardship Wizard, a tool for data management planning. Knowledge sources on FAIR data, ERN patient registries, and data management were used to compose questions. Ten domain experts validated the questionnaire. The topics in the questionnaire were aligned with existing knowledge bases. Results: A total of 57 questions were included in the questionnaire. Twenty-three references to the FAIR Cookbook and Research Data Management toolkit for Life Sciences were added. Expert validation provided a total of 166 comments on content, structure, and software-related issues. A public instance of the Data Stewardship Wizard was deployed for use by data stewards of ERN patient registries. Discussion: The questionnaire addresses issues that ERNs encounter when making their registries FAIR and follows the implementation choices made by the European rare disease community. A challenging task for future research is to extend the questionnaire to other types of registries and to validate with users. Conclusion: This smart questionnaire is the first model created for the Data Stewardship Wizard that helps ERN patient registries with making their data FAIR. It will assist data stewards in aligning their efforts and providing guidance on FAIR data

ARTS repository - University of Groningen

Directory of Open Access Journals

Digital.CSIC

Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution

Author: 't Hoen P.-B. (Peter-Bram)
't Hoen P.A.C. (Peter A.C.)
Arindrarto W. (Wibowo)
Beekman M. (Marian)
Berg L.H. (Leonard) van den
Bonder M.J. (Marc)
Boomsma D.I. (Dorret)
Bot J.J. (Jan)
Breggen R. (Ruud) van der
Deelen J. (Joris)
Deelen P. (Patrick)
Dijk F. (Freerk) van
Dongen J. (Jenny) van
Duijn C.M. (Cornelia) van
Franke L. (Lude)
Greevenbroek M.M. van
Heemst D. (Diana) van
Heijmans B.T. (Bastiaan)
Hofman B.
Hottenga J.J. (Jouke Jan)
Isaacs A.J. (Aaron)
Iterson M. (Maarten) van
Jansen R.
Jhamai P.M. (Mila)
Kallen C.J. van der
Kielbasa S.M. (Szymon M.)
Lakenberg N. (Nico)
Luijk R. (René)
Mei S. (Shan)
Meurs J.B.J. (Joyce) van
Moed H. (Heleen)
Nooren I. (Irene)
Pool R. (Reńe)
Rooij J.G.J. (Jeroen) van
Schalkwijk C.G. (Casper)
Slagboom P.E. (Eline)
Stehouwer C.D. (Coen)
Suchiman H.E.D. (Eka)
Swertz M.A. (Morris A.)
Tigchelaar E.F. (Ettje F.)
Uitterlinden A.G. (André)
van 't Hof P. (Peter)
Van Galen M. (Michiel)
Veldink J.H. (Jan)
Verbiest M.M.P.J. (Michael)
Verkerk M. (Marijn)
Vermaat M. (Martijn)
Wijmenga C. (Cisca)
Zhernakova A. (Alexandra)
Zhernakova S. (Sasha)
Zwet E.W. (Erik) van
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/01/2017
Field of study

We show that epigenome- and transcriptome-wide association studies (EWAS and TWAS) are prone to significant inflation and bias of test statistics, an unrecognized phenomenon introducing spurious findings if left unaddressed. Neither GWAS-based methodology nor state-of-the-art confounder adjustment methods completely remove bias and inflation. We propose a Bayesian method to control bias and inflation in EWAS and TWAS based on estimation of the empirical null distribution. Using simulations and real data, we demonstrate that our method maximizes power while properly controlling the false positive rate. We illustrate the utility of our method in large-scale EWAS and TWAS meta-analyses of age and smoking

Erasmus University Digital Repository

Refining Attention-Deficit/Hyperactivity Disorder and Autism Spectrum Disorder Genetic Loci by Integrating Summary Data From Genome-wide Association, Gene Expression, and DNA Methylation Studies

Author: Agbessi Mawussé
Ahsan Habibul
Alves Isabel
Andiappan Anand
Arindrarto Wibowo
Arindrarto Wibowo
Awadalla Philip
Bartels Meike
Battle Alexis
Beekman Marian
Beutner Frank
Bonder Marc Jan
Bonder Marc Jan
Bot Jan
Byrne Enda M.
Christiansen Mark
Claringbould Annique
Deelen Joris
Deelen Patrick
Deelen Patrick
Deelen Patrick
Esko Tõnu
Favé Marie-julie
Franke Lude
Franke Lude
Franke Lude
Franke Lude
Frayling Timothy
Gharib Sina A.
Gibson Gregory
Hammerschlag Anke R.
Heijmans Bastiaan T.
Heijmans Bastiaan T.
Heijmans Bastiaan T.
Heijmans Bastiaan T.
Hemani Gibran
Hoen Peter-bram ‘t
Hof Peter Van ‘t
Hofman Bert A.
Hottenga Jouke J.
Isaacs Aaron
Isaacs Aaron
Jhamai P. Mila
Kalnapenkis Anette
Kasela Silva
Kettunen Johannes
Kielbasa Szymon M.
Kim Yungil
Kirsten Holger
Kovacs Peter
Krohn Knut
Kronberg-guzman Jaanika
Kukushkina Viktorija
Kutalik Zoltan
Kähönen Mika
Lakenberg Nico
Lee Bernett
Lehtimäki Terho
Loeffler Markus
Luijk René
Marigorta Urko M.
Mei Hailang
Middeldorp Christel M.
Milani Lili
Moed Matthijs
Montgomery Grant W.
Müller-nurasyid Martina
Nauck Matthias
Nivard Michel
Nooren Irene
Penninx Brenda
Perola Markus
Pervjakova Natalia
Pierce Brandon L.
Pool René
Powell Joseph
Prokisch Holger
Psaty Bruce M.
Raitakari Olli T.
Ripatti Samuli
Rotzschke Olaf
Rüeger Sina
Saha Ashis
Schalkwijk Casper G.
Scholz Markus
Schramm Katharina
Seppälä Ilkka
Slagboom P. Eline
Slagboom P. Eline
Stehouwer Coen D.a.
Stehouwer Coen D.a.
Stumvoll Michael
Suchiman H.eka D.
Sullivan Patrick
Swertz Morris A.
Teumer Alexander
Thiery Joachim
Tigchelaar Ettje F.
Tong Lin
Tönjes Anke
Uitterlinden André G.
Van Den Berg Leonard H.
Van Der Breggen Ruud
Van Der Kallen Carla J.h.
Van Dijk Freerk
Van Dongen Jenny
Van Duijn Cornelia M.
Van Galen Michiel
Van Galen Michiel
Van Greevenbroek Marleen Mj.
Van Heemst Diana
Van Iterson Maarten
Van Iterson Maarten
Van Iterson Maarten
Van Meurs Joyce
Van Meurs Joyce
Van Meurs Joyce
Van Rooij Jeroen
Van Zwet Erik. W.
Veldink Jan H.
Veldink Jan H.
Verbiest Michael
Verkerk Marijn
Verlouw Joost
Vermaat Martijn
Visscher Peter M.
Võsa Urmo
Völker Uwe
Westra Harm-jan
Wijmenga Cisca
Wijmenga Cisca
Wray Naomi R.
Yaghootkar Hanieh
Yang Jian
Zeng Biao
Zhang Futao
Zhernakova Daria V.
Zhernakova Daria V.
Zhernakova Sasha
‘t Hoen Peter A.c.
‘t Hoen Peter A.c.
Publication venue: 'Elsevier BV'
Publication date: 15/09/2020
Field of study

Background: Recent genome-wide association studies (GWASs) identified the first genetic loci associated with attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorder (ASD). The next step is to use these results to increase our understanding of the biological mechanisms involved. Most of the identified variants likely influence gene regulation. The aim of the current study is to shed light on the mechanisms underlying the genetic signals and prioritize genes by integrating GWAS results with gene expression and DNA methylation (DNAm) levels. Methods: We applied summary-data–based Mendelian randomization to integrate ADHD and ASD GWAS data with fetal brain expression and methylation quantitative trait loci, given the early onset of these disorders. We also analyzed expression and methylation quantitative trait loci datasets of adult brain and blood, as these provide increased statistical power. We subsequently used summary-data–based Mendelian randomization to investigate if the same variant influences both DNAm and gene expression levels. Results: We identified multiple gene expression and DNAm levels in fetal brain at chromosomes 1 and 17 that were associated with ADHD and ASD, respectively, through pleiotropy at shared genetic variants. The analyses in brain and blood showed additional associated gene expression and DNAm levels at the same and additional loci, likely because of increased statistical power. Several of the associated genes have not been identified in ADHD and ASD GWASs before. Conclusions: Our findings identified the genetic variants associated with ADHD and ASD that likely act through gene regulation. This facilitates prioritization of candidate genes for functional follow-up studies

VU Research Portal

Blood lipids influence DNA methylation in circulating cells

Author: 't Hoen P.A.C. (Peter A.C.)
Berg L.H. (Leonard) van den
Bonder M.J. (Marc)
Boomsma D.I. (Dorret)
Deelen J. (Joris)
Dekkers K.F. (Koen F.)
Dongen J. (Jenny) van
Duijn C.M. (Cornelia) van
Franke L. (Lude)
Greevenbroek M.M. van
Heemst D. (Diana) van
Heijmans B.T. (Bastiaan)
Hofman A. (Albert)
Hottenga J.J. (Jouke Jan)
Iterson M. (Maarten) van
Jansen R.
Jukema J.W. (Jan Wouter)
Kallen C.J. van der
Mei S. (Shan)
Meurs J.B.J. (Joyce) van
Moed H. (Heleen)
Schalkwijk C.G. (Casper)
Slagboom P.E. (Eline)
Slieker R. (Roderick)
Stehouwer C.D. (Coen)
Tigchelaar E.F. (Ettje F.)
Uitterlinden A.G. (André)
Van Galen M. (Michiel)
Veldink J.H. (Jan)
Wijmenga C. (Cisca)
Willemsen G.A.H.M. (Gonneke)
Zhernakova A. (Alexandra)
Zhernakova A. (Alexandra)
Zwet E.W. (Erik) van
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/06/2016
Field of study

Background: Cells can be primed by external stimuli to obtain a long-term epigenetic memory. We hypothesize that long-term exposure to elevated blood lipids can prime circulating immune cells through changes in DNA methylation, a process that may contribute to the development of atherosclerosis. To interrogate the causal relationship between triglyceride, low-density lipoprotein (LDL) cholesterol, and high-density lipoprotein (HDL) cholesterol levels and genome-wide DNA methylation while excluding confounding and pleiotropy, we perform a stepwise Mendelian randomization analysis in whole blood of 3296 individuals. Results: This analysis shows that differential methylation is the consequence of inter-individual variation in blood lipid levels and not vice versa. Specifically, we observe an effect of triglycerides on DNA methylation at three CpGs, of LDL cholesterol at one CpG, and of HDL cholesterol at two CpGs using multivariable Mendelian randomization. Using RNA-seq data available for a large subset of individuals (N = 2044), DNA methylation of these six CpGs is associated with the expression of CPT1A and SREBF1 (for triglycerides), DHCR24 (for LDL cholesterol) and

Erasmus University Digital Repository

Improving Phenotypic Prediction by Combining Genetic and Epigenetic Associations

Author: Aaron Isaacs
Abdullah
Alexandra Zhernakova
Allan F. McRae
André G. Uitterlinden
Anjali K. Henders
Bastiaan T. Heijmans
Bert A. Hofman
Bibikova
Bonder
Carla J.H. van der Kallen
Casper G. Schalkwijk
Chunyu Liu
Cisca Wijmenga
Cisca Wijmenga
Coen D.A. Stehouwer
Cornelia M. van Duijn
Daniel Levy
Dasha V. Zhernakova
Dave Liewald
Deary
Deary
Deary
Deelen
Delano
Diana van Heemst
Dick
Dixon
Dorret I. Boomsma
Elks
Erik W. van Zwet
Ettje F. Tigchelaar
Fehrmann
Freerk van Dijk
Grant W. Montgomery
H. Eka D. Suchiman
Hailiang Mei
Hannum
Hemani
Horvath
Howie
Howie
Huynh
Ian J. Deary
Irene Nooren
Jan Bot
Jan H. Veldink
Jenny van Dongen
Jeroen van Rooij
Jian Yang
John M. Starr
Joris Deelen
Jouke J. Hottenga
Joyce van Meurs
Lavie
Leonard H. van den Berg
Liming Liang
Locke
Lude Franke
Lude Franke
Maarten van Iterson
Macgregor
Marc Jan Bonder
Marc J. Bonder
Marian Beekman
Marijn Verkerk
Marleen M.J. van Greevenbroek
Martijn Vermaat
Matthijs Moed
McRae
Medland
Michael Verbiest
Michael M. Mendelson
Michiel van Galen
Morris A. Swertz
Must
Møller
Naomi R. Wray
Nicholas G. Martin
Nico Lakenberg
Ong
P. Eline Slagboom
P. Mila Jhamai
Parkes
Patrick Deelen
Peter A.C. ’t Hoen
Peter M. Visscher
Peter van ’t Hof
Pidsley
Poirier
Powell
René Luijk
René Pool
Riccardo E. Marioni
Rick Jansen
Roby Joehanes
Ruud van der Breggen
Sarah E. Harris
Sasha Zhernakova
Shah
Shenker
Silventoinen
Slieker
Sonia Shah
Speliotes
Szymon M. Kielbasa
Tigchelaar
Touleimat
Wibowo Arindrarto
Wood
Yang
Yang
Zaitlen
Zhihong Zhu
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

We tested whether DNA-methylation profiles account for inter-individual variation in body mass index (BMI) and height and whether they predict these phenotypes over and above genetic factors. Genetic predictors were derived from published summary results from the largest genome-wide association studies on BMI (n ∼ 350,000) and height (n ∼ 250,000) to date. We derived methylation predictors by estimating probe-trait effects in discovery samples and tested them in external samples. Methylation profiles associated with BMI in older individuals from the Lothian Birth Cohorts (LBCs, n = 1,366) explained 4.9% of the variation in BMI in Dutch adults from the LifeLines DEEP study (n = 750) but did not account for any BMI variation in adolescents from the Brisbane Systems Genetic Study (BSGS, n = 403). Methylation profiles based on the Dutch sample explained 4.9% and 3.6% of the variation in BMI in the LBCs and BSGS, respectively. Methylation profiles predicted BMI independently of genetic profiles in an additive manner: 7%, 8%, and 14% of variance of BMI in the LBCs were explained by the methylation predictor, the genetic predictor, and a model containing both, respectively. The corresponding percentages for LifeLines DEEP were 5%, 9%, and 13%, respectively, suggesting that the methylation profiles represent environmental effects. The differential effects of the BMI methylation profiles by age support previous observations of age modulation of genetic contributions. In contrast, methylation profiles accounted for almost no variation in height, consistent with a mainly genetic contribution to inter-individual variation. The BMI results suggest that combining genetic and epigenetic information might have greater utility for complex-trait prediction

VU Research Portal

University of Groningen

Edinburgh Research Explorer

Leiden University Scholary Publications

University of Queensland eSpace

Elsevier - Publisher Connector

Crossref

Proceedings - University of Groningen

ARTS repository - University of Groningen

PubMed Central

Dissertations of the University of Groningen

Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network

Author: Abugessaisa Imad
Aitken Stuart
Aken Bronwen L.
Alam Intikhab
Alam Tanvir
Alasiri Rami
Alhendi Ahmad M. N.
Alinejad-Rokny Hamid
Alvarez Mariano J.
Andersson Robin
Arakawa Takahiro
Araki Marito
Arbel Taly
Archer John
Archibald Alan L.
Arner Erik
Arner Peter
Asai Kiyoshi
Ashoor Haitham
Astrom Gaby
Babina M.
Baillie J.K.
Bajic V.B.
Bajpai A.
Baker S.
Baldarelli R.M.
Balic A.
Bansal M.
Batagov A.O.
Batzoglou S.
Beckhouse A.G.
Beltrami A.P.
Beltrami C.A.
Bertin Nicolas
Bessière Chloé
Bhattacharya S.
Bickel P.J.
Blake J.A.
Blanchette M.
Bodega B.
Bonetti A.
Bono H.
Bornholdt J.
Bougouffa S.
Boyd M.
Breda J.
Brombacher F.
Brown J.B.
Bréhélin L.
Bttcher M.
Bult C.J.
Burroughs A.M.
Burt D.W.
Busch A.
Caglio G.
Califano A.
Cameron C.J.
Cannistraci C.V.
Carbone A.
Carlisle A.J.
Carninci Piero
Carninci Piero
Carter K.W.
Cesselli D.
Chang J.-C.
Chatelain Clement
Chen J.C.
Chen Y.
Chierici M.
Christodoulou J.
Ciani Y.
Clark E.L.
Coskun M.
Dalby M.
Dalla E.
Daub C.O.
Davis C.A.
de Hoom Michiel J. L.
de Hoom Michiel J. L.
de Rie D.
Denisenko E.
Deplancke B.
Detmar M.
Deviatiiarov R.
Di Bernardo D.
Diehl A.D.
Dieterich L.C.
Dimont E.
Djebali S.
Dohi T.
Dostie J.
Drablos F.
Edge A.S.B.
Edinger M.
Ehrlund A.
Ekwall K.
Elofsson A.
Endoh M.
Enomoto H.
Enomoto S.
Faghihi M.
Fagiolini M.
FANTOM consortium.
Farach-Carson M.C.
Faulkner G.J.
Favorov A.
Fernandes A.M.
Ferrai C.
Forrest A.R.R.
Forrester L.M.
Forsberg M.
Fort A.
Francescatto M.
Freeman T.C.
Frith Martin C.
Frith Martin C.
Fukuda S.
Funayama M.
Furlanello C.
Furuno M.
Furusawa C.
Gao H.
Gazova I.
Gebhard C.
Geier F.
Geijtenbeek T.B.H.
Ghosh S.
Ghosheh Y.
Gingeras T.R.
Gojobori T.
Goldberg T.
Goldowitz D.
Gough J.
Grapotte Mathys
Greco D.
Gruber A.J.
Guhl S.
Guigo R.
Guler R.
Gusev O.
Gustincich S.
Ha T.J.
Haberle V.
Hale P.
Hallstrom B.M.
Hamada M.
Handoko L.
Hara M.
Harbers M.
Harrow J.
Harshbarger J.
Hase T.
Hasegawa Akira
Hasegawa Akira
Hashimoto K.
Hatano T.
Hattori N.
Hayashi R.
Hayashizaki Yoshihide
Hayashizaki Yoshihide
Herlyn M.
Hettne K.
Heutink P.
Hide W.
Hitchens K.J.
Hon C.C.
Hori F.
Horie M.
Horimoto K.
Horton P.
Hou R.
Huang E.
Huang Y.
Hugues R.
Hume D.
Ienasescu H.
Iida K.
Ikawa T.
Ikemura T.
Ikeo K.
Inoue N.
Ishizu Y.
Ito Y.
Itoh Masayoshi
Itoh Masayoshi
Ivshina A.V.
Jankovic B.R.
Jenjaroenpun P.
Johnson R.
Jorgensen M.
Jorjani H.
Joshi A.
Jurman G.
Kaczkowski B.
Kai C.
Kaida K.
Kajiyama K.
Kaliyaperumal R.
Kaminuma E.
Kanaya T.
Kaneda H.
Kapranov P.
Kasianov A.S.
Kasukawa Takeya
Kasukawa Takeya
Katayama T.
Kato S.
Kawaguchi S.
Kawai J.
Kawaji H.
Kawamoto H.
Kawamura Y.I.
Kawasaki S.
Kawashima T.
Kempfle J.S.
Kenna T.J.
Kere J.
Khachigian L.
Kiryu H.
Kishima M.
Kitajima H.
Kitamura T.
Kitano H.
Klaric E.
Klepper K.
Klinken S.P.
Kloppmann E.
Knox A.J.
Kodama Y.
Kogo Y.
Kojima M.
Kojima S.
Kojima-Ishiyama Miki
Komatsu N.
Komiyama H.
Kono T.
Koseki H.
Koyasu S.
Kratz A.
Kukalev A.
Kulakovskiy I.
Kundaje A.
Kunikata H.
Kuo R.
Kuo T.
Kuraku S.
Kuznetsov V.A.
Kwon T.J.
Larouche M.
Lassmann T.
Laurent G.S.
Law A.
Le-Cao K.-A.
Lecellier C.-H.
Lecellier C.-H.
Lee W.
Lenhard B.
Lennartsson A.
Li K.
Li R.
Lilje B.
Lipovich L.
Lizio M.
Lopez G.
Magi S.
Mak G.K.
Makeev V.
Manabe R.
Mandai M.
Mar J.
Maruyama K.
Maruyama T.
Mason E.
Mathelier A.
Matsuda H.
Medvedeva Y.A.
Meehan T.F.
Mejhert N.
Menichelli Christophe
Meynert A.
Mikami N.
Minoda A.
Miura H.
Miyagi Y.
Miyawaki A.
Mizuno Y.
Morikawa H.
Morimoto M.
Morioka M.
Morishita S.
Moro K.
Motakis E.
Motohashi H.
Mukarram A.K.
Mummery C.L.
Mungall C.J.
Murakawa Y.
Muramatsu M.
Murata Mitsuyoshi
Murata Mitsuyoshi
Nagasaka K.
Nagase T.
Nakachi Y.
Nakahara F.
Nakai K.
Nakamura K.
Nakamura Y.
Nakamura Y.
Nakazawa T.
Nason G.P.
Nepal C.
Nguyen Q.H.
Nielsen L.K.
Nishida K.
Nishiguchi K.M.
Nishiyori H.
Nishiyori-Sueki Hiromi
Nitta K.
Noguchi Shuhei
Noguchi Shuhei
Noma Shohei
Noma Shohei
Notredame C.
Ogishima S.
Ohkura N.
Ohno H.
Ohshima M.
Ohtsu T.
Okada Y.
Okada-Hatakeyama M.
Okazaki Y.
Oksvold P.
Orlando V.
Ow G.S.
Ozturk M.
Pachkov M.
Paparountas T.
Parihar S.P.
Park S.-J.
Pascarella G.
Passier R.
Persson H.
Philippens I.H.
Piazza S.
Plessy C.
Pombo A.
Ponten F.
Poulain S.
Poulsen T.M.
Pradhan S.
Prezioso C.
Pridans C.
Qin X.-Y.
Quackenbush J.
Rackham O.
Ramilowski Jordan A.
Ramilowski Jordan A.
Ravasi T.
Rehli M.
Rennie S.
Rito T.
Rizzu P.
Robert C.
Roos M.
Rost B.
Roudnicky F.
Roy R.
Rye M.B.
Sachenkova O.
Saetrom P.
Sai H.
Saiki S.
Saito A.
Saito M.
Sakaguchi S.
Sakai M.
Sakaue S.
Sakaue-Sawano A.
Sandelin A.
Sano H.
Saraswat Manu
Sasamoto Y.
Sato H.
Saxena A.
Saya H.
Schafferhans A.
Schmeier S.
Schmidl C.
Schmocker D.
Schneider C.
Schueler M.
Schultes E.A.
Schulze-Tanzil G.
Semple C.A.
Seno S.
Seo W.
Sese J.
Severin Jessica
Severin Jessica
Sheng G.
Shi J.
Shimoni Y.
Shin J.W.
SimonSanchez J.
Sivertsson A.
Sjostedt E.
Soderhall C.
Stoiber M.H.
Sugiyama D.
Sui S.H.
Summers K.M.
Suzuki A.M.
Suzuki Harukazu
Suzuki Harukazu
Suzuki K.
Suzuki M.
Suzuki N.
Suzuki T.
Swanson D.J.
Swoboda R.K.
Tagami Michihira
Tagami Michihira
Taguchi A.
Takahashi H.
Takahashi M.
Takamochi K.
Takeda S.
Takenaka Y.
Tam K.T.
Tanaka H.
Tanaka R.
Tanaka Y.
Tang D.
Taniuchi I.
Tanzer A.
Tarui H.
Taylor M.S.
Terada A.
Terao Y.
Testa A.C.
Thomas M.
Thongjuea S.
Tomii K.
Toyoda H.
Triglia E.T.
Tsang H.G.
Tsujikawa M.
Uhlén M.
Valen E.
van de Wetering M.
van Nimwegen E.
Velmeshev D.
Verardo R.
Vitezic M.
Vitting-Seerup K.
von Feilitzen K.
Voolstra C.R.
Vorontsov I.E.
Wahlestedt C.
Wasserman Wyeth W.
Wasserman Wyeth W.
Watanabe K.
Watanabe S.
Wells C.A.
Winteringham L.N.
Wolvetang E.
Yabukami H.
Yagi K.
Yamada T.
Yamaguchi Y.
Yamamoto M.
Yamamoto Y.
Yamamoto Y.
Yamanaka Y.
Yano K.
Yasuzawa K.
Yatsuka Y.
Yo M.
Yokokura S.
Yoneda M.
Yoshida E.
Yoshida Y.
Yoshihara M.
Young R.
Young R.S.
Yu N.Y.
Yumoto N.
Zabierowski S.E.
Zhang P.G.
Zucchelli S.
Zwahlen M.
’t Hoen P.A.C.
Publication venue: Nature Publishing Group
Publication date: 15/12/2020
Field of study

Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism