Search CORE

8 research outputs found

Getting personal with epigenetics: towards individual-specific epigenomic imputation with machine learning

Author: Hawkins-Hooker Alex
Narendra Tanmayee
Rojas-Carulla Mateo
Schweikert Gabriele
Schölkopf Bernhard
Visonà Giovanni
Publication date: 07/08/2023

Epigenetic modifications are dynamic mechanisms involved in the regulation of gene expression. Unlike the DNA sequence, epigenetic patterns vary not only between individuals, but also between different cell types within an individual. Environmental factors, somatic mutations and ageing contribute to epigenetic changes that may constitute early hallmarks or causal factors of disease. Epigenetic modifications are reversible and thus promising therapeutic targets for precision medicine. However, mapping efforts to determine an individual's cell-type-specific epigenome are constrained by experimental costs and tissue accessibility. To address these challenges, we developed eDICE, an attention-based deep learning model that is trained to impute missing epigenomic tracks by conditioning on observed tracks. Using a recently published set of epigenomes from four individual donors, we show that transfer learning across individuals allows eDICE to successfully predict individual-specific epigenetic variation even in tissues that are unmapped in a given donor. These results highlight the potential of machine learning-based imputation methods to advance personalized epigenomics.

University of Dundee Online Publications

Integration of multiple epigenomic marks improves prediction of variant impact in saturation mutagenesis reporter assay

Author: Adato Orit
Adhikari Aashish N.
Ahituv Nadav
Beer Michael A.
Boyle Alan P.
Dong Shengcheng
Hawkins‐Hooker Alex
Inoue Fumitaka
Juven‐Gershon Tamar
Kenlay Henry
Kircher Martin
Kreimer Anat
Kulakovskiy Ivan V.
Martin Beth
Patra Ayoti
Penzar Dmitry D.
Reid John
Schubach Max
Shendure Jay
Shigaki Dustin
Unger Ron
Xiong Chenling
Yan Zhongxia
Yosef Nir
Publication venue: 'Wiley'
Publication date: 01/09/2019

The integrative analysis of high-throughput reporter assays, machine learning, and profiles of epigenomic chromatin state in a broad array of cells and tissues has the potential to significantly improve our understanding of noncoding regulatory element function and its contribution to human disease. Here, we report results from the CAGI 5 regulation saturation challenge where participants were asked to predict the impact of nucleotide substitution at every base pair within five disease-associated human enhancers and nine disease-associated promoters. A library of mutations covering all bases was generated by saturation mutagenesis and altered activity was assessed in a massively parallel reporter assay (MPRA) in relevant cell lines. Reporter expression was measured relative to plasmid DNA to determine the impact of variants. The challenge was to predict the functional effects of variants on reporter expression. Comparative analysis of the full range of submitted prediction results identifies the most successful models of transcription factor binding sites, machine learning algorithms, and ways to choose among or incorporate diverse datatypes and cell-types for training computational models. These results have the potential to improve the design of future studies on more diverse sets of regulatory elements and aid the interpretation of disease-associated genetic variation. Peer Reviewed. https://deepblue.lib.umich.edu/bitstream/2027.42/151884/1/humu23797_am.pdf https://deepblue.lib.umich.edu/bitstream/2027.42/151884/2/humu23797.pd

eScholarship - University of California

Deep Blue Documents

The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles

A promising alternative to comprehensively performing genomics experiments is to, instead, perform a subset of experiments and use computational methods to impute the remainder. However, identifying the best imputation methods and what measures meaningfully evaluate performance are open questions. We address these questions by comprehensively analyzing 23 methods from the ENCODE Imputation Challenge. We find that imputation evaluations are challenging and confounded by distributional shifts from differences in data collection and processing over time, the amount of available data, and redundancy among performance measures. Our analyses suggest simple steps for overcoming these issues and promising directions for more robust research.

University of Dundee Online Publications

The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles

University of Dundee Online Publications

Generating functional protein variants with variational autoencoders

Author: Baur Sebastien
Bikard David
Chen Arthur
Couairon Guillaume
Depardieu Florence
Hawkins-Hooker Alex
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/02/2021

The vast expansion of protein sequence databases provides an opportunity for new protein design approaches which seek to learn the sequence-function relationship directly from natural sequence variation. Deep generative models trained on protein sequence data have been shown to learn biologically meaningful representations helpful for a variety of downstream tasks, but their potential for direct use in the design of novel proteins remains largely unexplored. Here we show that variational autoencoders trained on a dataset of almost 70000 luciferase-like oxidoreductases can be used to generate novel, functional variants of the luxA bacterial luciferase. We propose separate VAE models to work with aligned sequence input (MSA VAE) and raw sequence input (AR-VAE), and offer evidence that while both are able to reproduce patterns of amino acid usage characteristic of the family, the MSA VAE is better able to capture long-distance dependencies reflecting the influence of 3D structure. To confirm the practical utility of the models, we used them to generate variants of luxA whose luminescence activity was validated experimentally. We further showed that conditional variants of both models could be used to increase the solubility of luxA without disrupting function. Altogether 6/12 of the variants generated using the unconditional AR-VAE and 9/11 generated using the unconditional MSA VAE retained measurable luminescence, together with all 23 of the less distant variants generated by conditional versions of the models; the most distant functional variant contained 35 differences relative to the nearest training set sequence. These results demonstrate the feasibility of using deep generative models to explore the space of possible protein sequences and generate useful variants, providing a method complementary to rational design and directed evolution approaches.

Directory of Open Access Journals

HAL Descartes

HAL-Pasteur

Hal-Diderot

Getting personal with epigenetics: towards individual-specific epigenomic imputation with machine learning

Author: Hawkins-Hooker Alex
Narendra Tanmayee
Rojas-Carulla Mateo
Schweikert Gabriele
Schölkopf Bernhard
Visonà Giovanni
Publication venue: Berlin : Nature Portfolio
Publication date: 01/01/2023

Publikationsserver der Universität Tübingen

Integration of multiple epigenomic marks improves prediction of variant impact in saturation mutagenesis reporter assay

Author: Adato Orit
Adhikari Aashish N
Ahituv Nadav
Beer Michael A
Boyle Alan P
Dong Shengcheng
Hawkins‐Hooker Alex
Inoue Fumitaka
Juven‐Gershon Tamar
Kenlay Henry
Kircher Martin
Kreimer Anat
Kulakovskiy Ivan V
Martin Beth
Patra Ayoti
Penzar Dmitry D
Reid John
Schubach Max
Shendure Jay
Shigaki Dustin
Unger Ron
Xiong Chenling
Yan Zhongxia
Yosef Nir
Publication venue: eScholarship, University of California
Publication date: 01/09/2019

The integrative analysis of high-throughput reporter assays, machine learning, and profiles of epigenomic chromatin state in a broad array of cells and tissues has the potential to significantly improve our understanding of noncoding regulatory element function and its contribution to human disease. Here, we report results from the CAGI 5 regulation saturation challenge where participants were asked to predict the impact of nucleotide substitution at every base pair within five disease-associated human enhancers and nine disease-associated promoters. A library of mutations covering all bases was generated by saturation mutagenesis and altered activity was assessed in a massively parallel reporter assay (MPRA) in relevant cell lines. Reporter expression was measured relative to plasmid DNA to determine the impact of variants. The challenge was to predict the functional effects of variants on reporter expression. Comparative analysis of the full range of submitted prediction results identifies the most successful models of transcription factor binding sites, machine learning algorithms, and ways to choose among or incorporate diverse datatypes and cell-types for training computational models. These results have the potential to improve the design of future studies on more diverse sets of regulatory elements and aid the interpretation of disease-associated genetic variation.

eScholarship - University of California

Literatur

Author: Ben
Dautzenberg Gerhard
Dautzenberg Gerhard
Davies William
Davies William David
De Boer Martinus Christianus
De Wette
Del Verme Marcello
Delling Gerhard
Dibelius Martin
Die Apostelgeschichte KEK
Dschulnigg Peter
Dunn James
Dunn James
Dunn James
Dunn James
Dunn James
Dunn James
Ebner Martin
Elliott James Keith
Ernst Josef
Ernst Josef
Farmer William Reuben
Feine Paul
Feldman Louis Harry
Fenton John
Feuillet André
Fiorenza
Fitzmyer Joseph Augustine
Fitzmyer Joseph Augustine
Fitzmyer Joseph Augustine
Friedrich Gerhard
Furnish Victor Paul
Gathercole Simon
Gnilka Joachim
Gnilka Joachim
Goodspeed Edgar Johnson
Gould Ezra Palmer
Goulder Michael Douglas
Goulder Michael Douglas
Goulder Michael Douglas
Goulder Michael Douglas
Green Barbara
Grundmann Walter
Gundry Robert Horton
Haenchen Ernst
Hahn Ferdinand
Harris Horton
Hawkins
Heckel Theo
Held Heinz Joachim
Hengel Martin
Hengel Martin
Heubült Chr
Hochschild Ralph
Hock Ronald
Holtz Traugott
Hooker Morna Dorothy
Hooker Morna Dorothy
Hooker Morna Dorothy
Horn Friedrich Wilhelm
Horsley Richard
Horsley Richard
Howard George
Hupe Henning
Jeremias Joachim
Jeremias Joachim
Jervell Jakob
Keck Louis Martyn
Kezbere Ilze
Kilgallen John
Kim Seyoon
Kittel Gerhard
Klein Günter
Klinghardt Matthias
Koch Dietrich-Alex
Koester Craig
Koester Helmut
Koester Helmut
Kok Ezra
Konradt Matthias
Kristeva Julia
Kuhn Heinz-Wolfgang
Kähler Martin
Käsemann Ernst
Köster Helmut
Köster Helmut
Köstlin Karl Reinhold
Lachmann Karl
Lampe Peter
Lampe Peter
Lane William
Leppä Heikki
Lietzmann Hans
Lindemann Andreas
Loisy Alfred
Louw Eugene Nida
Luz Ulrich
Luz Ulrich
Löning Karl
Lüdemann Gerd
Mann Christopher Stephen
Manson Thomas Walter
Mantey Julius Robert
Mantey Julius Robert
Marcus Joel
Marcus Joel
Marshall Ian Howard
Marshall Ian Howard
Martin
Martyn James Louis
Marxsen Willi
Marxsen Willi
McKnight Scot
Moiser Jeremy
Moo Douglas
Müller Paul-Gerhard
Müller Ulrich
Nave Guy Dale
Niederwimmer Kurt
Oden Thomas
Ollrog Wolf-Henning
Pesch Rudolf
Pesch Rudolf
Pfister Manfred
Pfleiderer Otto
Pilhofer Peter
Pilhofer Peter
Plummer Alfred
Popp Thomas
Porter Stanley
Przybylski Benno
Quintilianus Marcus Fabius
Rehm Bernhard
Reimarus Hermann Samuel
Resch Alfred
Riches David Sim
Robinson McConkey
Roetzel Calvin
Roh Taeseong
Rosner Brian
Salo Kalervo
Sampley Paul
Sanders
Schenk Wolfgang
Schille Gottfried
Schlatter Adolf
Schmidt Karl Matthias
Schmithals Walter
Schmithals Walter
Schmithals Walter
Schmithals Walter
Schnackenburg Rudolf
Schneider Gerhard
Schneider Gerhard
Schnelle Udo
Schnelle Udo
Schniewind Julius
Schniewind Julius
Schottroff Luise
Schrage Wolfgang
Schreiber Stefan
Schulthess Friedrich
Schulz Siegfried
Schweizer Eduard
Seifrid Mark
Shen Philip
Silva Moises
Sim David
Sim David
Sim David
Sim David
Sim David
Stanton Graham Norman
Stanton Graham Norman
Stendahl Krister
Strecker Christian
Strecker Georg
Strecker Georg
Strecker Georg
Stuhlmacher
Stuhlmacher
Swete Henry Barclay
Taylor David Bruce
Taylor Justin
Taylor Vincent
Theissen Annette Merz
Theissen Dagmar Winter
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Theissen Gerd
Tuckett Christopher Mark
Van de Sandt David Flusser
Van Dodewaard J. A. E
Van Unnik Willem Cornelis
Vielhauer Philipp
Vielhauer Philipp
Volkmar Gustav
Volkmar Gustav
Von Dobschütz Ernst
Von Harnack Adolf
Von Harnack Adolf
Von Harnack Adolf
Von Soden Hermann Freiherr
Vouga François
Wallace Daniel Baird
Wengst Klaus
Wenham David
Werner Martin
Wernle Paul
Wilckens Ulrich
Wilckens Ulrich
Wohlenberg Gustav
Wong Eric Kun
Wong Eric Kun
Zeller Dieter
Publication venue: 'Vandenhoeck & Ruprecht GmbH & Co, KG'

Crossref