Search CORE

23 research outputs found

Analysis, Visualization, and Machine Learning of Epigenomic Data

Author: Purcaro Michael J.
Publication venue: eScholarship@UMassChan
Publication date: 12/12/2017
Field of study

The goal of the Encyclopedia of DNA Elements (ENCODE) project has been to characterize all the functional elements of the human genome. These elements include expressed transcripts and genomic regions bound by transcription factors (TFs), occupied by nucleosomes, occupied by nucleosomes with modified histones, or hypersensitive to DNase I cleavage, etc. Chromatin Immunoprecipitation (ChIP-seq) is an experimental technique for detecting TF binding in living cells, and the genomic regions bound by TFs are called ChIP-seq peaks. ENCODE has performed and compiled results from tens of thousands of experiments, including ChIP-seq, DNase, RNA-seq and Hi-C. These efforts have culminated in two web-based resources from our lab—Factorbook and SCREEN—for the exploration of epigenomic data for both human and mouse. Factorbook is a peak-centric resource presenting data such as motif enrichment and histone modification profiles for transcription factor binding sites computed from ENCODE ChIP-seq data. SCREEN provides an encyclopedia of ~2 million regulatory elements, including promoters and enhancers, identified using ENCODE ChIP-seq and DNase data, with an extensive UI for searching and visualization. While we have successfully utilized the thousands of available ENCODE ChIP-seq experiments to build the Encyclopedia and visualizers, we have also struggled with the practical and theoretical inability to assay every possible experiment on every possible biosample under every conceivable biological scenario. We have used machine learning techniques to predict TF binding sites and enhancers location, and demonstrate machine learning is critical to help decipher functional regions of the genome

eScholarship@UMMS

Factorbook: an Updated Catalog of Transcription Factor Motifs and Candidate Regulatory Motif Sites [preprint]

Author: Andrews Gregory
Moore Jill E.
Phalke Nishigandha
Pratt Henry E.
Purcaro Michael J.
van der Velde Arjan
Weng Zhiping
Publication venue: 'Oxford University Press (OUP)'
Publication date: 12/10/2021
Field of study

The human genome contains roughly 1,600 transcription factors (TFs) (1), DNA-binding proteins recognizing characteristic sequence motifs to exert regulatory effects on gene expression. The binding specificities of these factors have been profiled both in vitro, using techniques such as HT-SELEX (2), and in vivo, using techniques including ChIP-seq (3, 4). We previously developed Factorbook, a TF-centric database of annotations, motifs, and integrative analyses based on ChIP-seq data from Phase II of the ENCODE Project. Here we present an update to Factorbook which significantly expands the breadth of cell type and TF coverage. The update includes an expanded motif catalog derived from thousands of ENCODE Phase II and III ChIP-seq experiments and HT-SELEX experiments; this motif catalog is integrated with the ENCODE registry of candidate cis-regulatory elements to annotate a comprehensive collection of genome-wide candidate TF binding sites. The database also offers novel tools for applying the motif models within machine learning frameworks and using these models for integrative analysis, including annotation of variants and disease and trait heritability. We will continue to expand the resource as ENCODE Phase IV data are released

PubMed Central

eScholarship@UMMS

Differential analysis of chromatin accessibility and histone modifications for predicting mouse developmental enhancers

Author: Fan Kaili
Fu Shaliu
Gu Cuihua
Jiang Cizhong
Kundaje Anshul
Lu Aiping
Moore Jill E.
Pratt Henry E.
Purcaro Michael J.
Wang Qin
Weng Zhiping
Zhu Ruixin
Publication venue: eScholarship@UMassChan
Publication date: 22/08/2018
Field of study

Enhancers are distal cis-regulatory elements that modulate gene expression. They are depleted of nucleosomes and enriched in specific histone modifications; thus, calling DNase-seq and histone mark ChIP-seq peaks can predict enhancers. We evaluated nine peak-calling algorithms for predicting enhancers validated by transgenic mouse assays. DNase and H3K27ac peaks were consistently more predictive than H3K4me1/2/3 and H3K9ac peaks. DFilter and Hotspot2 were the best DNase peak callers, while HOMER, MUSIC, MACS2, DFilter and F-seq were the best H3K27ac peak callers. We observed that the differential DNase or H3K27ac signals between two distant tissues increased the area under the precision-recall curve (PR-AUC) of DNase peaks by 17.5-166.7% and that of H3K27ac peaks by 7.1-22.2%. We further improved this differential signal method using multiple contrast tissues. Evaluated using a blind test, the differential H3K27ac signal method substantially improved PR-AUC from 0.48 to 0.75 for predicting heart enhancers. We further validated our approach using postnatal retina and cerebral cortex enhancers identified by massively parallel reporter assays, and observed improvements for both tissues. In summary, we compared nine peak callers and devised a superior method for predicting tissue-specific mouse developmental enhancers by reranking the called peaks

eScholarship@UMMS

Expanded encyclopaedias of DNA elements in the human and mouse genomes

Author: A Breschi
A Frankish
A Tanay
AG West
Alec Victorsen
Alexander Dobin
Ali Mortazavi
Anshul Kundaje
AS Hinrichs
Axel Visel
Barbara Wold
BE Bernstein
Bing Ren
Bradley E. Bernstein
Brenton R. Graveley
Brian A. Williams
CA Sloan
Carrie A. Davis
Charles B. Epstein
Cheryl A. Keller
Christopher B. Burge
D Dominguez
D Thanos
David M. Gilbert
David U. Gorkin
Diane E. Dickel
E Lieberman-Aiden
EL Van Nostrand
ENCODE Project Consortium
ENCODE Project Consortium
ENCODE Project Consortium
EP Nora
Eric L. Van Nostrand
Eric Lécuyer
Eric M. Mendenhall
F Yue
Florencia Pauli-Behn
G Kelsey
G Xiang
Gene W. Yeo
H Li
Henry E. Pratt
J Ernst
J Shendure
J van Arensbergen
J Wang
J. Michael Cherry
Jack Huey
JC Rivera-Mulia
JC Rivera-Mulia
JC Rivera-Mulia
JD Buenrostro
JE Phillips
Jessica Halow
Jessika Adrian
Jialing Zhang
Jill E. Moore
Jing Zhang
Job Dekker
Joel Rozowsky
John A. Stamatoyannopoulos
John Rinn
Joseph R. Ecker
JR Dixon
JR Dixon
Juan Carlos Rivera-Mulia
Kevin P. White
KK-H Farh
KS Pollard
LA Pennacchio
Len A. Pennacchio
M Kanamori-Katayama
Manolis Kellis
Mark B. Gerstein
Mark Mackiewicz
ME Ritchie
Michael J. Purcaro
Michael P. Snyder
MJ Fullwood
MO Dorschner
MP Creyghton
N Lambert
ND Heintzman
Noam Shoresh
O Ram
P Batut
Peggy J. Farnham
Peter Freese
R Lister
R Tewhey
Rajinder Kaul
RE Thurman
Richard M. Myers
Roadmap Epigenomics Consortium et al
Robert J. Klein
Roderic Guigó
Ross C. Hardison
S Djebali
S Gerstberger
S Rahmanian
SA Lambert
SF Cai
SG Landt
Shaimae I. Elhajjajy
SSP Rao
Surya B. Chhetri
The ENCODE Project Consortium
Thomas R. Gingeras
Trupti Kawli
Valentina Snetkova
WB Langdon
William S. Noble
X-O Zhang
Xiao-Ou Zhang
Xiaofeng Wang
Xintao Wei
Y Zhang
Yin Shen
Yupeng He
Zhiping Weng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

All data are available on the ENCODE data portal: www.encodeproject. org. All code is available on GitHub from the links provided in the methods section. Code related to the Registry of cCREs can be found at https:// github.com/weng-lab/ENCODE-cCREs. Code related to SCREEN can be found at https://github.com/weng-lab/SCREEN.© The Author(s) 2020. The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE1 and Roadmap Epigenomics2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.This work was supported by grants from the NIH under U01HG007019, U01HG007033, U01HG007036, U01HG007037, U41HG006992, U41HG006993, U41HG006994, U41HG006995, U41HG006996, U41HG006997, U41HG006998, U41HG006999, U41HG007000, U41HG007001, U41HG007002, U41HG007003, U54HG006991, U54HG006997, U54HG006998, U54HG007004, U54HG007005, U54HG007010 and UM1HG009442

Crossref

DSpace@MIT

Cold Spring Harbor Laboratory Institutional Repository

Serveur académique lausannois

eScholarship - University of California

Caltech Authors

Spiral - Imperial College Digital Repository

UPF Digital Repository

Bern Open Repository and Information System (BORIS)

Brunel University Research Archive

A curated benchmark of enhancer-gene interactions for evaluating enhancer-target gene prediction methods

Author: Moore Jill E.
Pratt Henry E.
Purcaro Michael J.
Weng Zhiping
Publication venue: eScholarship@UMassChan
Publication date: 22/01/2020
Field of study

BACKGROUND: Many genome-wide collections of candidate cis-regulatory elements (cCREs) have been defined using genomic and epigenomic data, but it remains a major challenge to connect these elements to their target genes. RESULTS: To facilitate the development of computational methods for predicting target genes, we develop a Benchmark of candidate Enhancer-Gene Interactions (BENGI) by integrating the recently developed Registry of cCREs with experimentally derived genomic interactions. We use BENGI to test several published computational methods for linking enhancers with genes, including signal correlation and the TargetFinder and PEP supervised learning methods. We find that while TargetFinder is the best-performing method, it is only modestly better than a baseline distance method for most benchmark datasets when trained and tested with the same cell type and that TargetFinder often does not outperform the distance method when applied across cell types. CONCLUSIONS: Our results suggest that current computational methods need to be improved and that BENGI presents a useful framework for method development and testing

eScholarship@UMMS

Expanded encyclopaedias of DNA elements in the human and mouse genomes

Author: Bernstein Bradley E.
Cherry J. Michael
Dekker Job
Elhajjajy Shaimae I.
ENCODE Project Consortium
Gerstein Mark B.
Gingeras Thomas R.
Graveley Brenton R.
Hardison Ross C.
Huey Jack
Moore Jill E.
Myers Richard M.
Pennacchio Len A.
Pratt Henry E.
Purcaro Michael J.
Ren Bing
Snyder Michael P.
Stamatoyannopoulos John A.
Weng Zhiping
Wold Barbara
Zhang Xiao-Ou
Publication venue: eScholarship@UMassChan
Publication date: 29/07/2020
Field of study

The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE(1) and Roadmap Epigenomics(2) data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes

eScholarship@UMMS

Neuronal and glial 3D chromatin architecture informs the cellular etiology of brain disorders

Author: Akbarian Schahram
Borrman Tyler M.
Crowley Cheynna A.
Dracheva Stella
Geschwind Daniel H.
Hu Benxia
Huey Jack
Kassim Bibi
Kozlenkov Alexey
Li Yun
Mah Won
Mattei Eugenio
Moore Jill E.
Park Royce B.
Pochareddy Sirisha
Pratt Henry E.
PsychENCODE Consortium
Purcaro Michael J.
Sestan Nenad
Spiess Keeley
Weng Zhiping
Won Hyejung
Publication venue: eScholarship@UMassChan
Publication date: 25/06/2021
Field of study

Cellular heterogeneity in the human brain obscures the identification of robust cellular regulatory networks, which is necessary to understand the function of non-coding elements and the impact of non-coding genetic variation. Here we integrate genome-wide chromosome conformation data from purified neurons and glia with transcriptomic and enhancer profiles, to characterize the gene regulatory landscape of two major cell classes in the human brain. We then leverage cell-type-specific regulatory landscapes to gain insight into the cellular etiology of several brain disorders. We find that Alzheimer\u27s disease (AD)-associated epigenetic dysregulation is linked to neurons and oligodendrocytes, whereas genetic risk factors for AD highlighted microglia, suggesting that different cell types may contribute to disease risk, via different mechanisms. Moreover, integration of glutamatergic and GABAergic regulatory maps with genetic risk factors for schizophrenia (SCZ) and bipolar disorder (BD) identifies shared (parvalbumin-expressing interneurons) and distinct cellular etiologies (upper layer neurons for BD, and deeper layer projection neurons for SCZ). Collectively, these findings shed new light on cell-type-specific gene regulatory networks in brain disorders

eScholarship@UMMS

A prospective study of loss of consciousness in epilepsy using virtual reality driving simulation and other video games

Author: Anaya Joseph
Astur Robert
Blumenfeld Hal
Bod Jessica
Danielson Nathan
DeSalvo Matthew N
Detyniecki Kamil
Duckrow Robert B
Elrich Susan
Farooque Pue
Hamid Hamada
Huh Linda
Kurashvili Pimen
Manza Peter
Morland Thomas B
Motelow Joshua E
Naidu Yamini
Narasimhan Poojitha
Oh Taemin
Padin-Rosado Jose
Peng Kathy
Purcaro Michael J
Ransom Christopher B
Raouf Saned
Rawson Elizabeth
Schmits Kristen
Srinivasan Aditya
Wilkerson Jerome
Xiao Bo
Yang Li
Publication venue: Elsevier Inc
Publication date: 01/01/2010
Field of study

Patients with epilepsy are at risk of traffic accidents when they have seizures while driving. However, driving is an essential part of normal daily life in many communities, and depriving patients of driving privileges can have profound consequences for their economic and social well-being. In the current study, we collected ictal performance data from a driving simulator and two other video games in patients undergoing continuous video/EEG monitoring. We captured 22 seizures in 13 patients and found that driving impairment during seizures differed in terms of both magnitude and character, depending on the seizure type. Our study documents the feasibility of a prospective study of driving and other behaviors during seizures through the use of computer-based tasks. This methodology may be applied to further describe differential driving impairment in specific types of seizures and to gain data on anatomical networks disrupted in seizures that impair consciousness and driving safety

PubMed Central

University of Miami: Scholarship Miami

Focal BOLD fMRI changes in bicuculline-induced tonic–clonic seizures in the rat

Author: Ackermann
Andre
Asht M. Mishra
Benner
Blumenfeld
Blumenfeld
Blumenfeld
Brevard
Browning
Chahboune
David
Dibbens
Enev
Engel
Englot
Fahmeed Hyder
Faingold
Foerster
Gale
Gruetter
Hal Blumenfeld
Joshua E. Motelow
Karpova
Klein
Lehmann
Macdonald
Matthew N. DeSalvo
McCown
McIntyre
McNally
Meeren
Meeren
Michael J. Purcaro
Motelow
Nathan Danielson
Nersesyan
Nersesyan
Paxinos
Sanganahalli
Schindler
Schridde
Shulman
Shulman
Smith
Strauss
Ulrich Schridde
Varghese
Xiaoxiao Bai
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Unusual form of cardiac rupture: Sealed subacute left ventricular free wall rupture, evolving to intramyocardial dissecting hematoma and to pseudoaneurysm formation— A case report and review of the literature

Author: Adamick
Amram J. Cohen
Aronstein
Balakumaran
Bates
Benjamin Medalion
Blinc
Brown
Burn
Catherwood
Clearkin
Coma-Canella
David Harpaz
Davidson
Desoutter
Dvorak
Figueras
Figueras
Frances
Gatewood
Grollier
Grube
Gueron
Hurst
Juergens
Killen
Kretz
Lee
Lee
Lewis
López-Sendón
Mahoney
March
Mcllmoyle
Michael Kriwisky
Milgalter
Mundth
Natarajan
NuÑez
O'Rourke
Pappas
Pifarre
Pliam
Purcaro
Qizilbash
Reddy
Rittenhouse
Roelandt
Shapira
Stewart
Sutherland
Van Tassel
Vlodaver
Yeo
Yoseph Rozenman
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref