Search CORE

Harvard University - DASH

Directory of Open Access Journals

The Francis Crick Institute

Long non-coding RNAs: spatial amplifiers that control nuclear structure and gene expression

Author: Engreitz Jesse M.
Guttman Mitchell
Ollikainen Noah
Publication venue: Nature Publishing Group
Publication date: 01/12/2016
Field of study

Over the past decade, it has become clear that mammalian genomes encode thousands of long non-coding RNAs (lncRNAs), many of which are now implicated in diverse biological processes. Recent work studying the molecular mechanisms of several key examples — including Xist, which orchestrates X chromosome inactivation — has provided new insights into how lncRNAs can control cellular functions by acting in the nucleus. Here we discuss emerging mechanistic insights into how lncRNAs can regulate gene expression by coordinating regulatory proteins, localizing to target loci and shaping three-dimensional (3D) nuclear organization. We explore these principles to highlight biological challenges in gene regulation, in which lncRNAs are well-suited to perform roles that cannot be carried out by DNA elements or protein regulators alone, such as acting as spatial amplifiers of regulatory signals in the nucleus

Positional specificity of different transcription factor classes within enhancers

Author: Engreitz Jesse
Grossman Sharon Rachel
Hacohen Nir
Lander Eric Steven
Nguyen Tung H.
Ray John P.
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 08/02/2019
Field of study

Gene expression is controlled by sequence-specific transcription factors (TFs), which bind to regulatory sequences in DNA. TF binding occurs in nucleosome-depleted regions of DNA (NDRs), which generally encompass regions with lengths similar to those protected by nucleosomes. However, less is known about where within these regions specific TFs tend to be found. Here, we characterize the positional bias of inferred binding sites for 103 TFs within ∼500,000 NDRs across 47 cell types. We find that distinct classes of TFs display different binding preferences: Some tend to have binding sites toward the edges, some toward the center, and some at other positions within the NDR. These patterns are highly consistent across cell types, suggesting that they may reflect TF-specific intrinsic structural or functional characteristics. In particular, TF classes with binding sites at NDR edges are enriched for those known to interact with histones and chromatin remodelers, whereas TFs with central enrichment interact with other TFs and cofactors such as p300. Our results suggest distinct regiospecific binding patterns and functions of TF classes within enhancers. Keywords: transcription factor binding; gene regulation; genomics; chromatin structureNational Human Genome Research Institute (U.S.) (Grant 2U54HG003067-10)National Institute of General Medical Sciences (U.S.) (Grant T32GM007753

Content-based microarray search using differential expression profiles

Author: Altman Russ B
Butte Atul J
Chen Rong
Dudley Joel T
Engreitz Jesse M
Morgan Alexander A
Thathoo Rahul
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations. Results We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and <it>p</it>-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3. Conclusions Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets.</p

Springer - Publisher Connector

Directory of Open Access Journals

Neighborhood regulation by lncRNA promoters, transcription, and splicing

Author: Chen Jenny
Engreitz Jesse M.
Guttman Mitchell
Haines Jenna E.
Kane Michael
Lander Eric S.
McDonel Patrick E.
Munson Glen
Perez Elizabeth M.
Publication venue
Publication date: 28/04/2016
Field of study

Mammalian genomes are pervasively transcribed to produce thousands of spliced long noncoding RNAs (lncRNAs), whose functions remain poorly understood. Because recent evidence has implicated several specific lncRNA loci in the local regulation of gene expression, we sought to determine whether such local regulation is a property of many lncRNA loci. We used genetic manipulations to dissect 12 genomic loci that produce lncRNAs and found that 5 of these loci influence the expression of a neighboring gene in cis. Surprisingly, however, none of these effects required the specific lncRNA transcripts themselves and instead involved general processes associated with their production, including enhancer-like activity of gene promoters, the process of transcription, and the splicing of the transcript. Interestingly, such effects are not limited to lncRNA loci: we found similar effects on local gene expression at 4 of 6 protein-coding loci. These results demonstrate that 'crosstalk' among neighboring genes is a prevalent phenomenon that can involve multiple mechanisms and cis regulatory signals, including a novel role for RNA splicing. These mechanisms may explain the function and evolution of some genomic loci that produce lncRNAs

Caltech Authors

RNA-RNA Interactions Enable Specific Targeting of Noncoding RNAs to Nascent Pre-mRNAs and Chromatin Sites

Author: Chow Amy Y.
Engreitz Jesse M.
Grossman Sharon R.
Guttman Mitchell
Lander Eric S.
McDonel Patrick
Russell Pamela
Shishkin Alexander A.
Sirokman Klara
Surka Christine
Publication venue: 'Elsevier BV'
Publication date: 01/06/2014
Field of study

Intermolecular RNA-RNA interactions are used by many noncoding RNAs (ncRNAs) to achieve their diverse functions. To identify these contacts, we developed a method based on RNA antisense purification to systematically map RNA-RNA interactions (RAP-RNA) and applied it to investigate two ncRNAs implicated in RNA processing: U1 small nuclear RNA, a component of the spliceosome, and Malat1, a large ncRNA that localizes to nuclear speckles. U1 and Malat1 interact with nascent transcripts through distinct targeting mechanisms. Using differential crosslinking, we confirmed that U1 directly hybridizes to 5′ splice sites and 5′ splice site motifs throughout introns and found that Malat1 interacts with pre-mRNAs indirectly through protein intermediates. Interactions with nascent pre-mRNAs cause U1 and Malat1 to localize proximally to chromatin at active genes, demonstrating that ncRNAs can use RNA-RNA interactions to target specific pre-mRNAs and genomic sites. RAP-RNA is sensitive to lower abundance RNAs as well, making it generally applicable for investigating ncRNAs

Elsevier - Publisher Connector

Caltech Authors

Deep-coverage whole genome sequences and blood lipids among 16,324 individuals.

Author: Abecasis Goncalo
Alver Maris
Bloom Jonathan M
Chaffin Mark
Correa Adolfo
Cupples L Adrienne
Engreitz Jesse M
Ernst Jason
Esko Tonu
Ganna Andrea
Johnson W Craig
Kathiresan Sekar
Kellis Manolis
Khera Amit V
Lander Eric S
Manichaikul Ani
Mitchell Braxton
Montasser May
Natarajan Pradeep
Neale Benjamin M
NHLBI TOPMed Lipids Working Group
O'Connell Jeffrey R
Peloso Gina M
Perry James A
Poterba Timothy
Rich Stephen S
Ripatti Samuli
Rotter Jerome I
Ruotsalainen Sanni E
Salomaa Veikko
Seed Cotton
Surakka Ida L
Vasan Ramachandran S
Willer Cristen J
Wilson James G
Zekavat Seyedeh Maryam
Zhou Wei
Publication venue: eScholarship, University of California
Publication date: 01/08/2018
Field of study

Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth >29X and analyze genotypes with four quantitative traits-plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for few variants previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation (~30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited but WGS permits simultaneous assessment of monogenic and polygenic models to severe hypercholesterolemia

Directory of Open Access Journals

George Washington University: Health Sciences Research Commons (HSRC)

Recommended from our members

Deep coverage whole genome sequences and plasma lipoprotein(a) in individuals of European and African ancestries.

Author: Alver Maris
Bloom Jonathan
Budoff Matthew
Chaffin Mark
Correa Adolfo
Cupples L Adrienne
Daly Mark J
Engreitz Jesse
Ernst Jason
Esko Tonu
Fu Mao
Ganna Andrea
Handsaker Robert E
Johnson W Craig
Kathiresan Sekar
Kellis Manolis
Manichaikul Ani
McCarroll Steven
Metspalu Andres
Mitchell Braxton D
Natarajan Pradeep
Neale Benjamin M
NHLBI TOPMed Lipids Working Group
Peloso Gina M
Post Wendy
Poterba Timothy
Rich Stephen S
Ripatti Samuli
Rotter Jerome I
Ruotsalainen Sanni
Ryan Kathleen A
Salomaa Veikko
Seed Cotton
Surakka Ida
Tsai Michael
Vasan Ramachandran S
Wilson James G
Yang Chaojie
Zekavat Seyedeh M
Publication venue: eScholarship, University of California
Publication date: 01/07/2018
Field of study

Lipoprotein(a), Lp(a), is a modified low-density lipoprotein particle that contains apolipoprotein(a), encoded by LPA, and is a highly heritable, causal risk factor for cardiovascular diseases that varies in concentrations across ancestries. Here, we use deep-coverage whole genome sequencing in 8392 individuals of European and African ancestry to discover and interpret both single-nucleotide variants and copy number (CN) variation associated with Lp(a). We observe that genetic determinants between Europeans and Africans have several unique determinants. The common variant rs12740374 associated with Lp(a) cholesterol is an eQTL for SORT1 and independent of LDL cholesterol. Observed associations of aggregates of rare non-coding variants are largely explained by LPA structural variation, namely the LPA kringle IV 2 (KIV2)-CN. Finally, we find that LPA risk genotypes confer greater relative risk for incident atherosclerotic cardiovascular diseases compared to directly measured Lp(a), and are significantly associated with measures of subclinical atherosclerosis in African Americans

George Washington University: Health Sciences Research Commons (HSRC)

Publisher Correction: Deep coverage whole genome sequences and plasma lipoprotein(a) in individuals of European and African ancestries.

Author: Alver Maris
Bloom Jonathan
Budoff Matthew
Chaffin Mark
Correa Adolfo
Cupples L Adrienne
Daly Mark J
Engreitz Jesse
Ernst Jason
Esko Tonu
Fu Mao
Ganna Andrea
Handsaker Robert E
Johnson W Craig
Kathiresan Sekar
Kellis Manolis
Manichaikul Ani
McCarroll Steven
Metspalu Andres
Mitchell Braxton D
Natarajan Pradeep
Neale Benjamin M
NHLBI TOPMed Lipids Working Group
Peloso Gina M
Post Wendy
Poterba Timothy
Rich Stephen S
Ripatti Samuli
Rotter Jerome I
Ruotsalainen Sanni
Ryan Kathleen A
Salomaa Veikko
Seed Cotton
Surakka Ida
Tsai Michael
Vasan Ramachandran S
Wilson James G
Yang Chaojie
Zekavat Seyedeh M
Publication venue: eScholarship, University of California
Publication date: 01/08/2018
Field of study

The original version of this article contained an error in the name of the author Ramachandran S. Vasan, which was incorrectly given as Vasan S. Ramachandran. This has now been corrected in both the PDF and HTML versions of the article

George Washington University: Health Sciences Research Commons (HSRC)

Transcriptome-wide Mapping Reveals Widespread Dynamic-Regulated Pseudouridylation of ncRNA and mRNA

Author: Bernstein Douglas A.
Engreitz Jesse M.
Fink Gerald
Guttman Mitchell
Herbst Rebecca H.
Jovanovic Marko
Lander Eric S.
León-Ricardo Brian X.
Mumbach Maxwell R.
Regev Aviv
Satija Rahul
Schwartz Schraga
Publication venue: 'Elsevier BV'
Publication date: 01/08/2014
Field of study

Pseudouridine is the most abundant RNA modification, yet except for a few well-studied cases, little is known about the modified positions and their function(s). Here, we develop Ψ-seq for transcriptome-wide quantitative mapping of pseudouridine. We validate Ψ-seq with spike-ins and de novo identification of previously reported positions and discover hundreds of unique sites in human and yeast mRNAs and snoRNAs. Perturbing pseudouridine synthases (PUS) uncovers which pseudouridine synthase modifies each site and their target sequence features. mRNA pseudouridinylation depends on both site-specific and snoRNA-guided pseudouridine synthases. Upon heat shock in yeast, Pus7p-mediated pseudouridylation is induced at >200 sites, and PUS7 deletion decreases the levels of otherwise pseudouridylated mRNA, suggesting a role in enhancing transcript stability. rRNA pseudouridine stoichiometries are conserved but reduced in cells from dyskeratosis congenita patients, where the PUS DKC1 is mutated. Our work identifies an enhanced, transcriptome-wide scope for pseudouridine and methods to dissect its underlying mechanisms and function

Elsevier - Publisher Connector