Search CORE

105 research outputs found

When needles look like hay: How to find tissue-specific enhancers in model organism genomes

Author: Haeussler Maximilian
Joly Jean-Stéphane
Publication venue: Elsevier Inc.
Publication date
Field of study

AbstractA major prerequisite for the investigation of tissue-specific processes is the identification of cis-regulatory elements. No generally applicable technique is available to distinguish them from any other type of genomic non-coding sequence. Therefore, researchers often have to identify these elements by elaborate in vivo screens, testing individual regions until the right one is found.Here, based on many examples from the literature, we summarize how functional enhancers have been isolated from other elements in the genome and how they have been characterized in transgenic animals. Covering computational and experimental studies, we provide an overview of the global properties of cis-regulatory elements, like their specific interactions with promoters and target gene distances. We describe conserved non-coding elements (CNEs) and their internal structure, nucleotide composition, binding site clustering and overlap, with a special focus on developmental enhancers. Conflicting data and unresolved questions on the nature of these elements are highlighted. Our comprehensive overview of the experimental shortcuts that have been found in the different model organism communities and the new field of high-throughput assays should help during the preparation phase of a screen for enhancers. The review is accompanied by a list of general guidelines for such a project

Elsevier - Publisher Connector

Recommended from our members

Massively parallel profiling and predictive modeling of the outcomes of CRISPR/Cas9-mediated double-strand break repair.

Author: Agarwal Vikram
Chen Wei
Haeussler Maximilian
McKenna Aaron
Noble William Stafford
Schreiber Jacob
Shendure Jay
Yin Yi
Publication venue: eScholarship, University of California
Publication date: 01/09/2019
Field of study

Non-homologous end-joining (NHEJ) plays an important role in double-strand break (DSB) repair of DNA. Recent studies have shown that the error patterns of NHEJ are strongly biased by sequence context, but these studies were based on relatively few templates. To investigate this more thoroughly, we systematically profiled ∼1.16 million independent mutational events resulting from CRISPR/Cas9-mediated cleavage and NHEJ-mediated DSB repair of 6872 synthetic target sequences, introduced into a human cell line via lentiviral infection. We find that: (i) insertions are dominated by 1 bp events templated by sequence immediately upstream of the cleavage site, (ii) deletions are predominantly associated with microhomology and (iii) targets exhibit variable but reproducible diversity with respect to the number and relative frequency of the mutational outcomes to which they give rise. From these data, we trained a model that uses local sequence context to predict the distribution of mutational outcomes. Exploiting the bias of NHEJ outcomes towards microhomology mediated events, we demonstrate the programming of deletion patterns by introducing microhomology to specific locations in the vicinity of the DSB site. We anticipate that our results will inform investigations of DSB repair mechanisms as well as the design of CRISPR/Cas9 experiments for diverse applications including genome-wide screens, gene therapy, lineage tracing and molecular recording

eScholarship - University of California

Recommended from our members

Evaluation and rational design of guide RNAs for efficient CRISPR/Cas9-mediated mutagenesis in Ciona

Author: Christiaen Lionel
Gandhi Shashank
Haeussler Maximilian
Razy-Krajka Florian
Stolfi Alberto
Publication venue: 'Elsevier BV'
Publication date: 01/05/2017
Field of study

The CRISPR/Cas9 system has emerged as an important tool for various genome engineering applications. A current obstacle to high throughput applications of CRISPR/Cas9 is the imprecise prediction of highly active single guide RNAs (sgRNAs). We previously implemented the CRISPR/Cas9 system to induce tissue-specific mutations in the tunicate Ciona. In the present study, we designed and tested 83 single guide RNA (sgRNA) vectors targeting 23 genes expressed in the cardiopharyngeal progenitors and surrounding tissues of Ciona embryo. Using high-throughput sequencing of mutagenized alleles, we identified guide sequences that correlate with sgRNA mutagenesis activity and used this information for the rational design of all possible sgRNAs targeting the Ciona transcriptome. We also describe a one-step cloning-free protocol for the assembly of sgRNA expression cassettes. These cassettes can be directly electroporated as unpurified PCR products into Ciona embryos for sgRNA expression in vivo, resulting in high frequency of CRISPR/Cas9-mediated mutagenesis in somatic cells of electroporated embryos. We found a strong correlation between the frequency of an Ebf loss-of-function phenotype and the mutagenesis efficacies of individual Ebf-targeting sgRNAs tested using this method. We anticipate that our approach can be scaled up to systematically design and deliver highly efficient sgRNAs for the tissue-specific investigation of gene functions in Ciona

eScholarship - University of California

Caltech Authors

Text-mining assisted regulatory annotation

Author: Aerts Stein
Bergman Casey M.
Griffith Obi L.
Haeussler Maximilian
Haussler Maximilian
Hulpiau Paco
Jones Steven J M
Montgomery Stephen B.
van Vooren Steven
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Text-mining technologies can be integrated with genome annotation systems, increasing the availability of annotated cis-regulatory data

Lirias

Crossref

Springer - Publisher Connector

Ghent University Academic Bibliography

PubMed Central

The University of Manchester - Institutional Repository

ProdInra

Recommended from our members

Single-cell genomics identifies cell type-specific molecular changes in autism.

Author: Bhaduri Aparna
Goyal Nitasha
Haeussler Maximilian
Jung Diane
Kriegstein Arnold R
Mayer Simone
Perez Yonatan
Rowitch David H
Schirmer Lucas
Velmeshev Dmitry
Publication venue: Science
Publication date: 17/05/2019
Field of study

Despite the clinical and genetic heterogeneity of autism, bulk gene expression studies show that changes in the neocortex of autism patients converge on common genes and pathways. However, direct assessment of specific cell types in the brain affected by autism has not been feasible until recently. We used single-nucleus RNA sequencing of cortical tissue from patients with autism to identify autism-associated transcriptomic changes in specific cell types. We found that synaptic signaling of upper-layer excitatory neurons and the molecular state of microglia are preferentially affected in autism. Moreover, our results show that dysregulation of specific groups of genes in cortico-cortical projection neurons correlates with clinical severity of autism. These findings suggest that molecular changes in upper-layer cortical circuits are linked to behavioral manifestations of autism

eScholarship - University of California

Apollo (Cambridge)

HNRNPA1 promotes recognition of splice site decoys by U2AF2 in vivo

Author: Draper Jolene M.
Haeussler Maximilian
Howard Jonathan M.
Katzman Sol
Kim Garam
Lin Hai
Liu Yunlong
Sanford Jeremy R.
Toloue Masoud
Wallace Andrew J.
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/05/2018
Field of study

Alternative pre-mRNA splicing plays a major role in expanding the transcript output of human genes. This process is regulated, in part, by the interplay of trans-acting RNA binding proteins (RBPs) with myriad cis-regulatory elements scattered throughout pre-mRNAs. These molecular recognition events are critical for defining the protein-coding sequences (exons) within pre-mRNAs and directing spliceosome assembly on noncoding regions (introns). One of the earliest events in this process is recognition of the 3' splice site (3'ss) by U2 small nuclear RNA auxiliary factor 2 (U2AF2). Splicing regulators, such as the heterogeneous nuclear ribonucleoprotein A1 (HNRNPA1), influence spliceosome assembly both in vitro and in vivo, but their mechanisms of action remain poorly described on a global scale. HNRNPA1 also promotes proofreading of 3'ss sequences though a direct interaction with the U2AF heterodimer. To determine how HNRNPA1 regulates U2AF-RNA interactions in vivo, we analyzed U2AF2 RNA binding specificity using individual-nucleotide resolution crosslinking immunoprecipitation (iCLIP) in control and HNRNPA1 overexpression cells. We observed changes in the distribution of U2AF2 crosslinking sites relative to the 3'ss of alternative cassette exons but not constitutive exons upon HNRNPA1 overexpression. A subset of these events shows a concomitant increase of U2AF2 crosslinking at distal intronic regions, suggesting a shift of U2AF2 to "decoy" binding sites. Of the many noncanonical U2AF2 binding sites, Alu-derived RNA sequences represented one of the most abundant classes of HNRNPA1-dependent decoys. We propose that one way HNRNPA1 regulates exon definition is to modulate the interaction of U2AF2 with decoy or bona fide 3'ss

IUPUIScholarWorks

eScholarship - University of California

AVADA improves automated genetic variant database construction directly from full-text literature

Author: Bejerano Gill
Bernstein Jonathan
Birgmeier Johannes
Cooper David
Deisseroth Cole
Haeussler Maximilian
Jagadeesh Karthik
Stenson Peter
Tierno Andrew
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 04/11/2018
Field of study

Purpose: The primary literature on human genetic diseases includes descriptions of pathogenic variants that are essential for clinical diagnosis. Variant databases such as ClinVar and HGMD collect pathogenic variants by manual curation. We aimed to automatically construct a freely accessible database of pathogenic variants directly from full-text articles about genetic disease. Methods: AVADA (Automatically curated VAriant DAtabase) is a novel machine learning tool that uses natural language processing to automatically identify pathogenic variants and genes in full text of primary literature and converts them to genomic coordinates for rapid downstream use. Results: AVADA automatically curated almost 60% of pathogenic variants deposited in HGMD, a 4.4-fold improvement over the current state of the art in automated variant extraction. AVADA also contains more than 60,000 pathogenic variants that are in HGMD, but not in ClinVar. In a cohort of 245 diagnosed patients, AVADA correctly annotated 38 previously described diagnostic variants, compared to 43 using HGMD, 20 using ClinVar and only 13 (wholly subsumed by AVADA and ClinVar's) using the best automated abstracts-only based approach. Conclusion: AVADA is the first machine learning tool that automatically curates a variants database directly from full text literature. AVADA is available upon publication at http://bejerano.stanford.edu/AVADA

Online Research @ Cardiff

Characterization of the neural stem cell gene regulatory network identifies OLIG2 as a multifunctional regulator of self-renewal

Author: Ben Martynoga
Daniela Drechsel
Debbie L.C. van den Berg
Diogo S. Castro
François Guillemot
Gregory E. Crawford
Joachim Wittbrodt
Juan L. Mateo
Laurence Ettwiller
Maximilian Haeussler
Paul Flicek
Paul Robson
Q. Richard Lu
Zachary B. Gaber
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 07/10/2014
Field of study

The gene regulatory network (GRN) that supports neural stem cell (NS cell) self-renewal has so far been poorly characterized. Knowledge of the central transcription factors (TFs), the noncoding gene regulatory regions that they bind to, and the genes whose expression they modulate will be crucial in unlocking the full therapeutic potential of these cells. Here, we use DNase-seq in combination with analysis of histone modifications to identify multiple classes of epigenetically and functionally distinct cis-regulatory elements (CREs). Through motif analysis and ChIP-seq, we identify several of the crucial TF regulators of NS cells. At the core of the network are TFs of the basic helix-loop-helix (bHLH), nuclear factor I (NFI), SOX, and FOX families, with CREs often densely bound by several of these different TFs. We use machine learning to highlight several crucial regulatory features of the network that underpin NS cell self-renewal and multipotency. We validate our predictions by functional analysis of the bHLH TF OLIG2. This TF makes an important contribution to NS cell self-renewal by concurrently activating pro-proliferation genes and preventing the untimely activation of genes promoting neuronal differentiation and stem cell quiescence.Welcome Trust grants: (WT095908, WT098051), FEBS Long-Term Fellowship, Medical Research Council Grant-in-Aid (U117570528)

Access to Research and Communications Annals

Crossref

PubMed Central

AMELIE speeds Mendelian diagnosis by matching patient phenotype and genotype to primary literature

Author: Beggs Alan H.
Bejerano Gill
Bernstein Jonathan A.
Birgmeier Johannes
Cooper David N.
Deisseroth Cole A.
Diekhans Mark E.
Guturu Harendra
Haeussler Maximilian
Jagadeesh Karthik A.
Ratner Alexander J.
Ré Christopher
Steinberg Ethan H.
Stenson Peter D.
Wenger Aaron M.
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 20/05/2020
Field of study

The diagnosis of Mendelian disorders requires labor-intensive literature research. Trained clinicians can spend hours looking for the right publication(s) supporting a single gene that best explains a patient’s disease. AMELIE (Automatic Mendelian Literature Evaluation) greatly accelerates this process. AMELIE parses all 29 million PubMed abstracts and downloads and further parses hundreds of thousands of full-text articles in search of information supporting the causality and associated phenotypes of most published genetic variants. AMELIE then prioritizes patient candidate variants for their likelihood of explaining any patient’s given set of phenotypes. Diagnosis of singleton patients (without relatives’ exomes) is the most time-consuming scenario, and AMELIE ranked the causative gene at the very top for 66% of 215 diagnosed singleton Mendelian patients from the Deciphering Developmental Disorders project. Evaluating only the top 11 AMELIE-scored genes of 127 (median) candidate genes per patient resulted in a rapid diagnosis in more than 90% of cases. AMELIE-based evaluation of all cases was 3 to 19 times more efficient than hand-curated database–based approaches. We replicated these results on a retrospective cohort of clinical cases from Stanford Children’s Health and the Manton Center for Orphan Disease Research. An analysis web portal with our most recent update, programmatic interface, and code is available at AMELIE.stanford.edu

Online Research @ Cardiff

PubMed Central

eScholarship - University of California