136 research outputs found

    Computational Approaches for the Analysis of Chromosome Conformation Capture Data and Their Application to Study Long-Range Gene Regulation: A Dissertation

    Get PDF
    Over the last decade, development and application of a set of molecular genomic approaches based on the chromosome conformation capture method (3C), combined with increasingly powerful imaging approaches have enabled high resolution and genome-wide analysis of the spatial organization of chromosomes. The aim of this thesis is two-fold; 1), to provide guidelines for analyzing and interpreting data obtained from genome-wide 3C methods such as Hi-C and 3C-seq and 2), to leverage the 3C technology to solve genome function, structure, assembly, development and dosage problems across a broad range of organisms and disease models. First, through the introduction of cWorld, a toolkit for manipulating genome structure data, I accelerate the pace at which *C experiments can be performed, analyzed and biological insights inferred. Next I discuss a set of practical guidelines one should consider while planning an experiment to study the structure of the genome, a simple workflow for data processing unique to *C data and a set of considerations one should be aware of while attempting to gain insights from the data. Next, I apply these guidelines and leverage the cWorld toolkit in the context of two dosage compensation systems. The first is a worm condensin mutant which shows a reduction in dosage compensation in the hermaphrodite X chromosomes. The second is an allele-specific study consisting of genome wide Hi-C, RNA-Seq and ATAC-Seq which can measure the state of the active (Xa) and inactive (Xi) X chromosome. Finally I turn to studying specific gene – enhancer looping interactions across a panel of ENCODE cell-lines. These studies, when taken together, further our understanding of how genome structure relates to genome function

    The long-range interaction landscape of gene promoters

    Get PDF
    The vast non-coding portion of the human genome is full of functional elements and disease-causing regulatory variants. The principles defining the relationships between these elements and distal target genes remain unknown. Promoters and distal elements can engage in looping interactions that have been implicated in gene regulation. Here we have applied chromosome conformation capture carbon copy (5C) to interrogate comprehensively interactions between transcription start sites (TSSs) and distal elements in 1% of the human genome representing the ENCODE pilot project regions. 5C maps were generated for GM12878, K562 and HeLa-S3 cells and results were integrated with data from the ENCODE consortium. In each cell line we discovered \u3e1,000 long-range interactions between promoters and distal sites that include elements resembling enhancers, promoters and CTCF-bound sites. We observed significant correlations between gene expression, promoter-enhancer interactions and the presence of enhancer RNAs. Long-range interactions show marked asymmetry with a bias for interactions with elements located approximately 120 kilobases upstream of the TSS. Long-range interactions are often not blocked by sites bound by CTCF and cohesin, indicating that many of these sites do not demarcate physically insulated gene domains. Furthermore, only approximately 7% of looping interactions are with the nearest gene, indicating that genomic proximity is not a simple predictor for long-range interactions. Finally, promoters and distal elements are engaged in multiple long-range interactions to form complex networks. Our results start to place genes and regulatory elements in three-dimensional context, revealing their functional relationships

    Understanding the mechanisms of IGF2 gene regulation in hepatocellular carcinoma cells

    Get PDF
    Hepatocellular carcinoma (HCC) is a leading cause of cancer-related deaths worldwide. HCC has a very well studied etiology, and is associated with chronic hepatic viral infections (hepatitis viruses B and C), alcohol abuse, or other causes of chronic liver damage. Currently, tumor resection and liver transplantation are the only potentially curative treatments available for HCC. However, the presence of extra-hepatic invasion and metastasis makes patients ineligible for these treatments. High IGF2 levels are associated with metastatic HCC, and we recently showed that IGF2-induced signaling through Igf1R stimulates the invasiveness and metastatic phenotype of HCC cells. However, the precise mechanisms by which IGF2 expression is enhanced in HCC are not well understood. IGF2 is an imprinted gene normally expressed from the paternal allele. Loss of imprinting, which activates the normally silent maternal allele, has been implicated as an epigenetic marker for the enhanced risk of human cancer. However, many HCCs that display elevated IGF2 expression levels retain a normal imprinting pattern. Therefore, additional gene regulation mechanisms must also influence IGF2 expression in HCC. Hypothesis: Long-range genomic interactions are important for the regulation of IGF2 gene expression, and alterations in these long-range interactions lead to elevated IGF2 gene expression in HCC. To address this hypothesis I have utilized chromosome conformation capture carbon copy (5C) technology to elucidate long-range interactions involving the IGF2 promoters in a normal hepatocyte cell line, THLE-2, and an HCC cell line HepG2

    Measuring the reproducibility and quality of Hi-C data

    Get PDF
    BACKGROUND: Hi-C is currently the most widely used assay to investigate the 3D organization of the genome and to study its role in gene regulation, DNA replication, and disease. However, Hi-C experiments are costly to perform and involve multiple complex experimental steps; thus, accurate methods for measuring the quality and reproducibility of Hi-C data are essential to determine whether the output should be used further in a study. RESULTS: Using real and simulated data, we profile the performance of several recently proposed methods for assessing reproducibility of population Hi-C data, including HiCRep, GenomeDISCO, HiC-Spector, and QuASAR-Rep. By explicitly controlling noise and sparsity through simulations, we demonstrate the deficiencies of performing simple correlation analysis on pairs of matrices, and we show that methods developed specifically for Hi-C data produce better measures of reproducibility. We also show how to use established measures, such as the ratio of intra- to interchromosomal interactions, and novel ones, such as QuASAR-QC, to identify low-quality experiments. CONCLUSIONS: In this work, we assess reproducibility and quality measures by varying sequencing depth, resolution and noise levels in Hi-C data from 13 cell lines, with two biological replicates each, as well as 176 simulated matrices. Through this extensive validation and benchmarking of Hi-C data, we describe best practices for reproducibility and quality assessment of Hi-C experiments. We make all software publicly available at http://github.com/kundajelab/3DChromatin_ReplicateQC to facilitate adoption in the community

    Organization of the Mitotic Chromosome

    Get PDF
    Mitotic chromosomes are among the most recognizable structures in the cell, yet for over a century their internal organization remains largely unsolved. We applied chromosome conformation capture methods, 5C and Hi-C, across the cell cycle and revealed two alternative three-dimensional folding states of the human genome. We show that the highly compartmentalized and cell-type-specific organization described previously for non-synchronous cells is restricted to interphase. In metaphase, we identify a homogenous folding state, which is locus-independent, common to all chromosomes, and consistent among cell types, suggesting a general principle of metaphase chromosome organization. Using polymer simulations, we find that metaphase Hi-C data are inconsistent with classic hierarchical models, and is instead best described by a linearly-organized longitudinally compressed array of consecutive chromatin loops.National Cancer Institute (U.S.) (Grant U54CA143874

    Flexible ordering of antibody class switch and V(D)J joining during B-cell ontogeny

    Get PDF
    V(D)J joining is mediated by RAG recombinase during early B-lymphocyte development in the bone marrow (BM). Activation-induced deaminase initiates isotype switching in mature B cells of secondary lymphoid structures. Previous studies questioned the strict ontological partitioning of these processes. We show that pro-B cells undergo robust switching to a subset of immunoglobulin H (IgH) isotypes. Chromatin studies reveal that in pro-B cells, the spatial organization of the Igh locus may restrict switching to this subset of isotypes. We demonstrate that in the BM, V(D)J joining and switching are interchangeably inducible, providing an explanation for the hyper-IgE phenotype of Omenn syndrome

    A User\u27s Guide to the Encyclopedia of DNA Elements (ENCODE)

    Get PDF
    The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome

    Patterns of Growth and Decline in Lung Function in Persistent Childhood Asthma

    Get PDF
    BACKGROUND: Tracking longitudinal measurements of growth and decline in lung function in patients with persistent childhood asthma may reveal links between asthma and subsequent chronic airflow obstruction. METHODS: We classified children with asthma according to four characteristic patterns of lung-function growth and decline on the basis of graphs showing forced expiratory volume in 1 second (FEV1), representing spirometric measurements performed from childhood into adulthood. Risk factors associated with abnormal patterns were also examined. To define normal values, we used FEV1 values from participants in the National Health and Nutrition Examination Survey who did not have asthma. RESULTS: Of the 684 study participants, 170 (25%) had a normal pattern of lung-function growth without early decline, and 514 (75%) had abnormal patterns: 176 (26%) had reduced growth and an early decline, 160 (23%) had reduced growth only, and 178 (26%) had normal growth and an early decline. Lower baseline values for FEV1, smaller bronchodilator response, airway hyperresponsiveness at baseline, and male sex were associated with reduced growth (P \u3c 0.001 for all comparisons). At the last spirometric measurement (mean [+/-SD] age, 26.0+/-1.8 years), 73 participants (11%) met Global Initiative for Chronic Obstructive Lung Disease spirometric criteria for lung-function impairment that was consistent with chronic obstructive pulmonary disease (COPD); these participants were more likely to have a reduced pattern of growth than a normal pattern (18% vs. 3%, P \u3c 0.001). CONCLUSIONS: Childhood impairment of lung function and male sex were the most significant predictors of abnormal longitudinal patterns of lung-function growth and decline. Children with persistent asthma and reduced growth of lung function are at increased risk for fixed airflow obstruction and possibly COPD in early adulthood. (Funded by the Parker B. Francis Foundation and others; ClinicalTrials.gov number, NCT00000575.)

    Cohesin-based chromatin interactions enable regulated gene expression within pre-existing architectural compartments

    Get PDF
    Chromosome conformation capture approaches have shown that interphase chromatin is partitioned into spatially segregated Mb-sized compartments and sub-Mb-sized topological domains. This compartmentalization is thought to facilitate the matching of genes and regulatory elements, but its precise function and mechanistic basis remain unknown. Cohesin controls chromosome topology to enable DNA repair and chromosome segregation in cycling cells. In addition, cohesin associates with active enhancers and promoters and with CTCF to form long-range interactions important for gene regulation. Although these findings suggest an important role for cohesin in genome organization, this role has not been assessed on a global scale. Unexpectedly, we find that architectural compartments are maintained in non-cycling mouse thymocytes after genetic depletion of cohesin in vivo. Cohesin was however required for specific long-range interactions within compartments where cohesin-regulated genes reside. Cohesin depletion diminished interactions between cohesin-bound sites, while alternative interactions between chromatin features associated with transcriptional activation and repression became more prominent, with corresponding changes in gene expression. Our findings indicate that cohesin-mediated long-range interactions facilitate discrete gene expression states within pre-existing chromosomal compartments
    • …
    corecore