15 research outputs found

    Characterization of universal features of partially methylated domains across tissues and species

    Get PDF
    Abstract: Background: Partially methylated domains (PMDs) are a hallmark of epigenomes in reproducible and specific biological contexts, including cancer cells, the placenta, and cultured cell lines. Existing methods for deciding whether PMDs exist in a sample, as well as their identification, are few, often tailored to specific biological questions, and require high coverage samples for accurate identification. Results: In this study, we outline a set of axioms that take a step towards a functional definition for PMDs, describe an improved method for comparable PMD detection across samples with substantially differing sequencing depths, and refine the decision criteria for whether a sample contains PMDs using a data-driven approach. Applying our method to 267 methylomes from 7 species, we corroborated recent results regarding the general association between replication timing and PMD state, and report identification of several reproducibly “escapee” genes within late-replicating domains that escape the reduced expression and hypomethylation of their immediate genomic neighborhood. We also explored the discordant PMD state of orthologous genes between human and mouse, and observed a directional association of PMD state with gene expression and local gene density. Conclusions: Our improved method makes low sequencing depth, population-level studies of PMD variation possible and our results further refine the model of PMD formation as one where sequence context and regional epigenomic features both play a role in gradual genome-wide hypomethylation

    Clinical validation and utility of Percepta GSC for the evaluation of lung cancer

    Get PDF
    The Percepta Genomic Sequencing Classifier (GSC) was developed to up-classify as well as down-classify the risk of malignancy for lung lesions when bronchoscopy is non-diagnostic. We evaluated the performance of Percepta GSC in risk re-classification of indeterminate lung lesions. This multicenter study included individuals who currently or formerly smoked undergoing bronchoscopy for suspected lung cancer from the AEGIS I/ II cohorts and the Percepta Registry. The classifier was measured in normal-appearing bronchial epithelium from bronchial brushings. The sensitivity, specificity, and predictive values were calculated using predefined thresholds. The ability of the classifier to decrease unnecessary invasive procedures was estimated. A set of 412 patients were included in the validation (prevalence of malignancy was 39.6%). Overall, 29% of intermediate-risk lung lesions were down-classified to low-risk with a 91.0% negative predictive value (NPV) and 12.2% of intermediate-risk lesions were up-classified to high-risk with a 65.4% positive predictive value (PPV). In addition, 54.5% of low-risk lesions were down-classified to very low risk with >99% NPV and 27.3% of high-risk lesions were up-classified to very high risk with a 91.5% PPV. If the classifier results were used in nodule management, 50% of patients with benign lesions and 29% of patients with malignant lesions undergoing additional invasive procedures could have avoided these procedures. The Percepta GSC is highly accurate as both a rule-out and rule-in test. This high accuracy of risk re-classification may lead to improved management of lung lesions

    Diagnosis and Treatment Knowledge Graph Modeling Application Based on Chinese Medical Records

    No full text
    In this study, a knowledge graph of Chinese medical record data was constructed based on graph database technology. An entity extraction method based on natural language processing, disambiguation, and reorganization for Chinese medical records is proposed, and dictionaries of drugs and treatment plans are constructed. Examples of applications of the knowledge graph in diagnosis and treatment prediction are given. Experimentally, it is found that the knowledge graph based on the graph database is 116.7% faster than the traditional database in complex relational queries

    Maximizing Small Biopsy Patient Samples: Unified RNA-Seq Platform Assessment of over 120,000 Patient Biopsies

    No full text
    Despite its wide-ranging benefits, whole-transcriptome or RNA exome profiling is challenging to implement in a clinical diagnostic setting. The Unified Assay is a comprehensive workflow wherein exome-enriched RNA-sequencing (RNA-Seq) assays are performed on clinical samples and analyzed by a series of advanced machine learning-based classifiers. Gene expression signatures and rare and/or novel genomic events, including fusions, mitochondrial variants, and loss of heterozygosity were assessed using RNA-Seq data generated from 120,313 clinical samples across three clinical indications (thyroid cancer, lung cancer, and interstitial lung disease). Since its implementation, the data derived from the Unified Assay have allowed significantly more patients to avoid unnecessary diagnostic surgery and have played an important role in guiding follow-up decisions regarding treatment. Collectively, data from the Unified Assay show the utility of RNA-Seq and RNA expression signatures in the clinical laboratory, and their importance to the future of precision medicine

    A Reference Methylome Database and Analysis Pipeline to Facilitate Integrative and Comparative Epigenomics

    No full text
    <div><p>DNA methylation is implicated in a surprising diversity of regulatory, evolutionary processes and diseases in eukaryotes. The introduction of whole-genome bisulfite sequencing has enabled the study of DNA methylation at a single-base resolution, revealing many new aspects of DNA methylation and highlighting the usefulness of methylome data in understanding a variety of genomic phenomena. As the number of publicly available whole-genome bisulfite sequencing studies reaches into the hundreds, reliable and convenient tools for comparing and analyzing methylomes become increasingly important. We present MethPipe, a pipeline for both low and high-level methylome analysis, and MethBase, an accompanying database of annotated methylomes from the public domain. Together these resources enable researchers to extract interesting features from methylomes and compare them with those identified in public methylomes in our database.</p></div

    Comparing biological and technical features of methylome data.

    No full text
    <p>(A) Human methylomes are clustered according to the number and size of promoter HMRs. (B) Correlation between depth of coverage at CpG sites and CpG densities in 1 kb windows for a subset of human methylomes from MethBase. X-axis indicates the index of methylomes sorted by their correlation coefficients. See <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0081148#pone.0081148.s002" target="_blank">Table S1</a> for <i>p</i>-values and the list of samples.</p

    Evolutionary support for commonly observed types of differential methylation.

    No full text
    <p>(A) differential presense of an HMR; (B) partial difference in methylation level; (C) shifting of HMR boundaries; (D) difference in precision of HMR boundaries. Orange bars: HMRs in human; Blue bars: HMRs in chimp; Red bars: DMRs between B cells and neutrophils.</p
    corecore