24 research outputs found

    ORegAnno 3.0: A community-driven resource for curated regulatory annotation

    Get PDF
    The Open Regulatory Annotation database (ORegAnno) is a resource for curated regulatory annotation. It contains information about regulatory regions, transcription factor binding sites, RNA binding sites, regulatory variants, haplotypes, and other regulatory elements. ORegAnno differentiates itself from other regulatory resources by facilitating crowd-sourced interpretation and annotation of regulatory observations from the literature and highly curated resources. It contains a comprehensive annotation scheme that aims to describe both the elements and outcomes of regulatory events. Moreover, ORegAnno assembles these disparate data sources and annotations into a single, high quality catalogue of curated regulatory information. The current release is an update of the database previously featured in the NAR Database Issue, and now contains 1 948 307 records, across 18 species, with a combined coverage of 334 215 080 bp. Complete records, annotation, and other associated data are available for browsing and download at http://www.oreganno.org/

    DGIdb 5.0: Rebuilding the Drug-Gene Interaction Database for precision medicine and drug discovery platforms

    Get PDF
    The Drug-Gene Interaction Database (DGIdb, https://dgidb.org) is a publicly accessible resource that aggregates genes or gene products, drugs and drug-gene interaction records to drive hypothesis generation and discovery for clinicians and researchers. DGIdb 5.0 is the latest release and includes substantial architectural and functional updates to support integration into clinical and drug discovery pipelines. The DGIdb service architecture has been split into separate client and server applications, enabling consistent data access for users of both the application programming interface (API) and web interface. The new interface was developed in ReactJS, and includes dynamic visualizations and consistency in the display of user interface elements. A GraphQL API has been added to support customizable queries for all drugs, genes, annotations and associated data. Updated documentation provides users with example queries and detailed usage instructions for these new features. In addition, six sources have been added and many existing sources have been updated. Newly added sources include ChemIDplus, HemOnc, NCIt (National Cancer Institute Thesaurus), Drugs@FDA, HGNC (HUGO Gene Nomenclature Committee) and RxNorm. These new sources have been incorporated into DGIdb to provide additional records and enhance annotations of regulatory approval status for therapeutics. Methods for grouping drugs and genes have been expanded upon and developed as independent modular normalizers during import. The updates to these sources and grouping methods have resulted in an improvement in FAIR (findability, accessibility, interoperability and reusability) data representation in DGIdb

    CIViCpy: A Python software evelopment and analysis toolkit for the CIViC knowledgebase

    Get PDF
    PURPOSE: Precision oncology depends on the matching of tumor variants to relevant knowledge describing the clinical significance of those variants. We recently developed the Clinical Interpretations for Variants in Cancer (CIViC; civicdb.org) crowd-sourced, expert-moderated, and open-access knowledgebase. CIViC provides a structured framework for evaluating genomic variants of various types (eg, fusions, single-nucleotide variants) for their therapeutic, prognostic, predisposing, diagnostic, or functional utility. CIViC has a documented application programming interface for accessing CIViC records: assertions, evidence, variants, and genes. Third-party tools that analyze or access the contents of this knowledgebase programmatically must leverage this application programming interface, often reimplementing redundant functionality in the pursuit of common analysis tasks that are beyond the scope of the CIViC Web application. METHODS: To address this limitation, we developed CIViCpy (civicpy.org), a software development kit for extracting and analyzing the contents of the CIViC knowledgebase. CIViCpy enables users to query CIViC content as dynamic objects in Python. We assess the viability of CIViCpy as a tool for advancing individualized patient care by using it to systematically match CIViC evidence to observed variants in patient cancer samples. RESULTS: We used CIViCpy to evaluate variants from 59,437 sequenced tumors of the American Association for Cancer Research Project GENIE data set. We demonstrate that CIViCpy enables annotation of \u3e 1,200 variants per second, resulting in precise variant matches to CIViC level A (professional guideline) or B (clinical trial) evidence for 38.6% of tumors. CONCLUSION: The clinical interpretation of genomic variants in cancers requires high-throughput tools for interoperability and analysis of variant interpretation knowledge. These needs are met by CIViCpy, a software development kit for downstream applications and rapid analysis. CIViCpy is fully documented, open-source, and available free online

    Personalized ctDNA micro-panels can monitor and predict clinical outcomes for patients with triple-negative breast cancer

    Get PDF
    Circulating tumor DNA (ctDNA) in peripheral blood has been used to predict prognosis and therapeutic response for triple-negative breast cancer (TNBC) patients. However, previous approaches typically use large comprehensive panels of genes commonly mutated across all breast cancers. Given the reduction in sequencing costs and decreased turnaround times associated with panel generation, the objective of this study was to assess the use of custom micro-panels for tracking disease and predicting clinical outcomes for patients with TNBC. Paired tumor-normal samples from patients with TNBC were obtained at diagnosis (T0) and whole exome sequencing (WES) was performed to identify somatic variants associated with individual tumors. Custom micro-panels of 4-6 variants were created for each individual enrolled in the study. Peripheral blood was obtained at baseline, during Cycle 1 Day 3, at time of surgery, and in 3-6 month intervals after surgery to assess variant allele fraction (VAF) at different timepoints during disease course. The VAF was compared to clinical outcomes to evaluate the ability of custom micro-panels to predict pathological response, disease-free intervals, and patient relapse. A cohort of 50 individuals were evaluated for up to 48 months post-diagnosis of TNBC. In total, there were 33 patients who did not achieve pathological complete response (pCR) and seven patients developed clinical relapse. For all patients who developed clinical relapse and had peripheral blood obtained ≤ 6 months prior to relapse (n = 4), the custom ctDNA micro-panels identified molecular relapse at an average of 4.3 months prior to clinical relapse. The custom ctDNA panel results were moderately associated with pCR such that during disease monitoring, only 11% of patients with pCR had a molecular relapse, whereas 47% of patients without pCR had a molecular relapse (Chi-Square; p-value = 0.10). In this study, we show that a custom micro-panel of 4-6 markers can be effectively used to predict outcomes and monitor remission for patients with TNBC. These custom micro-panels show high sensitivity for detecting molecular relapse in advance of clinical relapse. The use of these panels could improve patient outcomes through early detection of relapse with preemptive intervention prior to symptom onset

    Open-Sourced CIViC Annotation Pipeline to identify and annotate clinically relevant variants using single-molecule molecular inversion probes

    Get PDF
    PURPOSE: Clinical targeted sequencing panels are important for identifying actionable variants for patients with cancer; however, existing approaches do not provide transparent and rationally designed clinical panels to accommodate the rapidly growing knowledge within oncology. MATERIALS AND METHODS: We used the Clinical Interpretations of Variants in Cancer (CIViC) database to develop an Open-Sourced CIViC Annotation Pipeline (OpenCAP). OpenCAP provides methods to identify variants within the CIViC database, build probes for variant capture, use probes on prospective samples, and link somatic variants to CIViC clinical relevance statements. OpenCAP was tested using a single-molecule molecular inversion probe (smMIP) capture design on 27 cancer samples from 5 tumor types. In total, 2,027 smMIPs were designed to target 111 eligible CIViC variants (61.5 kb of genomic space). RESULTS: When compared with orthogonal sequencing, CIViC smMIP sequencing demonstrated a 95% sensitivity for variant detection (n = 61 of 64 variants). Variant allele frequencies for variants identified on both sequencing platforms were highly concordant (Pearson\u27s CONCLUSION: The OpenCAP design paradigm demonstrates the utility of an open-source and open-access database built on attendant community contributions with peer-reviewed interpretations. Use of a public repository for variant identification, probe development, and variant interpretation provides a transparent approach to build dynamic next-generation sequencing-based oncology panels

    Integrated analysis of genomic and transcriptomic data for the discovery of splice-associated variants in cancer

    Get PDF
    Somatic mutations within non-coding regions and even exons may have unidentified regulatory consequences that are often overlooked in analysis workflows. Here we present RegTools ( www.regtools.org ), a computationally efficient, free, and open-source software package designed to integrate somatic variants from genomic data with splice junctions from bulk or single cell transcriptomic data to identify variants that may cause aberrant splicing. We apply RegTools to over 9000 tumor samples with both tumor DNA and RNA sequence data. RegTools discovers 235,778 events where a splice-associated variant significantly increases the splicing of a particular junction, across 158,200 unique variants and 131,212 unique junctions. To characterize these somatic variants and their associated splice isoforms, we annotate them with the Variant Effect Predictor, SpliceAI, and Genotype-Tissue Expression junction counts and compare our results to other tools that integrate genomic and transcriptomic data. While many events are corroborated by the aforementioned tools, the flexibility of RegTools also allows us to identify splice-associated variants in known cancer drivers, such as TP53, CDKN2A, and B2M, and other genes

    Distinct clonal identities of B-ALLs arising after lenolidomide therapy for multiple myeloma

    Get PDF
    Patients with multiple myeloma (MM) who are treated with lenalidomide rarely develop a secondary B-cell acute lymphoblastic leukemia (B-ALL). The clonal and biological relationship between these sequential malignancies is not yet clear. We identified 17 patients with MM treated with lenalidomide, who subsequently developed B-ALL. Patient samples were evaluated through sequencing, cytogenetics/fluorescence in situ hybridization (FISH), immunohistochemical (IHC) staining, and immunoglobulin heavy chain (IgH) clonality assessment. Samples were assessed for shared mutations and recurrently mutated genes. Through whole exome sequencing and cytogenetics/FISH analysis of 7 paired samples (MM vs matched B-ALL), no mutational overlap between samples was observed. Unique dominant IgH clonotypes between the tumors were observed in 5 paired MM/B-ALL samples. Across all 17 B-ALL samples, 14 (83%) had a TP53 variant detected. Three MM samples with sufficient sequencing depth (\u3e500×) revealed rare cells (average of 0.6% variant allele frequency, or 1.2% of cells) with the same TP53 variant identified in the subsequent B-ALL sample. A lack of mutational overlap between MM and B-ALL samples shows that B-ALL developed as a second malignancy arising from a founding population of cells that likely represented unrelated clonal hematopoiesis caused by a TP53 mutation. The recurrent variants in TP53 in the B-ALL samples suggest a common path for malignant transformation that may be similar to that of TP53-mutant, treatment-related acute myeloid leukemia. The presence of rare cells containing TP53 variants in bone marrow at the initiation of lenalidomide treatment suggests that cellular populations containing TP53 variants expand in the presence of lenalidomide to increase the likelihood of B-ALL development
    corecore