65 research outputs found

    A method for comparing multiple imputation techniques: A case study on the U.S. national COVID cohort collaborative.

    Get PDF
    Healthcare datasets obtained from Electronic Health Records have proven to be extremely useful for assessing associations between patients’ predictors and outcomes of interest. However, these datasets often suffer from missing values in a high proportion of cases, whose removal may introduce severe bias. Several multiple imputation algorithms have been proposed to attempt to recover the missing information under an assumed missingness mechanism. Each algorithm presents strengths and weaknesses, and there is currently no consensus on which multiple imputation algorithm works best in a given scenario. Furthermore, the selection of each algorithm’s pa- rameters and data-related modeling choices are also both crucial and challenging

    NSAID use and clinical outcomes in COVID-19 patients: a 38-center retrospective cohort study.

    Get PDF
    BACKGROUND: Non-steroidal anti-inflammatory drugs (NSAIDs) are commonly used to reduce pain, fever, and inflammation but have been associated with complications in community-acquired pneumonia. Observations shortly after the start of the COVID-19 pandemic in 2020 suggested that ibuprofen was associated with an increased risk of adverse events in COVID-19 patients, but subsequent observational studies failed to demonstrate increased risk and in one case showed reduced risk associated with NSAID use. METHODS: A 38-center retrospective cohort study was performed that leveraged the harmonized, high-granularity electronic health record data of the National COVID Cohort Collaborative. A propensity-matched cohort of 19,746 COVID-19 inpatients was constructed by matching cases (treated with NSAIDs at the time of admission) and 19,746 controls (not treated) from 857,061 patients with COVID-19 available for analysis. The primary outcome of interest was COVID-19 severity in hospitalized patients, which was classified as: moderate, severe, or mortality/hospice. Secondary outcomes were acute kidney injury (AKI), extracorporeal membrane oxygenation (ECMO), invasive ventilation, and all-cause mortality at any time following COVID-19 diagnosis. RESULTS: Logistic regression showed that NSAID use was not associated with increased COVID-19 severity (OR: 0.57 95% CI: 0.53-0.61). Analysis of secondary outcomes using logistic regression showed that NSAID use was not associated with increased risk of all-cause mortality (OR 0.51 95% CI: 0.47-0.56), invasive ventilation (OR: 0.59 95% CI: 0.55-0.64), AKI (OR: 0.67 95% CI: 0.63-0.72), or ECMO (OR: 0.51 95% CI: 0.36-0.7). In contrast, the odds ratios indicate reduced risk of these outcomes, but our quantitative bias analysis showed E-values of between 1.9 and 3.3 for these associations, indicating that comparatively weak or moderate confounder associations could explain away the observed associations. CONCLUSIONS: Study interpretation is limited by the observational design. Recording of NSAID use may have been incomplete. Our study demonstrates that NSAID use is not associated with increased COVID-19 severity, all-cause mortality, invasive ventilation, AKI, or ECMO in COVID-19 inpatients. A conservative interpretation in light of the quantitative bias analysis is that there is no evidence that NSAID use is associated with risk of increased severity or the other measured outcomes. Our results confirm and extend analogous findings in previous observational studies using a large cohort of patients drawn from 38 centers in a nationally representative multicenter database

    The Medical Action Ontology: A tool for annotating and analyzing treatments and clinical management of human disease.

    Get PDF
    BACKGROUND: Navigating the clinical literature to determine the optimal clinical management for rare diseases presents significant challenges. We introduce the Medical Action Ontology (MAxO), an ontology specifically designed to organize medical procedures, therapies, and interventions. METHODS: MAxO incorporates logical structures that link MAxO terms to numerous other ontologies within the OBO Foundry. Term development involves a blend of manual and semi-automated processes. Additionally, we have generated annotations detailing diagnostic modalities for specific phenotypic abnormalities defined by the Human Phenotype Ontology (HPO). We introduce a web application, POET, that facilitates MAxO annotations for specific medical actions for diseases using the Mondo Disease Ontology. FINDINGS: MAxO encompasses 1,757 terms spanning a wide range of biomedical domains, from human anatomy and investigations to the chemical and protein entities involved in biological processes. These terms annotate phenotypic features associated with specific disease (using HPO and Mondo). Presently, there are over 16,000 MAxO diagnostic annotations that target HPO terms. Through POET, we have created 413 MAxO annotations specifying treatments for 189 rare diseases. CONCLUSIONS: MAxO offers a computational representation of treatments and other actions taken for the clinical management of patients. Its development is closely coupled to Mondo and HPO, broadening the scope of our computational modeling of diseases and phenotypic features. We invite the community to contribute disease annotations using POET (https://poet.jax.org/). MAxO is available under the open-source CC-BY 4.0 license (https://github.com/monarch-initiative/MAxO). FUNDING: NHGRI 1U24HG011449-01A1 and NHGRI 5RM1HG010860-04

    The Monarch Initiative in 2019: an integrative data and analytic platform connecting phenotypes to genotypes across species.

    Get PDF
    In biology and biomedicine, relating phenotypic outcomes with genetic variation and environmental factors remains a challenge: patient phenotypes may not match known diseases, candidate variants may be in genes that haven\u27t been characterized, research organisms may not recapitulate human or veterinary diseases, environmental factors affecting disease outcomes are unknown or undocumented, and many resources must be queried to find potentially significant phenotypic associations. The Monarch Initiative (https://monarchinitiative.org) integrates information on genes, variants, genotypes, phenotypes and diseases in a variety of species, and allows powerful ontology-based search. We develop many widely adopted ontologies that together enable sophisticated computational analysis, mechanistic discovery and diagnostics of Mendelian diseases. Our algorithms and tools are widely used to identify animal models of human disease through phenotypic similarity, for differential diagnostics and to facilitate translational research. Launched in 2015, Monarch has grown with regards to data (new organisms, more sources, better modeling); new API and standards; ontologies (new Mondo unified disease ontology, improvements to ontologies such as HPO and uPheno); user interface (a redesigned website); and community development. Monarch data, algorithms and tools are being used and extended by resources such as GA4GH and NCATS Translator, among others, to aid mechanistic discovery and diagnostics

    Guidelines for Genome-Scale Analysis of Biological Rhythms

    Get PDF
    Genome biology approaches have made enormous contributions to our understanding of biological rhythms, particularly in identifying outputs of the clock, including RNAs, proteins, and metabolites, whose abundance oscillates throughout the day. These methods hold significant promise for future discovery, particularly when combined with computational modeling. However, genome-scale experiments are costly and laborious, yielding “big data” that are conceptually and statistically difficult to analyze. There is no obvious consensus regarding design or analysis. Here we discuss the relevant technical considerations to generate reproducible, statistically sound, and broadly useful genome-scale data. Rather than suggest a set of rigid rules, we aim to codify principles by which investigators, reviewers, and readers of the primary literature can evaluate the suitability of different experimental designs for measuring different aspects of biological rhythms. We introduce CircaInSilico, a web-based application for generating synthetic genome biology data to benchmark statistical methods for studying biological rhythms. Finally, we discuss several unmet analytical needs, including applications to clinical medicine, and suggest productive avenues to address them

    The Human Phenotype Ontology in 2024: phenotypes around the world.

    Get PDF
    The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English. Since our last report, a total of 2239 new HPO terms and 49235 new HPO annotations were developed, many in collaboration with external groups in the fields of psychiatry, arthrogryposis, immunology and cardiology. The Medical Action Ontology (MAxO) is a new effort to model treatments and other measures taken for clinical management. Finally, the HPO consortium is contributing to efforts to integrate the HPO and the GA4GH Phenopacket Schema into electronic health records (EHRs) with the goal of more standardized and computable integration of rare disease data in EHRs

    Guidelines for Genome-Scale Analysis of Biological Rhythms

    Get PDF
    Genome biology approaches have made enormous contributions to our understanding of biological rhythms, particularly in identifying outputs of the clock, including RNAs, proteins, and metabolites, whose abundance oscillates throughout the day. These methods hold significant promise for future discovery, particularly when combined with computational modeling. However, genome-scale experiments are costly and laborious, yielding ‘big data’ that is conceptually and statistically difficult to analyze. There is no obvious consensus regarding design or analysis. Here we discuss the relevant technical considerations to generate reproducible, statistically sound, and broadly useful genome scale data. Rather than suggest a set of rigid rules, we aim to codify principles by which investigators, reviewers, and readers of the primary literature can evaluate the suitability of different experimental designs for measuring different aspects of biological rhythms. We introduce CircaInSilico, a web-based application for generating synthetic genome biology data to benchmark statistical methods for studying biological rhythms. Finally, we discuss several unmet analytical needs, including applications to clinical medicine, and suggest productive avenues to address them

    Clinical Characteristics, Racial Inequities, and Outcomes in Patients with Breast Cancer and COVID-19: A COVID-19 and Cancer Consortium (CCC19) Cohort Study

    Get PDF
    BACKGROUND: Limited information is available for patients with breast cancer (BC) and coronavirus disease 2019 (COVID-19), especially among underrepresented racial/ethnic populations. METHODS: This is a COVID-19 and Cancer Consortium (CCC19) registry-based retrospective cohort study of females with active or history of BC and laboratory-confirmed severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) infection diagnosed between March 2020 and June 2021 in the US. Primary outcome was COVID-19 severity measured on a five-level ordinal scale, including none of the following complications, hospitalization, intensive care unit admission, mechanical ventilation, and all-cause mortality. Multivariable ordinal logistic regression model identified characteristics associated with COVID-19 severity. RESULTS: 1383 female patient records with BC and COVID-19 were included in the analysis, the median age was 61 years, and median follow-up was 90 days. Multivariable analysis revealed higher odds of COVID-19 severity for older age (aOR per decade, 1.48 [95% CI, 1.32-1.67]); Black patients (aOR 1.74; 95 CI 1.24-2.45), Asian Americans and Pacific Islander patients (aOR 3.40; 95 CI 1.70-6.79) and Other (aOR 2.97; 95 CI 1.71-5.17) racial/ethnic groups; worse ECOG performance status (ECOG PS ≥2: aOR, 7.78 [95% CI, 4.83-12.5]); pre-existing cardiovascular (aOR, 2.26 [95% CI, 1.63-3.15])/pulmonary comorbidities (aOR, 1.65 [95% CI, 1.20-2.29]); diabetes mellitus (aOR, 2.25 [95% CI, 1.66-3.04]); and active and progressing cancer (aOR, 12.5 [95% CI, 6.89-22.6]). Hispanic ethnicity, timing, and type of anti-cancer therapy modalities were not significantly associated with worse COVID-19 outcomes. The total all-cause mortality and hospitalization rate for the entire cohort was 9% and 37%, respectively however, it varied according to the BC disease status. CONCLUSIONS: Using one of the largest registries on cancer and COVID-19, we identified patient and BC-related factors associated with worse COVID-19 outcomes. After adjusting for baseline characteristics, underrepresented racial/ethnic patients experienced worse outcomes compared to non-Hispanic White patients. FUNDING: This study was partly supported by National Cancer Institute grant number P30 CA068485 to Tianyi Sun, Sanjay Mishra, Benjamin French, Jeremy L Warner; P30-CA046592 to Christopher R Friese; P30 CA023100 for Rana R McKay; P30-CA054174 for Pankil K Shah and Dimpy P Shah; KL2 TR002646 for Pankil Shah and the American Cancer Society and Hope Foundation for Cancer Research (MRSG-16-152-01-CCE) and P30-CA054174 for Dimpy P Shah. REDCap is developed and supported by Vanderbilt Institute for Clinical and Translational Research grant support (UL1 TR000445 from NCATS/NIH). The funding sources had no role in the writing of the manuscript or the decision to submit it for publication. CLINICAL TRIAL NUMBER: CCC19 registry is registered on ClinicalTrials.gov, NCT04354701
    corecore