11 research outputs found

    The National COVID Cohort Collaborative (N3C): Rationale, design, infrastructure, and deployment.

    Get PDF
    OBJECTIVE: Coronavirus disease 2019 (COVID-19) poses societal challenges that require expeditious data and knowledge sharing. Though organizational clinical data are abundant, these are largely inaccessible to outside researchers. Statistical, machine learning, and causal analyses are most successful with large-scale data beyond what is available in any given organization. Here, we introduce the National COVID Cohort Collaborative (N3C), an open science community focused on analyzing patient-level data from many centers. MATERIALS AND METHODS: The Clinical and Translational Science Award Program and scientific community created N3C to overcome technical, regulatory, policy, and governance barriers to sharing and harmonizing individual-level clinical data. We developed solutions to extract, aggregate, and harmonize data across organizations and data models, and created a secure data enclave to enable efficient, transparent, and reproducible collaborative analytics. RESULTS: Organized in inclusive workstreams, we created legal agreements and governance for organizations and researchers; data extraction scripts to identify and ingest positive, negative, and possible COVID-19 cases; a data quality assurance and harmonization pipeline to create a single harmonized dataset; population of the secure data enclave with data, machine learning, and statistical analytics tools; dissemination mechanisms; and a synthetic data pilot to democratize data access. CONCLUSIONS: The N3C has demonstrated that a multisite collaborative learning health network can overcome barriers to rapidly build a scalable infrastructure incorporating multiorganizational clinical data for COVID-19 analytics. We expect this effort to save lives by enabling rapid collaboration among clinicians, researchers, and data scientists to identify treatments and specialized care and thereby reduce the immediate and long-term impacts of COVID-19

    Increased Incidence of Vestibular Disorders in Patients With SARS-CoV-2

    Get PDF
    OBJECTIVE: Determine the incidence of vestibular disorders in patients with SARS-CoV-2 compared to the control population. STUDY DESIGN: Retrospective. SETTING: Clinical data in the National COVID Cohort Collaborative database (N3C). METHODS: Deidentified patient data from the National COVID Cohort Collaborative database (N3C) were queried based on variant peak prevalence (untyped, alpha, delta, omicron 21K, and omicron 23A) from covariants.org to retrospectively analyze the incidence of vestibular disorders in patients with SARS-CoV-2 compared to control population, consisting of patients without documented evidence of COVID infection during the same period. RESULTS: Patients testing positive for COVID-19 were significantly more likely to have a vestibular disorder compared to the control population. Compared to control patients, the odds ratio of vestibular disorders was significantly elevated in patients with untyped (odds ratio [OR], 2.39; confidence intervals [CI], 2.29-2.50; CONCLUSIONS: The incidence of vestibular disorders differed between COVID-19 variants and was significantly elevated in COVID-19-positive patients compared to the control population. These findings have implications for patient counseling and further research is needed to discern the long-term effects of these findings

    Synergies between centralized and federated approaches to data quality: a report from the national COVID cohort collaborative

    No full text
    Objective In response to COVID-19, the informatics community united to aggregate as much clinical data as possible to characterize this new disease and reduce its impact through collaborative analytics. The National COVID Cohort Collaborative (N3C) is now the largest publicly available HIPAA limited dataset in US history with over 6.4 million patients and is a testament to a partnership of over 100 organizations. Materials and Methods We developed a pipeline for ingesting, harmonizing, and centralizing data from 56 contributing data partners using 4 federated Common Data Models. N3C data quality (DQ) review involves both automated and manual procedures. In the process, several DQ heuristics were discovered in our centralized context, both within the pipeline and during downstream project-based analysis. Feedback to the sites led to many local and centralized DQ improvements. Results Beyond well-recognized DQ findings, we discovered 15 heuristics relating to source Common Data Model conformance, demographics, COVID tests, conditions, encounters, measurements, observations, coding completeness, and fitness for use. Of 56 sites, 37 sites (66%) demonstrated issues through these heuristics. These 37 sites demonstrated improvement after receiving feedback. Discussion We encountered site-to-site differences in DQ which would have been challenging to discover using federated checks alone. We have demonstrated that centralized DQ benchmarking reveals unique opportunities for DQ improvement that will support improved research analytics locally and in aggregate. Conclusion By combining rapid, continual assessment of DQ with a large volume of multisite data, it is possible to support more nuanced scientific questions with the scale and rigor that they require

    Nonelective coronary artery bypass graft outcomes are adversely impacted by Coronavirus disease 2019 infection, but not altered processes of care: A National COVID Cohort Collaborative and National Surgery Quality Improvement Program analysisCentral MessagePerspective

    No full text
    Objective: The effects of Coronavirus disease 2019 (COVID-19) infection and altered processes of care on nonelective coronary artery bypass grafting (CABG) outcomes remain unknown. We hypothesized that patients with COVID-19 infection would have longer hospital lengths of stay and greater mortality compared with COVID-negative patients, but that these outcomes would not differ between COVID-negative and pre-COVID controls. Methods: The National COVID Cohort Collaborative 2020-2022 was queried for adult patients undergoing CABG. Patients were divided into COVID-negative, COVID-active, and COVID-convalescent groups. Pre-COVID control patients were drawn from the National Surgical Quality Improvement Program database. Adjusted analysis of the 3 COVID groups was performed via generalized linear models. Results: A total of 17,293 patients underwent nonelective CABG, including 16,252 COVID-negative, 127 COVID-active, 367 COVID-convalescent, and 2254 pre-COVID patients. Compared to pre-COVID patients, COVID-negative patients had no difference in mortality, whereas COVID-active patients experienced increased mortality. Mortality and pneumonia were higher in COVID-active patients compared to COVID-negative and COVID-convalescent patients. Adjusted analysis demonstrated that COVID-active patients had higher in-hospital mortality, 30- and 90-day mortality, and pneumonia compared to COVID-negative patients. COVID-convalescent patients had a shorter length of stay but a higher rate of renal impairment. Conclusions: Traditional care processes were altered during the COVID-19 pandemic. Our data show that nonelective CABG in patients with active COVID-19 is associated with significantly increased rates of mortality and pneumonia. The equivalent mortality in COVID-negative and pre-COVID patients suggests that pandemic-associated changes in processes of care did not impact CABG outcomes. Additional research into optimal timing of CABG after COVID infection is warranted

    Evaluating COVID-19 vaccine effectiveness during pre-Delta, Delta and Omicron dominant periods among pregnant people in the U.S.: Retrospective cohort analysis from a nationally sampled cohort in National COVID Collaborative Cohort (N3C)

    No full text
    Objectives To evaluate the effectiveness of COVID-19 vaccinations (initial and booster) during pre-Delta, Delta and Omicron dominant periods among pregnant people via (1) COVID-19 incident and severe infections among pregnant people who were vaccinated versus unvaccinated and (2) post-COVID-19 vaccination breakthrough infections and severe infections among vaccinated females who were pregnant versus non-pregnant.Design Retrospective cohort study using nationally sampled electronic health records data from the National COVID Cohort Collaborative, 10 December 2020 –7 June 2022.Participants Cohort 1 included pregnant people (15–55 years) and cohort 2 included vaccinated females of reproductive age (15–55 years).Exposures (1) COVID-19 vaccination and (2) pregnancy.Main outcome measures Adjusted HRs (aHRs) for COVID-19 incident or breakthrough infections and severe infections (ie, COVID-19 infections with related hospitalisations).Results In cohort 1, 301 107 pregnant people were included. Compared with unvaccinated pregnant people, the aHRs for pregnant people with initial vaccinations during pregnancy of incident COVID-19 were 0.77 (95% CI 0.62 to 0.96) and 0.88 (95% CI 0.73 to 1.07) and aHRs of severe COVID-19 infections were 0.65 (95% CI 0.47 to 0.90) and 0.79 (95% CI 0.51 to 1.21) during the Delta and Omicron periods, respectively. Compared with pregnant people with full initial vaccinations, the aHR of incident COVID-19 for pregnant people with booster vaccinations was 0.64 (95% CI 0.58 to 0.71) during the Omicron period. In cohort 2, 934 337 vaccinated people were included. Compared with vaccinated non-pregnant females, the aHRs of severe COVID-19 infections for people with initial vaccinations during pregnancy was 2.71 (95% CI 1.31 to 5.60) during the Omicron periods.Conclusions Pregnant people with initial and booster vaccinations during pregnancy had a lower risk of incident and severe COVID-19 infections compared with unvaccinated pregnant people across the pandemic stages. However, vaccinated pregnant people still had a higher risk of severe infections compared with non-pregnant females
    corecore