8 research outputs found

What Every Reader Should Know About Studies Using Electronic Health Record Data but May Be Afraid to Ask

Coincident with the tsunami of COVID-19-related publications, there has been a surge of studies using real-world data, including those obtained from the electronic health record (EHR). Unfortunately, several of these high-profile publications were retracted because of concerns regarding the soundness and quality of the studies and the EHR data they purported to analyze. These retractions highlight that although a small community of EHR informatics experts can readily identify strengths and flaws in EHR-derived studies, many medical editorial teams and otherwise sophisticated medical readers lack the framework to fully critically appraise these studies. In addition, conventional statistical analyses cannot overcome the need for an understanding of the opportunities and limitations of EHR-derived studies. We distill here from the broader informatics literature six key considerations that are crucial for appraising studies utilizing EHR data: data completeness, data collection and handling (eg, transformation), data type (ie, codified, textual), robustness of methods against EHR variability (within and across institutions, countries, and time), transparency of data and analytic code, and the multidisciplinary approach. These considerations will inform researchers, clinicians, and other stakeholders as to the recommended best practices in reviewing manuscripts, grants, and other outputs from EHR-data derived studies, and thereby promote and foster rigor, quality, and reliability of this rapidly growing field

UCL Discovery

A retrospective cohort analysis leveraging augmented intelligence to characterize long COVID in the electronic health record: A precision medicine framework.

Author: Arianna Dagliati
Consortium for Clinical Characterization of COVID-19 by EHR (4CE)
Darren W Henderson
Gilbert S Omenn
Hossein Estiri
Jeffrey G Klann
John H Holmes
Kavishwar B Wagholikar
Malarkodi Jebathilagam Samayamuthu
Michele Morris
Rebecca Mesa
Shawn N Murphy
Shyam Visweswaran
Yuan Luo
Zachary H Strasser
Zahra Shakeri Hossein Abad
Zongqi Xia
Publication venue: Public Library of Science (PLoS)
Publication date: 01/07/2023

Physical and psychological symptoms lasting months following an acute COVID-19 infection are now recognized as post-acute sequelae of COVID-19 (PASC). Accurate tools for identifying such patients could enhance screening capabilities for the recruitment for clinical trials, improve the reliability of disease estimates, and allow for more accurate downstream cohort analysis. In this retrospective cohort study, we analyzed the EHR of hospitalized COVID-19 patients across three healthcare systems to develop a pipeline for better identifying patients with persistent PASC symptoms (dyspnea, fatigue, or joint pain) after their SARS-CoV-2 infection. We implemented distributed representation learning powered by the Machine Learning for modeling Health Outcomes (MLHO) to identify novel EHR features that could suggest PASC symptoms outside of typical diagnosis codes. MLHO applies an entropy-based feature selection and boosting algorithms for representation mining. These improved definitions were then used for estimating PASC among hospitalized patients. 30,422 hospitalized patients were diagnosed with COVID-19 across three healthcare systems between March 13, 2020 and February 28, 2021. The mean age of the population was 62.3 years (SD, 21.0 years) and 15,124 (49.7%) were female. We implemented the distributed representation learning technique to augment PASC definitions. These definitions were found to have positive predictive values of 0.73, 0.74, and 0.91 for dyspnea, fatigue, and joint pain, respectively. We estimated that 25 percent (CI 95%: 6-48), 11 percent (CI 95%: 6-15), and 13 percent (CI 95%: 8-17) of hospitalized COVID-19 patients will have dyspnea, fatigue, and joint pain, respectively, 3 months or longer after a COVID-19 diagnosis. We present a validated framework for screening and identifying patients with PASC in the EHR and then use the tool to estimate its prevalence among hospitalized COVID-19 patients

Directory of Open Access Journals

Acute respiratory distress syndrome after SARS-CoV-2 infection on young adult population: International observational federated study based on electronic health records through the 4CE consortium.

Purpose: In young adults (18 to 49 years old), investigation of acute respiratory distress syndrome (ARDS) after severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection has been limited. We evaluated the risk factors and outcomes of ARDS following infection with SARS-CoV-2 in a young adult population. Methods: A retrospective cohort study was conducted between January 1st, 2020 and February 28th, 2021 using patient-level electronic health records (EHR), across 241 United States hospitals and 43 European hospitals participating in the Consortium for Clinical Characterization of COVID-19 by EHR (4CE). To identify the risk factors associated with ARDS, we compared young patients with and without ARDS through a federated analysis. We further compared the outcomes between young and old patients with ARDS. Results: Among the 75,377 hospitalized patients with positive SARS-CoV-2 PCR, 1001 young adults presented with ARDS (7.8% of young hospitalized adults). Their mortality rate at 90 days was 16.2% and they presented with a complication rate for infection similar to that of older adults with ARDS. Peptic ulcer disease, paralysis, obesity, congestive heart failure, valvular disease, diabetes, chronic pulmonary disease and liver disease were associated with a higher risk of ARDS. We described a high prevalence of obesity (53%), hypertension (38%, although not significantly associated with ARDS), and diabetes (32%). Conclusion: Through an innovative method, a large international cohort of young adults developing ARDS after SARS-CoV-2 infection has been gathered. It demonstrated the poor outcomes of this population and the associated risk factors.

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Oskar Bordeaux

J Am Med Inform Assoc

INTRODUCTION: The Consortium for Clinical Characterization of COVID-19 by EHR (4CE) is an international collaboration addressing COVID-19 with federated analyses of electronic health record (EHR) data. OBJECTIVE: We sought to develop and validate a computable phenotype for COVID-19 severity. METHODS: Twelve 4CE sites participated. First we developed an EHR-based severity phenotype consisting of six code classes, and we validated it on patient hospitalization data from the 12 4CE clinical sites against the outcomes of ICU admission and/or death. We also piloted an alternative machine-learning approach and compared selected predictors of severity to the 4CE phenotype at one site. RESULTS: The full 4CE severity phenotype had pooled sensitivity of 0.73 and specificity 0.83 for the combined outcome of ICU admission and/or death. The sensitivity of individual code categories for acuity had high variability - up to 0.65 across sites. At one pilot site, the expert-derived phenotype had mean AUC 0.903 (95% CI: 0.886, 0.921), compared to AUC 0.956 (95% CI: 0.952, 0.959) for the machine-learning approach. Billing codes were poor proxies of ICU admission, with as low as 49% precision and recall compared to chart review. DISCUSSION: We developed a severity phenotype using 6 code classes that proved resilient to coding variability across international institutions. In contrast, machine-learning approaches may overfit hospital-specific orders. Manual chart review revealed discrepancies even in the gold-standard outcomes, possibly due to heterogeneous pandemic conditions. CONCLUSION: We developed an EHR-based severity phenotype for COVID-19 in hospitalized patients and validated it at 12 international sites

University of Miami: Scholarship Miami

eScholarship - University of California

Oskar Bordeaux

Validation of an internationally derived patient severity phenotype to support COVID-19 analytics from electronic health record data.

Author: Avillach Paul
Beaulieu-Jones Brett K
Bell Douglas S
Bellazzi Riccardo
Boeker Martin
Brat Gabriel A
Castro Victor
Chiovato Luca
Consortium for Clinical Characterization of COVID-19 by EHR (4CE) (CONSORTIA AUTHOR)
Estiri Hossein
Follett Robert W
Geva Alon
Hanauer David A
Hong Chuan
Hutch Meghan
Jouhet Vianney
Klann Jeffrey G
Kohane Isaac S
Li Anthony LLJ
Loh Ne-Hooi Will
Luo Yuan
Malovini Alberto
Mandl Kenneth D
Maulhardt Thomas
Moal Bertrand
Moore Jason H
Morris Michele
Mowery Danielle L
Murphy Shawn N
Ngiam Kee Yuan
Olson Karen L
Omenn Gilbert S
Rieg Siegbert
Samayamuthu Malarkodi J
Schriver Emily
South Andrew M
Tan Amelia LM
Tibollo Valentina
Visweswaran Shyam
Wagholikar Kavishwar B
Weber Griffin M
Xia Zongqi
Publication venue: eScholarship, University of California
Publication date: 01/07/2021

Objective: The Consortium for Clinical Characterization of COVID-19 by EHR (4CE) is an international collaboration addressing coronavirus disease 2019 (COVID-19) with federated analyses of electronic health record (EHR) data. We sought to develop and validate a computable phenotype for COVID-19 severity. Materials and methods: Twelve 4CE sites participated. First, we developed an EHR-based severity phenotype consisting of 6 code classes, and we validated it on patient hospitalization data from the 12 4CE clinical sites against the outcomes of intensive care unit (ICU) admission and/or death. We also piloted an alternative machine learning approach and compared selected predictors of severity with the 4CE phenotype at 1 site. Results: The full 4CE severity phenotype had pooled sensitivity of 0.73 and specificity 0.83 for the combined outcome of ICU admission and/or death. The sensitivity of individual code categories for acuity had high variability, up to 0.65 across sites. At one pilot site, the expert-derived phenotype had mean area under the curve of 0.903 (95% confidence interval, 0.886-0.921), compared with an area under the curve of 0.956 (95% confidence interval, 0.952-0.959) for the machine learning approach. Billing codes were poor proxies of ICU admission, with as low as 49% precision and recall compared with chart review. Discussion: We developed a severity phenotype using 6 code classes that proved resilient to coding variability across international institutions. In contrast, machine learning approaches may overfit hospital-specific orders. Manual chart review revealed discrepancies even in the gold-standard outcomes, possibly owing to heterogeneous pandemic conditions. Conclusions: We developed an EHR-based severity phenotype for COVID-19 in hospitalized patients and validated it at 12 international sites.

eScholarship - University of California

International comparisons of laboratory values from the 4CE collaborative to predict COVID-19 mortality.

Author: Agapito Giuseppe
Alessiani Mario
Aronow Bruce J
Avillach Paul
Balazote Pablo Serrano
Barrio Noelia García
Beaulieu-Jones Brett K
Bell Douglas S
Bellazzi Riccardo
Benoit Vincent
Bonzel Clara-Lea
Bourgeois Florence T
Brat Gabriel A
Cai Tianxi
Cannataro Mario
Chiovato Luca
Cho Kelly
Consortium for Clinical Characterization of COVID-19 by EHR (4CE)
Dagliati Arianna
DuVall Scott L
Gutiérrez-Sacristán Alba
Hanauer David A
Ho Yuk-Lam
Holmes John H
Hong Chuan
Issitt Richard W
Keller Mark S
Klann Jeffrey G
Kohane Isaac S
L'Yi Sehi
Liu Molei
Loh Ne Hooi Will
Luo Yuan
Lynch Kristine E
Maidlow Sarah E
Malovini Alberto
Mandl Kenneth D
Mao Chengsheng
Matheny Michael E
Moore Jason H
Morris Jeffrey S
Morris Michele
Mowery Danielle L
Murphy Shawn N
Neuraz Antoine
Ngiam Kee Yuan
Omenn Gilbert S
Palmer Nathan P
Patel Lav P
Pedrera-Jimenez Miguel
Ramoni Rachel B
Schriver Emily R
Schubert Petra
Serret-Larmande Arnaud
South Andrew M
Spiridou Anastasia
Tan Amelia LM
Tan Byorn WL
Tibollo Valentina
Torti Carlo
Trecarichi Enrico M
Visweswaran Shyam
Wang Xuan
Weber Griffin M
Xia Zongqi
Publication venue: eScholarship, University of California
Publication date: 01/01/2022

Given the growing number of prediction algorithms developed to predict COVID-19 mortality, we evaluated the transportability of a mortality prediction algorithm using a multi-national network of healthcare systems. We predicted COVID-19 mortality using baseline commonly measured laboratory values and standard demographic and clinical covariates across healthcare systems, countries, and continents. Specifically, we trained a Cox regression model with nine measured laboratory test values, standard demographics at admission, and comorbidity burden pre-admission. These models were compared at site, country, and continent level. Of the 39,969 hospitalized patients with COVID-19 (68.6% male), 5717 (14.3%) died. In the Cox model, age, albumin, AST, creatine, CRP, and white blood cell count are most predictive of mortality. The baseline covariates are more predictive of mortality during the early days of COVID-19 hospitalization. Models trained at healthcare systems with larger cohort size largely retain good transportability performance when porting to different sites. The combination of routine laboratory test values at admission along with basic demographic features can predict mortality in patients hospitalized with COVID-19. Importantly, this potentially deployable model differs from prior work by demonstrating not only consistent performance but also reliable transportability across healthcare systems in the US and Europe, highlighting the generalizability of this model and the overall approach

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

International Comparisons of Harmonized Laboratory Value Trajectories to Predict Severe COVID-19: Leveraging the 4CE Collaborative Across 342 Hospitals and 6 Countries: A Retrospective Cohort Study

Author: Agapito Giuseppe
Alessiani Mario
Aronow Bruce J
Avillach Paul
Beaulieu-Jones Brett K
Bell Douglas S
Bellasi Antonio
Bellazzi Riccardo
Benoit Vincent
Beraghi Michele
Boeker Martin
Booth John
Bosari Silvano
Bourgeois Florence T
Brat Gabriel A
Brown Nicholas W
Bucalo Mauro
Cai Tianxi
Cannataro Mario
Chiovato Luca
Chiudinelli Lorenzo
Consortium for Clinical Characterization of COVID-19 by EHR (4CE)
Dagliati Arianna
Devkota Batsal
DuVall Scott L
Follett Robert W
Ganslandt Thomas
García Barrio Noelia
Gradinger Tobias
Griffier Romain
Gutiérrez-Sacristán Alba
Hanauer David A
Holmes John H
Hong Chuan
Horki Petar
Huling Kenneth M
Issitt Richard W
Jouhet Vianney
Keller Mark S
Klann Jeffrey G
Kohane Isaac S
Kraska Detlef
Liu Molei
Loh Ne Hooi Will
Luo Yuan
Lynch Kristine E
Malovini Alberto
Mandl Kenneth D
Mao Chengsheng
Maram Anupama
Matheny Michael E
Maulhardt Thomas
Mazzitelli Maria
Milano Marianna
Moore Jason H
Morris Jeffrey S
Morris Michele
Mowery Danielle L
Murphy Shawn N
Naughton Thomas P
Neuraz Antoine
Ngiam Kee Yuan
Norman James B
Omenn Gilbert S
Palmer Nathan P
Patel Lav P
Pedrera Jimenez Miguel
Ramoni Rachel B
Schriver Emily R
Scudeller Luigia
Sebire Neil J
Serrano Balazote Pablo
Serret-Larmande Arnaud
South Andrew M
Spiridou Anastasia
Tan Amelia Lm
Tan Byorn Wl
Tibollo Valentina
Torti Carlo
Trecarichi Enrico M
Visweswaran Shyam
Vitacca Michele
Weber Griffin M
Xia Zongqi
Zambelli Alberto
Zucco Chiara
Publication date: 05/02/2021

To perform an international comparison of the trajectory of laboratory values among hospitalized patients with COVID-19 who develop severe disease and identify optimal timing of laboratory value collection to predict severity across hospitals and regions. Retrospective cohort study. The Consortium for Clinical Characterization of COVID-19 by EHR (4CE), an international multi-site data-sharing collaborative of 342 hospitals in the US and in Europe. Patients hospitalized with COVID-19, admitted before or after PCR-confirmed result for SARS-CoV-2. Primary and secondary outcome measures: Patients were categorized as "ever-severe" or "never-severe" using the validated 4CE severity criteria. Eighteen laboratory tests associated with poor COVID-19-related outcomes were evaluated for predictive accuracy by area under the curve (AUC), compared between the severity categories. Subgroup analysis was performed to validate a subset of laboratory values as predictive of severity against a published algorithm. A subset of laboratory values (CRP, albumin, LDH, neutrophil count, D-dimer, and procalcitonin) was compared between North American and European sites for severity prediction. Of 36,447 patients with COVID-19, 19,953 (43.7%) were categorized as ever-severe. Most patients (78.7%) were 50 years of age or older and male (60.5%). Longitudinal trajectories of CRP, albumin, LDH, neutrophil count, D-dimer, and procalcitonin showed association with disease severity. Significant differences of laboratory values at admission were found between the two groups. With the exception of D-dimer, predictive discrimination of laboratory values did not improve after admission. Sub-group analysis using age, D-dimer, CRP, and lymphocyte count as predictive of severity at admission showed similar discrimination to a published algorithm (AUC=0.88 and 0.91, respectively). Both models deteriorated in predictive accuracy as the disease progressed. On average, no difference in severity prediction was found between North American and European sites. Laboratory test values at admission can be used to predict severity in patients with COVID-19. Prediction models show consistency across international sites, highlighting the potential generalizability of these models.

University of Miami: Scholarship Miami

What every reader should know about studies using electronic health record data but may be afraid to ask

Author: Albayrak A.
Amendola D.F.
Anthony L.L.L.J.
Aronow Bruce J.
Atz A.
Avillach Paul
Balazote P.S.
Balshi J.
Barrio N.G.
Beaulieu-Jones Brett K.
Bell D.S.
Bellasi A.
Bellazzi Riccardo
Benoit V.
Beraghi M.
Bermúdez J.L.C.
Bernaux M.
Bey R.
Boeker M.
Bonzel C.-L.
Booth J.
Bosari S.
Bourgeois F.T.
Bradford Robert L.
Brat Gabriel A.
Bréant S.
Bucalo M.
Burgun A.
Cai Tianxi
Cannataro Mario
Cao A.
Carmona A.
Caucheteux C.
Champ J.
Chiovato L.
Cimino James J.
Colicchio T.K.
Cormont S.
Cossin S.
Craig J.
Dagliati A.
Daniar Mohamad
Daniel C.
Davoudi A.
Devkota B.
Domínguez G.R.
Dubiel J.
DuVall S.L.
Esteve L.
Fan S.
Follett R.W.
Gaiolla P.S.A.
Ganslandt T.
García-Barrio N.
Gehlenborg Nils
Geva A.
Ghassemi Marzyeh
Gradinger T.
Gramfort A.
Griffier R.
Griffon N.
Grisel O.
Gutiérrez-Sacristán A.
Hanauer David A.
Haverkamp C.
Hilka M.
Holmes John H.
Hong Chuan
Horki P.
Hutch M.R.
Issitt R.
Jannot A.S.
Jimenez M.P.
Jouhet V.
Keller M.S.
Kirchoff K.
Klann Jeffrey G.
Kohane Isaac S.
Krantz I.D.
Kraska D.
Krishnamurthy A.K.
L'Yi S.
Le T.T.
Leblanc J.
Lemaitre G.
Lenert L.
Leprovost D.
Liu M.
Loh Ne Hooi Will
Luo Yuan
Lynch K.E.
Mahmood S.
Maidlow S.
Malovini A.
Mandl Kenneth D.
Mao C.
Martel P.
Martínez A.B.
Masino A.J.
Matheny M.E.
Maulhardt T.
McDuffie M.T.
Mensch A.
Minicucci M.F.
Moal B.
Moore Jason H.
Morris J.S.
Morris M.
Moshal K.L.
Mousavi S.
Mowery D.L.
Murad D.A.
Murphy Shawn N.
Neuraz Antoine
Ngiam Kee Yuan
Obeid J.
Okoshi M.P.
Olson K.L.
Omenn Gilbert S.
Orlova N.
Ostasiewski B.D.
Palmer Nathan
Paris N.
Patel Lav P.
Pedrera-Jiménez M.
Prokosch H.U.
Prudente R.A.
Ramoni R.B.
Raskin M.
Rieg S.
Salamanca E.
Samayamuthu M.J.
Sandrin A.
Schiver E.
Schuettler J.
Scudeller L.
Sebire N.
Serre P.
Serret-Larmande A.
Silvio D.
Sliz Piotr
Sobrino J.L.B.
Son J.
Sonday C.
South Andrew M.
Spiridou A.
Tan Amelia Li Min
Tan B.W.L.
Tan B.W.Q.
Tanni S.E.
Taylor Bradley W.
Taylor Deanne M.
The Consortium For Clinical Characterization Of COVID-19 By EHR (4CE).
Tibollo V.
Tippmann P.
Torti Carlo
Vallejos Andrew K.
Varoquaux G.
Vie J.-J.
Visweswaran S.
Wagholikar Kavishwar B.
Waitman L.R.
Wassermann D.
Weber Griffin M.
William Y.
Xia Z.
Zambelli A.
Publication venue: 'JMIR Publications Inc.'
Publication date: 02/03/2021

DOI: 10.2196/22219. Journal of Medical Internet Research 23(3):e2221

ScholarBank@NUS