
    Data quality considerations for evaluating COVID-19 treatments using real world data: learnings from the National COVID Cohort Collaborative (N3C)

    Background: Multi-institution electronic health records (EHR) are a rich source of real-world data (RWD) for generating real-world evidence (RWE) about the utilization, benefits, and harms of medical interventions. They provide access to clinical data from large pooled patient populations, including laboratory measurements unavailable in insurance claims-based data. However, secondary use of these data for research requires specialized knowledge and careful evaluation of data quality and completeness. We discuss data quality assessments undertaken during prep-to-research, focusing on the investigation of treatment safety and effectiveness.

    Methods: Using the National COVID Cohort Collaborative (N3C) enclave, we defined a patient population using criteria typical of non-interventional inpatient drug effectiveness studies. We present the challenges encountered when constructing this dataset, beginning with an examination of data quality across data partners. We then discuss the methods and best practices used to operationalize several important study elements: exposure to treatment, baseline health comorbidities, and key outcomes of interest.

    Results: We share our experiences and lessons learned from working with heterogeneous EHR data from more than 65 healthcare institutions and four common data models. We discuss six key areas of data variability and quality: (1) the specific EHR data elements captured by a site can vary depending on the source data model and local practice; (2) data missingness remains a significant issue; (3) drug exposures can be recorded at different levels and may not contain route of administration or dosage information; (4) reconstruction of continuous drug exposure intervals may not always be possible; (5) EHR discontinuity is a major concern for capturing history of prior treatment and comorbidities; and (6) access to EHR data alone limits the potential outcomes that can be used in studies.

    Conclusions: The creation of large-scale, centralized, multi-site EHR databases such as N3C enables a wide range of research aimed at better understanding treatments and the health impacts of many conditions, including COVID-19. As with all observational research, it is important that research teams engage with appropriate domain experts to understand the data, in order to define research questions that are both clinically important and feasible to address using these real-world data.
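
    The reconstruction of continuous drug exposure intervals mentioned in the Results can be illustrated with a minimal sketch. This is not the N3C study's code: the column names, data layout, and one-day gap tolerance are assumptions chosen for illustration only.

        # Hypothetical sketch: collapsing per-record drug exposures into continuous
        # exposure episodes, allowing a small gap between records. Column names
        # (person_id, drug_start, drug_end) and the 1-day gap are assumptions,
        # not taken from the N3C study.
        from datetime import timedelta
        import pandas as pd

        def collapse_exposures(df: pd.DataFrame, max_gap_days: int = 1) -> pd.DataFrame:
            """Merge overlapping or near-adjacent exposure records per patient."""
            episodes = []
            for person_id, grp in df.sort_values("drug_start").groupby("person_id"):
                start, end = None, None
                for _, row in grp.iterrows():
                    if start is None:
                        start, end = row["drug_start"], row["drug_end"]
                    elif row["drug_start"] <= end + timedelta(days=max_gap_days):
                        end = max(end, row["drug_end"])  # extend the current episode
                    else:
                        episodes.append((person_id, start, end))
                        start, end = row["drug_start"], row["drug_end"]
                if start is not None:
                    episodes.append((person_id, start, end))
            return pd.DataFrame(episodes, columns=["person_id", "episode_start", "episode_end"])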

    A method for comparing multiple imputation techniques: A case study on the U.S. national COVID cohort collaborative

    Healthcare datasets obtained from Electronic Health Records have proven extremely useful for assessing associations between patient predictors and outcomes of interest. However, these datasets often suffer from missing values in a high proportion of cases, and removing those cases may introduce severe bias. Several multiple imputation algorithms have been proposed to recover the missing information under an assumed missingness mechanism. Each algorithm has strengths and weaknesses, and there is currently no consensus on which multiple imputation algorithm works best in a given scenario. Furthermore, selecting each algorithm's parameters and the related data-modeling choices is both crucial and challenging. In this paper we propose a novel framework to numerically evaluate strategies for handling missing data in the context of statistical analysis, with a particular focus on multiple imputation techniques. We demonstrate the feasibility of our approach on a large cohort of type-2 diabetes patients provided by the National COVID Cohort Collaborative (N3C) Enclave, where we explored the influence of various patient characteristics on COVID-19-related outcomes. Our analysis included classic multiple imputation techniques as well as simple complete-case inverse probability weighted models. Extensive experiments show that our approach can effectively highlight the most promising and best-performing missing-data handling strategy for our case study. Moreover, our methodology allowed a better understanding of how the different models behave and how that behavior changes as their parameters are modified. Our method is general and can be applied to different research fields and to datasets containing heterogeneous data types.
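
    As an illustration of the general evaluation idea (not the paper's exact framework), one can artificially mask a fraction of observed entries, apply several imputation strategies, and compare how well each recovers the hidden values. The dataset, masking fraction, imputer choices, and error metric below are placeholders; the paper evaluates strategies against downstream statistical analyses rather than raw reconstruction error.

        # Minimal sketch, under assumed placeholder data: mask observed entries,
        # impute with several strategies, and score recovery of the masked values.
        import numpy as np
        from sklearn.experimental import enable_iterative_imputer  # noqa: F401
        from sklearn.impute import SimpleImputer, KNNImputer, IterativeImputer

        rng = np.random.default_rng(0)
        X_true = rng.normal(size=(500, 6))        # placeholder complete data
        mask = rng.random(X_true.shape) < 0.2     # hide ~20% of entries
        X_miss = np.where(mask, np.nan, X_true)

        imputers = {
            "mean": SimpleImputer(strategy="mean"),
            "knn": KNNImputer(n_neighbors=5),
            "iterative": IterativeImputer(random_state=0),
        }
        for name, imputer in imputers.items():
            X_hat = imputer.fit_transform(X_miss)
            rmse = np.sqrt(np.mean((X_hat[mask] - X_true[mask]) ** 2))
            print(f"{name}: RMSE on masked entries = {rmse:.3f}")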

    Predicting cyanobacteria bloom occurrence in lakes and reservoirs before blooms occur

    With increased global warming, cyanobacteria are blooming more frequently in lakes and reservoirs, severely damaging the health and stability of aquatic ecosystems and threatening drinking water safety and human health. There is an urgent demand for the effective prediction and prevention of cyanobacterial blooms. However, it is difficult to reduce the risks and losses caused by cyanobacterial blooms because most methods cannot predict them successfully. Therefore, in this study we propose a new method for predicting cyanobacterial bloom occurrence, analyzing the probability of blooms and their driving factors for effective prevention and control. Dominant cyanobacterial species capable of forming blooms were first determined using a dominant species identification model, and the principal driving factors of the dominant species were then analyzed using canonical correspondence analysis (CCA). Cyanobacterial bloom probability was calculated using a newly developed model, after which the probable change (mutation) points were identified and thresholds for the principal driving factors of cyanobacterial blooms were predicted. A total of 141 phytoplankton data sets from 90 stations, collected during six large-scale integrated hydrology, water quality, and ecology field surveys in Jinan City, China, in 2014–2015, were used for model application and verification. The results showed that there were six dominant cyanobacterial species in the study area, and that the principal driving factors were water temperature, pH, total phosphorus, ammonia nitrogen, chemical oxygen demand, and dissolved oxygen. Cyanobacterial blooms corresponded to threshold ranges of water temperature, pH, total phosphorus (TP), ammonia nitrogen, chemical oxygen demand, and dissolved oxygen of 19.5–32.5 °C, 7.0–9.38, 0.13–0.22 mg L−1, 0.38–0.63 mg L−1, 10.5–17.5 mg L−1, and 4.97–8.28 mg L−1, respectively. Comparison with research results from other global regions further supported these thresholds, indicating that the method could be used in habitats beyond China. We found that a cyanobacterial bloom probability of 0.75 was a critical point for prevention and control: when this critical point was exceeded, cyanobacteria could proliferate rapidly, increasing the risk of blooms. Changes in the driving factors then need to be controlled rapidly, based on these thresholds, to prevent cyanobacterial blooms. Temporal and spatial scales were critical factors potentially affecting the selection of driving factors. This method is versatile and can help determine the risk of cyanobacterial blooms and the thresholds of the principal driving factors. It can effectively predict and help prevent cyanobacterial blooms, reducing their probability of occurrence globally, protecting the health and stability of water ecosystems, ensuring drinking water safety, and protecting human health.
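
    One common way to operationalize a dominant-species screen is the McNaughton dominance index, Y = (n_i / N) × f_i, with species flagged as dominant when Y ≥ 0.02. The study's own identification model may differ, and the data layout in the sketch below is an assumption.

        # Hedged sketch of a dominance-index screen (McNaughton index); not
        # necessarily the study's exact dominant species identification model.
        import pandas as pd

        def dominant_species(counts: pd.DataFrame, threshold: float = 0.02) -> pd.Series:
            """counts: rows = sampling stations, columns = species cell densities."""
            rel_abundance = counts.sum(axis=0) / counts.values.sum()   # n_i / N
            occurrence_freq = (counts > 0).mean(axis=0)                # f_i
            dominance = rel_abundance * occurrence_freq                # Y
            return dominance[dominance >= threshold].sort_values(ascending=False)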

    Scaling of divertor power footprint width in RF-heated type-III ELMy H-mode on the EAST superconducting tokamak

    Dedicated experiments on the scaling of the divertor power footprint width have been performed in the ITER-relevant radio-frequency (RF)-heated H-mode scheme under lower single null, double null and upper single null divertor configurations in the Experimental Advanced Superconducting Tokamak (EAST) with lithium wall-coating conditioning. A strong inverse scaling of the edge-localized-mode (ELM)-averaged power fall-off width with the plasma current (equivalently the poloidal field), $\lambda_q \propto I_{\rm p}^{-1.05}$, has been demonstrated for the attached type-III ELMy H-mode by various heat flux diagnostics, including the divertor Langmuir probes (LPs), infrared (IR) thermography and reciprocating LPs on the low-field side. The IR camera and divertor LP measurements show that $\lambda_{q,{\rm IR}} \approx \lambda_{q,{\rm div\text{-}LPs}}/1.3 = 1.15\,B_{\rm p,omp}^{-1.25}$, in good agreement with the multi-machine scaling trend for the inter-ELM phase between type-I ELMs or the ELM-free enhanced Dα (EDA) H-mode. However, the magnitude is nearly doubled, which may be attributed to the different operation scenarios or heating schemes in EAST, i.e., dominated by electron heating. It is also shown that type-III ELMs broaden the power fall-off width only slightly, so the ELM-averaged width is representative of the inter-ELM period. Furthermore, the inverse $I_{\rm p}$ ($B_{\rm p}$) scaling appears to be independent of the divertor configuration in EAST. The divertor power footprint integral width, fall-off width and dissipation width derived from EAST IR camera measurements follow the relation $\lambda_{\rm int} \cong \lambda_q + 1.64S$, yielding $\lambda_{\rm int}^{\rm EAST} = (1.39 \pm 0.03)\lambda_q^{\rm EAST} + (0.97 \pm 0.35)$ mm. Detailed analysis of these three characteristic widths was carried out to shed more light on their extrapolation to ITER.
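
    The inverse current scaling reported above is typically obtained from a power-law fit to paired measurements; the sketch below shows only the generic log-log regression step, using synthetic placeholder values rather than EAST data.

        # Illustrative sketch: extract an exponent alpha from lambda_q ∝ I_p^alpha
        # via a log-log linear fit. Values below are synthetic placeholders.
        import numpy as np

        I_p = np.array([0.25, 0.30, 0.40, 0.45, 0.50, 0.60])   # plasma current [MA]
        lam_q = np.array([11.8, 10.2, 7.4, 6.8, 6.1, 5.0])      # fall-off width [mm]

        alpha, log_c = np.polyfit(np.log(I_p), np.log(lam_q), deg=1)
        print(f"lambda_q ≈ {np.exp(log_c):.2f} * I_p^{alpha:.2f}")  # exponent near -1 here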