1,736 research outputs found
Electronic health record data quality assessment and tools: A systematic review
OBJECTIVE: We extended a 2013 literature review on electronic health record (EHR) data quality assessment approaches and tools to determine recent improvements or changes in EHR data quality assessment methodologies.
MATERIALS AND METHODS: We completed a systematic review of PubMed articles from 2013 to April 2023 that discussed the quality assessment of EHR data. We screened and reviewed papers for the dimensions and methods defined in the original 2013 manuscript. We categorized papers as data quality outcomes of interest, tools, or opinion pieces. We abstracted and defined additional themes and methods through an iterative review process.
RESULTS: We included 103 papers in the review, of which 73 were data quality outcomes of interest papers, 22 were tools, and 8 were opinion pieces. The most common dimension of data quality assessed was completeness, followed by correctness, concordance, plausibility, and currency. We abstracted conformance and bias as 2 additional dimensions of data quality and structural agreement as an additional methodology.
DISCUSSION: There has been an increase in EHR data quality assessment publications since the original 2013 review. Consistent dimensions of EHR data quality continue to be assessed across applications. Despite consistent patterns of assessment, there still does not exist a standard approach for assessing EHR data quality.
CONCLUSION: Guidelines are needed for EHR data quality assessment to improve the efficiency, transparency, comparability, and interoperability of data quality assessment. These guidelines must be both scalable and flexible. Automation could be helpful in generalizing this process.
Leveraging GPT-4 for identifying cancer phenotypes in electronic health records: A performance comparison between GPT-4, GPT-3.5-turbo, Flan-T5, Llama-3-8B, and spaCy's rule-based and machine learning-based methods
OBJECTIVE: Accurately identifying clinical phenotypes from Electronic Health Records (EHRs) provides additional insights into patients' health, especially when such information is unavailable in structured data. This study evaluates the application of OpenAI's Generative Pre-trained Transformer (GPT)-4 model to identify clinical phenotypes from EHR text in non-small cell lung cancer (NSCLC) patients. The goal was to identify disease stages, treatments and progression utilizing GPT-4, and compare its performance against GPT-3.5-turbo, Flan-T5-xl, Flan-T5-xxl, Llama-3-8B, and 2 rule-based and machine learning-based methods, namely, scispaCy and medspaCy.
MATERIALS AND METHODS: Phenotypes such as initial cancer stage, initial treatment, evidence of cancer recurrence, and affected organs during recurrence were identified from 13 646 clinical notes for 63 NSCLC patients from Washington University in St. Louis, Missouri. The performance of the GPT-4 model is evaluated against GPT-3.5-turbo, Flan-T5-xxl, Flan-T5-xl, Llama-3-8B, medspaCy, and scispaCy by comparing precision, recall, and micro-F1 scores.
RESULTS: GPT-4 achieved higher F1 score, precision, and recall than Flan-T5-xl, Flan-T5-xxl, Llama-3-8B, medspaCy, and scispaCy's models. GPT-3.5-turbo performed similarly to GPT-4. GPT, Flan-T5, and Llama models were not constrained by explicit rule requirements for contextual pattern recognition. spaCy models relied on predefined patterns, leading to their suboptimal performance.
DISCUSSION AND CONCLUSION: GPT-4 improves clinical phenotype identification due to its robust pre-training and remarkable pattern recognition capability on the embedded tokens. It demonstrates data-driven effectiveness even with limited context in the input. While rule-based models remain useful for some tasks, GPT models offer improved contextual understanding of the text, and robust clinical phenotype extraction.
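The abstract above compares models by precision, recall, and micro-F1. As a minimal sketch of how micro-averaged scores are typically computed (pooling true positives, false positives, and false negatives over all notes before dividing), the following uses hypothetical label strings and toy data; it is not the study's evaluation code.

```python
def micro_prf(gold, pred):
    """Micro-averaged precision, recall, and F1 over per-note label sets.

    gold, pred: equal-length sequences of label collections, one per note.
    Counts are pooled across all notes before the ratios are taken.
    """
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        g, p = set(g), set(p)
        tp += len(g & p)   # labels predicted and present in the gold set
        fp += len(p - g)   # labels predicted but absent from the gold set
        fn += len(g - p)   # gold labels the model missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical toy annotations (label names are illustrative only).
gold = [{"stage:IIIA", "treatment:chemo"}, {"recurrence:yes", "organ:brain"}]
pred = [{"stage:IIIA"}, {"recurrence:yes", "organ:liver"}]

precision, recall, f1 = micro_prf(gold, pred)
print(f"precision={precision:.3f} recall={recall:.3f} micro-F1={f1:.3f}")
```

Because the counts are pooled, phenotypes that occur often in the notes weigh more heavily in the score than rare ones; a macro average would instead weight each phenotype equally.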
Association between socioeconomic factors, race, and use of a specialty memory clinic
BACKGROUND AND OBJECTIVES: The capacity of specialty memory clinics in the United States is very limited. If lower socioeconomic status or minoritized racial group is associated with reduced use of memory clinics, this could exacerbate health care disparities, especially if more effective treatments of Alzheimer disease become available. We aimed to understand how use of a memory clinic is associated with neighborhood-level measures of socioeconomic factors and the intersectionality of race.
METHODS: We conducted an observational cross-sectional study using electronic health record data to compare the neighborhood advantage of patients seen at the Washington University Memory Diagnostic Center with the catchment area using a geographical information system. Furthermore, we compared the severity of dementia at the initial visit between patients who self-identified as Black or White. We used a multinomial logistic regression model to assess the Clinical Dementia Rating at the initial visit and
RESULTS: A total of 4,824 patients seen at the memory clinic between 2008 and 2018 were included in this study (mean age 72.7 [SD 11.0] years, 2,712 [56%] female, 543 [11%] Black). Most of the memory clinic patients lived in more advantaged neighborhoods within the overall catchment area. The percentage of patients self-identifying as Black (11%) was lower than the average percentage of Black individuals by census tract in the catchment area (16%) (
DISCUSSION: This study demonstrates that patients living in less affluent neighborhoods were less likely to be seen in one large memory clinic. Black patients were under-represented in the clinic, and Black patients had more severe dementia at their initial visit. These findings suggest that patients with a lower socioeconomic status and who identify as Black are less likely to be seen in memory clinics, which are likely to be a major point of access for any new Alzheimer disease treatments that may become available.
Predicting language diversity with complex network
Evolution and propagation of the world's languages is a complex phenomenon,
driven, to a large extent, by social interactions. Multilingual society can be
seen as a system of interacting agents, where the interaction leads to a
modification of the language spoken by the individuals. Two people can reach
the state of full linguistic compatibility due to the positive interactions,
like transfer of loanwords. But, on the other hand, if they speak entirely
different languages, they will separate from each other. These simple
observations make network science the most suitable framework to describe
and analyze dynamics of language change. Although many mechanisms have been
explained, we lack a qualitative description of the scaling behavior for
different sizes of a population. Here we address the issue of the language
diversity in societies of different sizes, and we show that local interactions
are crucial to capture characteristics of the empirical data. We propose a
model of social interactions, extending the idea from, that explains the growth
of the language diversity with the size of the population of a country or society.
We argue that high clustering and network disintegration are the most important
characteristics of models properly describing empirical data. Furthermore, we
resolve the contradiction between previous models and the Solomon Islands case.
Our results demonstrate the importance of the topology of the network, and the
rewiring mechanism in the process of language change.
Active/Passive, ‘Diminished’/‘Beautiful’, ‘Light’ from Above and Below: Rereading Shekhinah’s Sexual Desire in Zohar al Shir ha-Shirim (Song of Songs)
In Zohar al Shir ha-Shirim, the Zohar’s reading of Song of Songs, Shekhinah, echoing themes associated with the Shulamite of the biblical text, consistently initiates cosmic union. Sexual desire in the zoharic texts is a form of capital necessary to facilitate sefirotic intercourse, although scholarly readings of the zoharic corpus often identify Shekhinah as a passive receptacle. This, however, is only true if the endemic contradictions within the texts are glossed over. In Song of Songs, the Shulamite’s sexual ‘initiative’ is core. This was not lost on the author(s) of Zohar al Shir ha-Shirim, who, in struggling to explain Shekhinah’s sefirotic role in line with the erotics of Song of Songs, inescapably echoed the ‘depatriarchalizing’ themes of the biblical text. As this article demonstrates, in Zohar al Shir ha-Shirim, Shekhinah is active and repeatedly encourages and frustrates cosmic sexual intercourse. Zohar al Shir ha-Shirim shows that it is possible to reread Shekhinah’s role beyond the androcentrism of the authors as well as scholarly assumptions about her passivity.
Measurement of the quasi-elastic axial vector mass in neutrino-oxygen interactions
The weak nucleon axial-vector form factor for quasi-elastic interactions is
determined using neutrino interaction data from the K2K Scintillating Fiber
detector in the neutrino beam at KEK. More than 12,000 events are analyzed, of
which half are charged-current quasi-elastic interactions \nu_\mu n \to \mu^- p
occurring primarily in oxygen nuclei. We use a relativistic Fermi gas model for
oxygen and assume the form factor is approximately a dipole with one parameter,
the axial vector mass M_A, and fit to the shape of the distribution of the
square of the momentum transfer from the nucleon to the nucleus. Our best-fit
result is M_A = 1.20 \pm 0.12 GeV. Furthermore, this analysis includes updated
vector form factors from recent electron scattering experiments and a
discussion of the effects of the nucleon momentum on the shape of the fitted
distributions.
Comment: 14 pages, 10 figures, 6 tables
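For reference, the one-parameter dipole form mentioned in the abstract is conventionally written as below. This is the standard textbook parameterization, given here as a sketch; the symbol F_A(0) for the axial coupling at zero momentum transfer is a notational assumption, not taken from the abstract.

```latex
% Dipole parameterization of the axial-vector form factor.
% Q^2 is the squared four-momentum transfer and M_A the axial vector mass,
% the single shape parameter that is fitted.
F_A(Q^2) = \frac{F_A(0)}{\left(1 + \dfrac{Q^2}{M_A^2}\right)^{2}}
```

A larger M_A makes the form factor fall off more slowly with Q^2, which is why a fit to the shape of the Q^2 distribution alone can constrain it.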
Measurement of the Branching Fraction for B- --> D0 K*-
We present a measurement of the branching fraction for the decay B- --> D0
K*- using a sample of approximately 86 million BBbar pairs collected by the
BaBar detector from e+e- collisions near the Y(4S) resonance. The D0 is
detected through its decays to K- pi+, K- pi+ pi0 and K- pi+ pi- pi+, and the
K*- through its decay to K0S pi-. We measure the branching fraction to be
B.F.(B- --> D0 K*-) = (6.3 +/- 0.7(stat.) +/- 0.5(syst.)) x 10^{-4}.
Comment: 7 pages, 1 postscript figure, submitted to Phys. Rev. D (Rapid
Communications)
A Study of Time-Dependent CP-Violating Asymmetries and Flavor Oscillations in Neutral B Decays at the Upsilon(4S)
We present a measurement of time-dependent CP-violating asymmetries in
neutral B meson decays collected with the BABAR detector at the PEP-II
asymmetric-energy B Factory at the Stanford Linear Accelerator Center. The data
sample consists of 29.7 […] recorded at the Upsilon(4S)
resonance and 3.9 […] off-resonance. One of the neutral B mesons,
which are produced in pairs at the Upsilon(4S), is fully reconstructed in
the CP decay modes […] or in flavor-eigenstate
modes […]. The flavor of the other neutral B meson is tagged at the time of
its decay, mainly with the charge of identified leptons and kaons. The proper
time elapsed between the decays is determined by measuring the distance between
the decay vertices. A maximum-likelihood fit to this flavor eigenstate sample
finds […]. The value of the asymmetry amplitude is determined from
a simultaneous maximum-likelihood fit to the time-difference distribution of
the flavor-eigenstate sample and about 642 tagged decays in the
CP-eigenstate modes. We find […], demonstrating that CP violation exists in the neutral B meson
system. (abridged)
Comment: 58 pages, 35 figures, submitted to Physical Review
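The step from vertex separation to elapsed proper time, which the abstract describes only in words, is commonly approximated as follows. This is a sketch under stated assumptions: the B pair is taken to be boosted along the beam axis z with a known boost \beta\gamma, and the small motion of the B mesons in the Upsilon(4S) rest frame is neglected. The symbols \Delta z and \beta\gamma are introduced here for illustration and do not appear in the abstract.

```latex
% Proper-time difference from the separation of the two decay vertices
% along the boost direction z (assumes a known boost \beta\gamma of the
% Upsilon(4S) in the laboratory frame).
\Delta t \simeq \frac{\Delta z}{\beta\gamma\, c}
```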
Measurement of Branching Fraction and Dalitz Distribution for B0->D(*)+/- K0 pi-/+ Decays
We present measurements of the branching fractions for the three-body decays
B0 -> D(*)-/+ K0 pi+/- and the resonant modes B0 -> D(*)-/+ K*+/- using
a sample of approximately 88 million BBbar pairs collected by the BABAR
detector at the PEP-II asymmetric energy storage ring.
We measure:
B(B0 -> D-/+ K0 pi+/-) = (4.9 +/- 0.7(stat) +/- 0.5(syst)) x 10^{-4}
B(B0 -> D*-/+ K0 pi+/-) = (3.0 +/- 0.7(stat) +/- 0.3(syst)) x 10^{-4}
B(B0 -> D-/+ K*+/-) = (4.6 +/- 0.6(stat) +/- 0.5(syst)) x 10^{-4}
B(B0 -> D*-/+ K*+/-) = (3.2 +/- 0.6(stat) +/- 0.3(syst)) x 10^{-4}
From these measurements we determine the fractions of resonant events to be:
f(B0 -> D-/+ K*+/-) = 0.63 +/- 0.08(stat) +/- 0.04(syst)
f(B0 -> D*-/+ K*+/-) = 0.72 +/- 0.14(stat) +/- 0.05(syst)
Comment: 7 pages, 3 figures, submitted to Phys. Rev. Lett.
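The resonant fractions quoted above can be cross-checked approximately from the central values of the branching fractions. The sketch below assumes, which the abstract does not state, that the fraction is the two-body branching fraction times B(K*+ -> K0 pi+) divided by the three-body branching fraction, with B(K*+ -> K0 pi+) = 2/3 from isospin. The function and constant names are hypothetical, and the analysis itself may have obtained the fractions differently (for example from a fit to the Dalitz distribution).

```python
# Central values from the abstract, in units of 10^-4.
BF_D_K0_PI = 4.9       # B(B0 -> D-/+ K0 pi+/-)
BF_DSTAR_K0_PI = 3.0   # B(B0 -> D*-/+ K0 pi+/-)
BF_D_KSTAR = 4.6       # B(B0 -> D-/+ K*+/-)
BF_DSTAR_KSTAR = 3.2   # B(B0 -> D*-/+ K*+/-)

# Assumption (not in the abstract): isospin gives B(K*+ -> K0 pi+) = 2/3.
KSTAR_TO_K0_PI = 2.0 / 3.0


def resonant_fraction(bf_two_body, bf_three_body, sub_bf=KSTAR_TO_K0_PI):
    """Share of the three-body final state that proceeds through the K*."""
    return bf_two_body * sub_bf / bf_three_body


f_d = resonant_fraction(BF_D_KSTAR, BF_D_K0_PI)
f_dstar = resonant_fraction(BF_DSTAR_KSTAR, BF_DSTAR_K0_PI)
print(f"f(D K*)  ~ {f_d:.2f}")
print(f"f(D* K*) ~ {f_dstar:.2f}")
```

Under these assumptions the estimates land close to the reported 0.63 and 0.72; small differences are expected because the inputs are rounded and the uncertainties are not propagated here.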
Measurement of the B+ --> p pbar K+ Branching Fraction and Study of the Decay Dynamics
With a sample of 232x10^6 Upsilon(4S) --> BBbar events collected with the
BaBar detector, we study the decay B+ --> p pbar K+ excluding charmonium decays
to ppbar. We measure a branching fraction Br(B+ --> p pbar
K+)=(6.7+/-0.5+/-0.4)x10^{-6}. An enhancement at low ppbar mass is observed and
the Dalitz plot asymmetry suggests dominance of the penguin amplitude in this B
decay. We search for a pentaquark candidate Theta*++ decaying into pK+ in the
mass range 1.43 to 2.00 GeV/c2 and set limits on Br(B+ -->
Theta*++pbar)xBr(Theta*++ --> pK+) at the 10^{-7} level.
Comment: 8 pages, 7 postscript figures, submitted to Phys. Rev. D (Rapid
Communications)