1,736 research outputs found

    Electronic health record data quality assessment and tools: A systematic review

    Get PDF
    OBJECTIVE: We extended a 2013 literature review on electronic health record (EHR) data quality assessment approaches and tools to determine recent improvements or changes in EHR data quality assessment methodologies. MATERIALS AND METHODS: We completed a systematic review of PubMed articles from 2013 to April 2023 that discussed the quality assessment of EHR data. We screened and reviewed papers for the dimensions and methods defined in the original 2013 manuscript. We categorized papers as data quality outcomes of interest, tools, or opinion pieces. We abstracted and defined additional themes and methods though an iterative review process. RESULTS: We included 103 papers in the review, of which 73 were data quality outcomes of interest papers, 22 were tools, and 8 were opinion pieces. The most common dimension of data quality assessed was completeness, followed by correctness, concordance, plausibility, and currency. We abstracted conformance and bias as 2 additional dimensions of data quality and structural agreement as an additional methodology. DISCUSSION: There has been an increase in EHR data quality assessment publications since the original 2013 review. Consistent dimensions of EHR data quality continue to be assessed across applications. Despite consistent patterns of assessment, there still does not exist a standard approach for assessing EHR data quality. CONCLUSION: Guidelines are needed for EHR data quality assessment to improve the efficiency, transparency, comparability, and interoperability of data quality assessment. These guidelines must be both scalable and flexible. Automation could be helpful in generalizing this process

    Leveraging GPT-4 for identifying cancer phenotypes in electronic health records: A performance comparison between GPT-4, GPT-3.5-turbo, Flan-T5, Llama-3-8B, and spaCy\u27s rule-based and machine learning-based methods

    Get PDF
    OBJECTIVE: Accurately identifying clinical phenotypes from Electronic Health Records (EHRs) provides additional insights into patients\u27 health, especially when such information is unavailable in structured data. This study evaluates the application of OpenAI\u27s Generative Pre-trained Transformer (GPT)-4 model to identify clinical phenotypes from EHR text in non-small cell lung cancer (NSCLC) patients. The goal was to identify disease stages, treatments and progression utilizing GPT-4, and compare its performance against GPT-3.5-turbo, Flan-T5-xl, Flan-T5-xxl, Llama-3-8B, and 2 rule-based and machine learning-based methods, namely, scispaCy and medspaCy. MATERIALS AND METHODS: Phenotypes such as initial cancer stage, initial treatment, evidence of cancer recurrence, and affected organs during recurrence were identified from 13 646 clinical notes for 63 NSCLC patients from Washington University in St. Louis, Missouri. The performance of the GPT-4 model is evaluated against GPT-3.5-turbo, Flan-T5-xxl, Flan-T5-xl, Llama-3-8B, medspaCy, and scispaCy by comparing precision, recall, and micro-F1 scores. RESULTS: GPT-4 achieved higher F1 score, precision, and recall compared to Flan-T5-xl, Flan-T5-xxl, Llama-3-8B, medspaCy, and scispaCy\u27s models. GPT-3.5-turbo performed similarly to that of GPT-4. GPT, Flan-T5, and Llama models were not constrained by explicit rule requirements for contextual pattern recognition. spaCy models relied on predefined patterns, leading to their suboptimal performance. DISCUSSION AND CONCLUSION: GPT-4 improves clinical phenotype identification due to its robust pre-training and remarkable pattern recognition capability on the embedded tokens. It demonstrates data-driven effectiveness even with limited context in the input. While rule-based models remain useful for some tasks, GPT models offer improved contextual understanding of the text, and robust clinical phenotype extraction

    Association between socioeconomic factors, race, and use of a specialty memory clinic

    Get PDF
    BACKGROUND AND OBJECTIVES: The capacity of specialty memory clinics in the United States is very limited. If lower socioeconomic status or minoritized racial group is associated with reduced use of memory clinics, this could exacerbate health care disparities, especially if more effective treatments of Alzheimer disease become available. We aimed to understand how use of a memory clinic is associated with neighborhood-level measures of socioeconomic factors and the intersectionality of race. METHODS: We conducted an observational cross-sectional study using electronic health record data to compare the neighborhood advantage of patients seen at the Washington University Memory Diagnostic Center with the catchment area using a geographical information system. Furthermore, we compared the severity of dementia at the initial visit between patients who self-identified as Black or White. We used a multinomial logistic regression model to assess the Clinical Dementia Rating at the initial visit and RESULTS: A total of 4,824 patients seen at the memory clinic between 2008 and 2018 were included in this study (mean age 72.7 [SD 11.0] years, 2,712 [56%] female, 543 [11%] Black). Most of the memory clinic patients lived in more advantaged neighborhoods within the overall catchment area. The percentage of patients self-identifying as Black (11%) was lower than the average percentage of Black individuals by census tract in the catchment area (16%) ( DISCUSSION: This study demonstrates that patients living in less affluent neighborhoods were less likely to be seen in one large memory clinic. Black patients were under-represented in the clinic, and Black patients had more severe dementia at their initial visit. These findings suggest that patients with a lower socioeconomic status and who identify as Black are less likely to be seen in memory clinics, which are likely to be a major point of access for any new Alzheimer disease treatments that may become available

    Predicting language diversity with complex network

    Full text link
    Evolution and propagation of the world's languages is a complex phenomenon, driven, to a large extent, by social interactions. Multilingual society can be seen as a system of interacting agents, where the interaction leads to a modification of the language spoken by the individuals. Two people can reach the state of full linguistic compatibility due to the positive interactions, like transfer of loanwords. But, on the other hand, if they speak entirely different languages, they will separate from each other. These simple observations make the network science the most suitable framework to describe and analyze dynamics of language change. Although many mechanisms have been explained, we lack a qualitative description of the scaling behavior for different sizes of a population. Here we address the issue of the language diversity in societies of different sizes, and we show that local interactions are crucial to capture characteristics of the empirical data. We propose a model of social interactions, extending the idea from, that explains the growth of the language diversity with the size of a population of country or society. We argue that high clustering and network disintegration are the most important characteristics of models properly describing empirical data. Furthermore, we cancel the contradiction between previous models and the Solomon Islands case. Our results demonstrate the importance of the topology of the network, and the rewiring mechanism in the process of language change

    Active/Passive, ‘Diminished’/‘Beautiful’, ‘Light’ from Above and Below: Rereading Shekhinah’s Sexual Desire in Zohar al Shir ha-Shirim (Song of Songs)

    Get PDF
    In Zohar al Shir ha-Shirim, the Zohar’s reading of Song of Songs, Shekhinah, echoing themes associated with the Shulamite of the biblical text, consistently initiates cosmic union. Sexual desire in the zoharic texts is a form of capital necessary to facilitate sefirotic intercourse, although scholarly readings of the zoharic corpus often identify Shekhinah as a passive receptacle. This, however, is only true if the endemic contradictions within the texts are glossed over. In Song of Songs, the Shulamite’s sexual ‘initiative’ is core. This was not lost on the author(s) of Zohar al Shir ha-Shirim, who, in struggling to explain Shekhinah’s sefirotic role in line with the erotics of Song of Songs, inescapably echoed the ‘depatriarchalizing’ themes of the biblical text. As this article demonstrates, in Zohar al Shir ha-Shirim, Shekhinah is active and repeatedly encourages and frustrates cosmic sexual intercourse. Zohar al Shir ha-Shirim shows that it is possible to reread Shekhinah’s role beyond the androcentrism of the authors as well as scholarly assumptions about her passivity

    Measurement of the quasi-elastic axial vector mass in neutrino-oxygen interactions

    Get PDF
    The weak nucleon axial-vector form factor for quasi-elastic interactions is determined using neutrino interaction data from the K2K Scintillating Fiber detector in the neutrino beam at KEK. More than 12,000 events are analyzed, of which half are charged-current quasi-elastic interactions nu-mu n to mu- p occurring primarily in oxygen nuclei. We use a relativistic Fermi gas model for oxygen and assume the form factor is approximately a dipole with one parameter, the axial vector mass M_A, and fit to the shape of the distribution of the square of the momentum transfer from the nucleon to the nucleus. Our best fit result for M_A = 1.20 \pm 0.12 GeV. Furthermore, this analysis includes updated vector form factors from recent electron scattering experiments and a discussion of the effects of the nucleon momentum on the shape of the fitted distributions.Comment: 14 pages, 10 figures, 6 table

    Measurement of the Branching Fraction for B- --> D0 K*-

    Get PDF
    We present a measurement of the branching fraction for the decay B- --> D0 K*- using a sample of approximately 86 million BBbar pairs collected by the BaBar detector from e+e- collisions near the Y(4S) resonance. The D0 is detected through its decays to K- pi+, K- pi+ pi0 and K- pi+ pi- pi+, and the K*- through its decay to K0S pi-. We measure the branching fraction to be B.F.(B- --> D0 K*-)= (6.3 +/- 0.7(stat.) +/- 0.5(syst.)) x 10^{-4}.Comment: 7 pages, 1 postscript figure, submitted to Phys. Rev. D (Rapid Communications

    A Study of Time-Dependent CP-Violating Asymmetries and Flavor Oscillations in Neutral B Decays at the Upsilon(4S)

    Get PDF
    We present a measurement of time-dependent CP-violating asymmetries in neutral B meson decays collected with the BABAR detector at the PEP-II asymmetric-energy B Factory at the Stanford Linear Accelerator Center. The data sample consists of 29.7 fb1{\rm fb}^{-1} recorded at the Υ(4S)\Upsilon(4S) resonance and 3.9 fb1{\rm fb}^{-1} off-resonance. One of the neutral B mesons, which are produced in pairs at the Υ(4S)\Upsilon(4S), is fully reconstructed in the CP decay modes J/ψKS0J/\psi K^0_S, ψ(2S)KS0\psi(2S) K^0_S, χc1KS0\chi_{c1} K^0_S, J/ψK0J/\psi K^{*0} (K0KS0π0K^{*0}\to K^0_S\pi^0) and J/ψKL0J/\psi K^0_L, or in flavor-eigenstate modes involving D()π/ρ/a1D^{(*)}\pi/\rho/a_1 and J/ψK0J/\psi K^{*0} (K0K+πK^{*0}\to K^+\pi^-). The flavor of the other neutral B meson is tagged at the time of its decay, mainly with the charge of identified leptons and kaons. The proper time elapsed between the decays is determined by measuring the distance between the decay vertices. A maximum-likelihood fit to this flavor eigenstate sample finds Δmd=0.516±0.016(stat)±0.010(syst)ps1\Delta m_d = 0.516\pm 0.016 {\rm (stat)} \pm 0.010 {\rm (syst)} {\rm ps}^{-1}. The value of the asymmetry amplitude sin2β\sin2\beta is determined from a simultaneous maximum-likelihood fit to the time-difference distribution of the flavor-eigenstate sample and about 642 tagged B0B^0 decays in the CP-eigenstate modes. We find sin2β=0.59±0.14(stat)±0.05(syst)\sin2\beta=0.59\pm 0.14 {\rm (stat)} \pm 0.05 {\rm (syst)}, demonstrating that CP violation exists in the neutral B meson system. (abridged)Comment: 58 pages, 35 figures, submitted to Physical Review

    Measurement of Branching Fraction and Dalitz Distribution for B0->D(*)+/- K0 pi-/+ Decays

    Get PDF
    We present measurements of the branching fractions for the three-body decays B0 -> D(*)-/+ K0 pi^+/-andtheirresonantsubmodes and their resonant submodes B0 -> D(*)-/+ K*+/- using a sample of approximately 88 million BBbar pairs collected by the BABAR detector at the PEP-II asymmetric energy storage ring. We measure: B(B0->D-/+ K0 pi+/-)=(4.9 +/- 0.7(stat) +/- 0.5 (syst)) 10^{-4} B(B0->D*-/+ K0 pi+/-)=(3.0 +/- 0.7(stat) +/- 0.3 (syst)) 10^{-4} B(B0->D-/+ K*+/-)=(4.6 +/- 0.6(stat) +/- 0.5 (syst)) 10^{-4} B(B0->D*-/+ K*+/-)=(3.2 +/- 0.6(stat) +/- 0.3 (syst)) 10^{-4} From these measurements we determine the fractions of resonant events to be : f(B0-> D-/+ K*+/-) = 0.63 +/- 0.08(stat) +/- 0.04(syst) f(B0-> D*-/+ K*+/-) = 0.72 +/- 0.14(stat) +/- 0.05(syst)Comment: 7 pages, 3 figures submitted to Phys. Rev. Let

    Measurement of the B+ --> p pbar K+ Branching Fraction and Study of the Decay Dynamics

    Get PDF
    With a sample of 232x10^6 Upsilon(4S) --> BBbar events collected with the BaBar detector, we study the decay B+ --> p pbar K+ excluding charmonium decays to ppbar. We measure a branching fraction Br(B+ --> p pbar K+)=(6.7+/-0.5+/-0.4)x10^{-6}. An enhancement at low ppbar mass is observed and the Dalitz plot asymmetry suggests dominance of the penguin amplitude in this B decay. We search for a pentaquark candidate Theta*++ decaying into pK+ in the mass range 1.43 to 2.00 GeV/c2 and set limits on Br(B+ --> Theta*++pbar)xBr(Theta*++ --> pK+) at the 10^{-7} level.Comment: 8 pages, 7 postscript figures, submitted to Phys. Rev. D (Rapid Communications
    corecore