166 research outputs found

    Identifying data set specific duplicate patient records

    Get PDF
    posterProbabilistic models are commonly used in the identification of duplicate records. These methods are usually more accurate than deterministic methods, but are exponentially more computationally complex. Thus to make them computationally feasible, they rely on deterministic blocking strategies. This project investigates how machine learning methods can be used to automatically determine an optimal blocking strategy using duplicate records already identified

    Creation of an open source master person index from proprietary code: the open source "care data exchange" project

    Get PDF
    posterFrom 1998 to 2004 the ""Care Data Exchange"" (CDE) software was developed as a proprietary product by CareScience for the California HealthCare Foundation (CHCF). In 2005 CHCF asked Forrester Research to study the feasibility of releasing the CDE software assets under a free, open source license. The Forrester report articulated relationships between proprietary and nonproprietary components in the CDE Information Architecture (CIA)

    Understanding the profile of errors that cause duplicate entries in a patient registry

    Get PDF
    posterDuplicate records are detrimental to the cost-effective and efficient delivery of health care. Manually identifying and resolving duplicates can cost $60 per case. Patterns have been found in the types of errors that occur in patient registries, suggesting that undetected duplicate records may be similar to those already identified. At the University of Utah, records from all community clinics are merged with hospital records in the Enterprise Data Warehouse (EDW). The Pedigree and Population Resource group at Huntsman Cancer Institute links demographic records from the EDW to the Utah Population Database (UPDB). In last year's linkage, 76,922 duplicate records were identified. The purpose of this study was to compare the differences between clinic and hospital records in the EDW with existing literature

    Preserving academic poster content

    Get PDF
    posterPosters are an important way to share information between academia and industry. They are presented at national conferences, regional meetings, and even in university departments. There were almost 75,000 calls for poster submissions last year alone. Most posters are presented for only a few hours at a conference and may be difficult to translate into full papers. Posters are represented by abstracts submitted months before conferences. They may not accurately reflect poster content. A new method for preserving academic poster content is needed

    Phylogenetically Widespread Multiple Paternity in New World Natricine Snakes

    Get PDF
    We used microsatellite DNA markers to identify the extent to which multiple paternity within litters occurs among species of New World natricine snakes. We selected seven species to represent the three major clades of Natricinae and all three subclades of the gartersnake clade. Microsatellite DNA genotyping of dams and litters confirmed multiple paternity within litters of six species, including Thamnophis radix, T. sauritus, Storeria dekayi, S. occipitomaculata, Nerodia rhombifer, and Regina septemvittata. Multiple paternity was not evident in one litter of nine Thamnophis melanogaster. Together with published data documenting multiple paternity in T. bulteri, T. elegans, T. sirtalis, and N. sipedon, these results confirm the phylogenetically widespread occurrence of multiple paternity among New World natricines, emphasizing the need to consider phylogenetic (historical) explanations when analyzing snake mating systems

    Bayesian peak-bagging of solar-like oscillators using MCMC: A comprehensive guide

    Full text link
    Context: Asteroseismology has entered a new era with the advent of the NASA Kepler mission. Long and continuous photometric observations of unprecedented quality are now available which have stimulated the development of a number of suites of innovative analysis tools. Aims: The power spectra of solar-like oscillations are an inexhaustible source of information on stellar structure and evolution. Robust methods are hence needed in order to infer both individual oscillation mode parameters and parameters describing non-resonant features, thus making a seismic interpretation possible. Methods: We present a comprehensive guide to the implementation of a Bayesian peak-bagging tool that employs a Markov chain Monte Carlo (MCMC). Besides making it possible to incorporate relevant prior information through Bayes' theorem, this tool also allows one to obtain the marginal probability density function for each of the fitted parameters. We apply this tool to a couple of recent asteroseismic data sets, namely, to CoRoT observations of HD 49933 and to ground-based observations made during a campaign devoted to Procyon. Results: The developed method performs remarkably well at constraining not only in the traditional case of extracting oscillation frequencies, but also when pushing the limit where traditional methods have difficulties. Moreover it provides an rigorous way of comparing competing models, such as the ridge identifications, against the asteroseismic data.Comment: Accepted for publication in A&

    Structure and Rotation of the Solar Interior: Initial Results from the MDI Medium-L Program

    Get PDF
    The medium-l program of the Michelson Doppler Imager instrument on board SOHO provides continuous observations of oscillation modes of angular degree, l, from 0 to approximately 300. The data for the program are partly processed on board because only about 3% of MDI observations can be transmitted continuously to the ground. The on-board data processing, the main component of which is Gaussian-weighted binning, has been optimized to reduce the negative influence of spatial aliasing of the high-degree oscillation modes. The data processing is completed in a data analysis pipeline at the SOI Stanford Support Center to determine the mean multiplet frequencies and splitting coefficients. The initial results show that the noise in the medium-l oscillation power spectrum is substantially lower than in ground-based measurements. This enables us to detect lower amplitude modes and, thus, to extend the range of measured mode frequencies. This is important for inferring the Sun's internal structure and rotation. The MDI observations also reveal the asymmetry of oscillation spectral lines. The line asymmetries agree with the theory of mode excitation by acoustic sources localized in the upper convective boundary layer. The sound-speed profile inferred from the mean frequencies gives evidence for a sharp variation at the edge of the energy-generating core. The results also confirm the previous finding by the GONG (Gough et al., 1996) that, in a thin layer just beneath the convection zone, helium appears to be less abundant than predicted by theory. Inverting the multiplet frequency splittings from MDI, we detect significant rotational shear in this thin layer. This layer is likely to be the place where the solar dynamo operates. In order to understand how the Sun works, it is extremely important to observe the evolution of this transition layer throughout the 11-year activity cycle

    International cohort study indicates no association between alpha-1 blockers and susceptibility to COVID-19 in benign prostatic hyperplasia patients

    Get PDF
    Purpose: Alpha-1 blockers, often used to treat benign prostatic hyperplasia (BPH), have been hypothesized to prevent COVID-19 complications by minimising cytokine storm release. The proposed treatment based on this hypothesis currently lacks support from reliable real-world evidence, however. We leverage an international network of large-scale healthcare databases to generate comprehensive evidence in a transparent and reproducible manner.Methods: In this international cohort study, we deployed electronic health records from Spain (SIDIAP) and the United States (Department of Veterans Affairs, Columbia University Irving Medical Center, IQVIA OpenClaims, Optum DOD, Optum EHR). We assessed association between alpha-1 blocker use and risks of three COVID-19 outcomes-diagnosis, hospitalization, and hospitalization requiring intensive services-using a prevalent-user active-comparator design. We estimated hazard ratios using state-of-the-art techniques to minimize potential confounding, including large-scale propensity score matching/stratification and negative control calibration. We pooled database-specific estimates through random effects meta-analysis.Results: Our study overall included 2.6 and 0.46 million users of alpha-1 blockers and of alternative BPH medications. We observed no significant difference in their risks for any of the COVID-19 outcomes, with our meta-analytic HR estimates being 1.02 (95% CI: 0.92-1.13) for diagnosis, 1.00 (95% CI: 0.89-1.13) for hospitalization, and 1.15 (95% CI: 0.71-1.88) for hospitalization requiring intensive services.Conclusion: We found no evidence of the hypothesized reduction in risks of the COVID-19 outcomes from the prevalent-use of alpha-1 blockers-further research is needed to identify effective therapies for this novel disease.</p

    Multinational patterns of second line antihyperglycaemic drug initiation across cardiovascular risk groups:federated pharmacoepidemiological evaluation in LEGEND-T2DM

    Get PDF
    Objective: To assess the uptake of second line antihyperglycaemic drugs among patients with type 2 diabetes mellitus who are receiving metformin.Design: Federated pharmacoepidemiological evaluation in LEGEND-T2DM.Setting: 10 US and seven non-US electronic health record and administrative claims databases in the Observational Health Data Sciences and Informatics network in eight countries from 2011 to the end of 2021.Participants: 4.8 million patients (≥18 years) across US and non-US based databases with type 2 diabetes mellitus who had received metformin monotherapy and had initiated second line treatments.Exposure: The exposure used to evaluate each database was calendar year trends, with the years in the study that were specific to each cohort.Main outcomes measures: The outcome was the incidence of second line antihyperglycaemic drug use (ie, glucagon-like peptide-1 receptor agonists, sodium-glucose cotransporter-2 inhibitors, dipeptidyl peptidase-4 inhibitors, and sulfonylureas) among individuals who were already receiving treatment with metformin. The relative drug class level uptake across cardiovascular risk groups was also evaluated.Results: 4.6 million patients were identified in US databases, 61 382 from Spain, 32 442 from Germany, 25 173 from the UK, 13 270 from France, 5580 from Scotland, 4614 from Hong Kong, and 2322 from Australia. During 2011-21, the combined proportional initiation of the cardioprotective antihyperglycaemic drugs (glucagon-like peptide-1 receptor agonists and sodium-glucose cotransporter-2 inhibitors) increased across all data sources, with the combined initiation of these drugs as second line drugs in 2021 ranging from 35.2% to 68.2% in the US databases, 15.4% in France, 34.7% in Spain, 50.1% in Germany, and 54.8% in Scotland. From 2016 to 2021, in some US and non-US databases, uptake of glucagon-like peptide-1 receptor agonists and sodium-glucose cotransporter-2 inhibitors increased more significantly among populations with no cardiovascular disease compared with patients with established cardiovascular disease. No data source provided evidence of a greater increase in the uptake of these two drug classes in populations with cardiovascular disease compared with no cardiovascular disease.Conclusions: Despite the increase in overall uptake of cardioprotective antihyperglycaemic drugs as second line treatments for type 2 diabetes mellitus, their uptake was lower in patients with cardiovascular disease than in people with no cardiovascular disease over the past decade. A strategy is needed to ensure that medication use is concordant with guideline recommendations to improve outcomes of patients with type 2 diabetes mellitus.</p
    corecore