5 research outputs found

    Int J Popul Data Sci

    Get PDF
    IntroductionFew studies have addressed how to select a study sample when using electronic health record (EHR) data.ObjectiveTo examine how changing criterion for number of visits in EHR data required for inclusion in a study sample would impact one basic epidemiologic measure: estimates of disease period prevalence.MethodsYear 2016 EHR data from three Midwestern health systems (Northwestern Medicine in Illinois, University of Iowa Health Care, and Froedtert & the Medical College of Wisconsin, all regional tertiary health care systems including hospitals and clinics) was used to examine how alternate definitions of the study sample, based on number of healthcare visits in one year, affected measures of disease period prevalence. In 2016, each of these health systems saw between 160,000 and 420,000 unique patients. Curated collections of ICD-9, ICD-10, and SNOMED codes (from CMS-approved electronic clinical quality measures) were used to define three diseases: acute myocardial infarction, asthma, and diabetic nephropathy).ResultsAcross all health systems, increasing the minimum required number of visits to be included in the study sample monotonically increased crude period prevalence estimates. The rate at which prevalence estimates increased with number of visits varied across sites and across diseases.ConclusionsIn addition to providing thorough descriptions of case definitions, when using EHR data authors must carefully describe how a study sample is identified and report data for a range of sample definitions, including minimum number of visits, so that others can assess the sensitivity of reported results to sample definition in EHR data.U18 DP006120/DP/NCCDPHP CDC HHS/United StatesUL1 TR001436/TR/NCATS NIH HHS/United StatesUL1 TR002537/TR/NCATS NIH HHS/United States2021-01-01T00:00:00Z32864475PMC74487498874vault:3634

    Towards an Ontology-Based Phenotypic Query Model

    Get PDF
    Clinical research based on data from patient or study data management systems plays an important role in transferring basic findings into the daily practices of physicians. To support study recruitment, diagnostic processes, and risk factor evaluation, search queries for such management systems can be used. Typically, the query syntax as well as the underlying data structure vary greatly between different data management systems. This makes it difficult for domain experts (e.g., clinicians) to build and execute search queries. In this work, the Core Ontology of Phenotypes is used as a general model for phenotypic knowledge. This knowledge is required to create search queries that determine and classify individuals (e.g., patients or study participants) whose morphology, function, behaviour, or biochemical and physiological properties meet specific phenotype classes. A specific model describing a set of particular phenotype classes is called a Phenotype Specification Ontology. Such an ontology can be automatically converted to search queries on data management systems. The methods described have already been used successfully in several projects. Using ontologies to model phenotypic knowledge on patient or study data management systems is a viable approach. It allows clinicians to model from a domain perspective without knowing the actual data structure or query language

    Assessing electronic health record phenotypes against gold-standard diagnostic criteria for diabetes mellitus

    No full text
    Objective: We assessed the sensitivity and specificity of 8 electronic health record (EHR)-based phenotypes for diabetes mellitus against gold-standard American Diabetes Association (ADA) diagnostic criteria via chart review by clinical experts. Materials and Methods: We identified EHR-based diabetes phenotype definitions that were developed for various purposes by a variety of users, including academic medical centers, Medicare, the New York City Health Department, and pharmacy benefit managers. We applied these definitions to a sample of 173 503 patients with records in the Duke Health System Enterprise Data Warehouse and at least 1 visit over a 5-year period (2007–2011). Of these patients, 22 679 (13%) met the criteria of 1 or more of the selected diabetes phenotype definitions. A statistically balanced sample of these patients was selected for chart review by clinical experts to determine the presence or absence of type 2 diabetes in the sample. Results: The sensitivity (62–94%) and specificity (95–99%) of EHR-based type 2 diabetes phenotypes (compared with the gold standard ADA criteria via chart review) varied depending on the component criteria and timing of observations and measurements. Discussion and Conclusions: Researchers using EHR-based phenotype definitions should clearly specify the characteristics that comprise the definition, variations of ADA criteria, and how different phenotype definitions and components impact the patient populations retrieved and the intended application. Careful attention to phenotype definitions is critical if the promise of leveraging EHR data to improve individual and population health is to be fulfilled

    Utilizing Electronic Dental Record Data to Track Periodontal Disease Change

    Get PDF
    Indiana University-Purdue University Indianapolis (IUPUI)Periodontal disease (PD) affects 42% of US population resulting in compromised quality of life, the potential for tooth loss and influence on overall health. Despite significant understanding of PD etiology, limited longitudinal studies have investigated PD change in response to various treatments. A major barrier is the difficulty of conducting randomized controlled trials with adequate numbers of patients over a longer time. Electronic dental record (EDR) data offer the opportunity to study outcomes following various periodontal treatments. However, using EDR data for research has challenges including quality and missing data. In this dissertation, I studied a cohort of patients with PD from EDR to monitor their disease status over time. I studied retrospectively 28,908 patients who received comprehensive oral evaluation at the Indiana University School of Dentistry between January 1st-2009 and December 31st-2014. Using natural language processing and automated approaches, we 1) determined PD diagnoses from periodontal charting based on case definitions for surveillance studies, 2) extracted clinician-recorded diagnoses from clinical notes, 3) determined the number of patients with disease improvement or progression over time from EDR data. We found 100% completeness for age, sex; 72% for race; 80% for periodontal charting findings; and 47% for clinician-recorded diagnoses. The number of visits ranged from 1-14 with an average of two visits. From diagnoses obtained from findings, 37% of patients had gingivitis, 55% had moderate periodontitis, and 28% had severe periodontitis. In clinician-recorded diagnoses, 50% patients had gingivitis, 18% had mild, 14% had moderate, and 4% had severe periodontitis. The concordance between periodontal charting-generated and clinician-recorded diagnoses was 47%. The results indicate that case definitions for PD are underestimating gingivitis and overestimating the prevalence of periodontitis. Expert review of findings identified clinicians relying on visual assessment and radiographic findings in addition to the case definition criteria to document PD diagnosis.2021-08-1
    corecore