13 research outputs found

    Standardizing adverse drug event reporting data

    Full text link

    Standardizing adverse drug event reporting data

    Get PDF

    J Biomed Inform

    Get PDF
    We followed a systematic approach based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses to identify existing clinical natural language processing (NLP) systems that generate structured information from unstructured free text. Seven literature databases were searched with a query combining the concepts of natural language processing and structured data capture. Two reviewers screened all records for relevance during two screening phases, and information about clinical NLP systems was collected from the final set of papers. A total of 7149 records (after removing duplicates) were retrieved and screened, and 86 were determined to fit the review criteria. These papers contained information about 71 different clinical NLP systems, which were then analyzed. The NLP systems address a wide variety of important clinical and research tasks. Certain tasks are well addressed by the existing systems, while others remain as open challenges that only a small number of systems attempt, such as extraction of temporal information or normalization of concepts to standard terminologies. This review has identified many NLP systems capable of processing clinical free text and generating structured output, and the information collected and evaluated here will be important for prioritizing development of new approaches for clinical NLP.CC999999/ImCDC/Intramural CDC HHS/United States2019-11-20T00:00:00Z28729030PMC6864736694

    Cancer

    Get PDF
    BackgroundUnderstanding of cancer outcomes is limited by data fragmentation. We analyzed the information yielded by integrating breast cancer data from three sources: electronic medical records (EMRs) of two healthcare systems and the state registry.MethodsWe extracted diagnostic test and treatment data from EMRs of all breast cancer patients treated from 2000\ue2\u20ac\u201c2010 in two independent California institutions: a community-based practice (Palo Alto Medical Foundation) and an academic medical center (Stanford University). We incorporated records from the population-based California Cancer Registry (CCR), and then linked EMR-CCR datasets of Community and University patients.ResultsWe initially identified 8210 University patients and 5770 Community patients; linked datasets revealed a 16% patient overlap, yielding 12,109 unique patients. The proportion of all Community patients, but not University patients, treated at both institutions increased with worsening cancer prognostic factors. Before linking datasets, Community patients appeared to receive less intervention than University patients (mastectomy: 37.6% versus 43.2%; chemotherapy: 35% versus 41.7%; magnetic resonance imaging (MRI): 10% versus 29.3%; genetic testing: 2.5% versus 9.2%). Linked Community and University datasets revealed that patients treated at both institutions received substantially more intervention (mastectomy: 55.8%; chemotherapy: 47.2%; MRI: 38.9%; genetic testing: 10.9%; p<0.001 for each three-way institutional comparison).ConclusionData linkage identified 16% of patients who were treated in two healthcare systems and who, despite comparable prognostic factors, received far more intensive treatment than others. By integrating complementary data from EMRs and population-based registries, we obtained a more comprehensive understanding of breast cancer care and factors that drive treatment utilization.1U58 DP000807-01/DP/NCCDPHP CDC HHS/United StatesHHSN261201000034C/CA/NCI NIH HHS/United StatesHHSN261201000034C/PHS HHS/United StatesHHSN261201000035C/PHS HHS/United StatesHHSN261201000140C/CA/NCI NIH HHS/United StatesHHSN261201000140C/PHS HHS/United States2015-01-01T00:00:00Z24101577PMC386759

    Electronic Health Record Phenotyping in Cardiovascular Epidemiology

    Get PDF
    The secondary use of EHR data for research is a cost-effective resource for a variety of research questions and domains; however, there are many challenges when using electronic health record (EHR) data for epidemiologic research.This dissertation quantified differences in prevalence for acute myocardial infarction (MI) and heart failure (HF) using phenotyping algorithms differing in diagnosis position of ICD-10-CM codes and the inclusion of clinical components. The period of interest was January 1, 2016 to December 31, 2019 for UNC Clinical Data Warehouse for Health data and October 1, 2015 and December 31, 2019 for Atherosclerosis Risk in Communities (ARIC) Study data, the latter used for validation analyses. During the period of interest, 13,200 acute MI cases and 53,545 HF cases were identified in the UNC data. Age-standardized prevalence of acute MI and HF were highest using Any Diagnosis Position algorithm and lowest for acute MI using 1st or 2nd Diagnosis Position with Lab or Procedure and 1st Diagnosis Position for HF. Projected differences in healthcare expenditures by algorithm as well as patient and clinical characteristics, such as event severity and mortality, were also estimated. When compared to physician-adjudicated hospitalizations in the ARIC study, the phenotyping algorithms used for the UNC analysis performed well given their simplicity. The algorithm with the highest sensitivity was Any Diagnosis Position for acute MI and HF at 75.5% and 70.5%. Specificity, PPV, and NPV ranged from 80-99% for all algorithms. Requiring clinical components had little effect except for increasing PPV slightly, while restricting diagnosis position to 1st or 2nd position decreased sensitivity and increased PPV. The impact of clinical components or diagnosis position did not differ by race, age, or sex subgroups.The results from this dissertation can be used by researchers using EHR data for a variety of reasons from informing their own analytic decisions to validating their study findings. The continued use of EHR data for research requires transparency to facilitate reproducibility as well as studies focused on what we are measuring.Doctor of Philosoph

    Preface

    Get PDF
    corecore