121 research outputs found

    Comparing administrative and survey data for ascertaining cases of irritable bowel syndrome: a population-based investigation

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Administrative and survey data are two key data sources for population-based research about chronic disease. The objectives of this methodological paper are to: (1) estimate agreement between the two data sources for irritable bowel syndrome (IBS) and compare the results to those for inflammatory bowel disease (IBD); (2) compare the frequency of IBS-related diagnoses in administrative data for survey respondents with and without self-reported IBS, and (3) estimate IBS prevalence from both sources.</p> <p>Methods</p> <p>This retrospective cohort study used linked administrative and health survey data for 5,134 adults from the province of Manitoba, Canada. Diagnoses in hospital and physician administrative data were investigated for respondents with self-reported IBS, IBD, and no bowel disorder. Agreement between survey and administrative data was estimated using the Īŗ statistic. The Ļ‡<sup>2 </sup>statistic tested the association between the frequency of IBS-related diagnoses and self-reported IBS. Crude, sex-specific, and age-specific IBS prevalence estimates were calculated from both sources.</p> <p>Results</p> <p>Overall, 3.0% of the cohort had self-reported IBS, 0.8% had self-reported IBD, and 95.3% reported no bowel disorder. Agreement was poor to fair for IBS and substantially higher for IBD. The most frequent IBS-related diagnoses among the cohort were anxiety disorders (34.4%), symptoms of the abdomen and pelvis (26.9%), and diverticulitis of the intestine (10.6%). Crude IBS prevalence estimates from both sources were lower than those reported previously.</p> <p>Conclusions</p> <p>Poor agreement between administrative and survey data for IBS may account for differences in the results of health services and outcomes research using these sources. Further research is needed to identify the optimal method(s) to ascertain IBS cases in both data sources.</p

    Osteoporosis-related fracture case definitions for population-based administrative data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Population-based administrative data have been used to study osteoporosis-related fracture risk factors and outcomes, but there has been limited research about the validity of these data for ascertaining fracture cases. The objectives of this study were to: (a) compare fracture incidence estimates from administrative data with estimates from population-based clinically-validated data, and (b) test for differences in incidence estimates from multiple administrative data case definitions.</p> <p>Methods</p> <p>Thirty-five case definitions for incident fractures of the hip, wrist, humerus, and clinical vertebrae were constructed using diagnosis codes in hospital data and diagnosis and service codes in physician billing data from Manitoba, Canada. Clinically-validated fractures were identified from the Canadian Multicentre Osteoporosis Study (CaMos). Generalized linear models were used to test for differences in incidence estimates.</p> <p>Results</p> <p>For hip fracture, sex-specific differences were observed in the magnitude of under- and over-ascertainment of administrative data case definitions when compared with CaMos data. The length of the fracture-free period to ascertain incident cases had a variable effect on over-ascertainment across fracture sites, as did the use of imaging, fixation, or repair service codes. Case definitions based on hospital data resulted in under-ascertainment of incident clinical vertebral fractures. There were no significant differences in trend estimates for wrist, humerus, and clinical vertebral case definitions.</p> <p>Conclusions</p> <p>The validity of administrative data for estimating fracture incidence depends on the site and features of the case definition.</p

    Statistical practices of educational researchers: An analysis of their ANOVA, MANOVA, and ANCOVA analyses

    Get PDF
    Articles published in several prominent educational journals were examined to investigate the use of data-analysis tools by researchers in four research paradigms: between-subjects univariate designs, between-subjects multivariate designs, repeated measures designs, and covariance designs. In addition to examining specific details pertaining to the research design (e.g., sample size, group size equality/inequality) and methods employed for data analysis, we also catalogued whether: (a) validity assumptions were examined, (b) effect size indices were reported, (c) sample sizes were selected based on power considerations, and (d) appropriate textbooks and/or articles were cited to communicate the nature of the analyses that were performed. Our analyses imply that researchers rarely verify that validity assumptions are satisfied and accordingly typically use analyses that are nonrobust to assumption violations. In addition, researchers rarely report effect size statistics, nor do they routinely perform power analyses to determine sample size requirements. We offer many recommendations to rectify these shortcomings.Social Sciences and Humanities Research Counci

    Investigating concordance in diabetes diagnosis between primary care charts (electronic medical records) and health administrative data: a retrospective cohort study

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Electronic medical records contain valuable clinical information not readily available elsewhere. Accordingly, they hold important potential for contributing to and enhancing chronic disease registries with the goal of improving chronic disease management; however a standard for diagnoses of conditions such as diabetes remains to be developed. The purpose of this study was to establish a validated electronic medical record definition for diabetes.</p> <p>Methods</p> <p>We constructed a retrospective cohort using health administrative data from the Institute for Clinical Evaluative Sciences Ontario Diabetes Database linked with electronic medical records from the Deliver Primary Healthcare Information Project using data from 1 April 2006 - 31 March 2008 (N = 19,443). We systematically examined eight definitions for diabetes diagnosis, both established and proposed.</p> <p>Results</p> <p>The definition that identified the highest number of patients with diabetes (N = 2,180) while limiting to those with the highest probability of having diabetes was: individuals with ā‰„2 abnormal plasma glucose tests, or diabetes on the problem list, or insulin prescription, or ā‰„2 oral anti-diabetic agents, or HbA1c ā‰„6.5%. Compared to the Ontario Diabetes Database, this definition identified 13% more patients while maintaining good sensitivity (75%) and specificity (98%).</p> <p>Conclusions</p> <p>This study establishes the feasibility of developing an electronic medical record standard definition of diabetes and validates an algorithm for use in this context. While the algorithm may need to be tailored to fit available data in different electronic medical records, it contributes to the establishment of validated disease registries with the goal of enhancing research, and enabling quality improvement in clinical care and patient self-management.</p

    Estimating and comparing incidence and prevalence of chronic diseases by combining GP registry data: the role of uncertainty

    Get PDF
    Background: Estimates of disease incidence and prevalence are core indicators of public health. The manner in which these indicators stand out against each other provide guidance as to which diseases are most common and what health problems deserve priority. Our aim was to investigate how routinely collected data from different general practitioner registration networks (GPRNs) can be combined to estimate incidence and prevalence of chronic diseases and to explore the role of uncertainty when comparing diseases. Methods. Incidence and prevalence counts, specified by gender and age, of 18 chronic diseases from 5 GPRNs in the Netherlands from the year 2007 were used as input. Generalized linear mixed models were fitted with the GPRN identifier acting as random intercept, and age and gender as explanatory variables. Using predictions of the regression models we estimated the incidence and prevalence for 18 chronic diseases and calculated a stochastic ranking of diseases in terms of incidence and prevalence per 1,000. Results: Incidence was highest for coronary heart disease and prevalence was highest for diabetes if we looked at the point estimates. The between GPRN variance in general was higher for incidence than for prevalence. Since uncertainty intervals were wide for some diseases and overlapped, the ranking of diseases was subject to uncertainty. For incidence shifts in rank of up to twelve positions were observed. For prevalence, most diseases shifted maximally three or four places in rank. Conclusion: Estimates of incidence and prevalence can be obtained by combining data from GPRNs. Uncertainty in the estimates of absolute figures may lead to different rankings of diseases and, hence, should be taken into consideration when comparing disease incidences and prevalences

    The validity of using ICD-9 codes and pharmacy records to identify patients with chronic obstructive pulmonary disease

    Get PDF
    Background: Administrative data is often used to identify patients with chronic obstructive pulmonary disease (COPD), yet the validity of this approach is unclear. We sought to develop a predictive model utilizing administrative data to accurately identify patients with COPD. Methods: Sequential logistic regression models were constructed using 9573 patients with postbronchodilator spirometry at two Veterans Affairs medical centers (2003-2007). COPD was defined as: 1) FEV1/FVC <0.70, and 2) FEV1/FVC < lower limits of normal. Model inputs included age, outpatient or inpatient COPD-related ICD-9 codes, and the number of metered does inhalers (MDI) prescribed over the one year prior to and one year post spirometry. Model performance was assessed using standard criteria. Results: 4564 of 9573 patients (47.7%) had an FEV1/FVC < 0.70. The presence of ā‰„1 outpatient COPD visit had a sensitivity of 76% and specificity of 67%; the AUC was 0.75 (95% CI 0.74-0.76). Adding the use of albuterol MDI increased the AUC of this model to 0.76 (95% CI 0.75-0.77) while the addition of ipratropium bromide MDI increased the AUC to 0.77 (95% CI 0.76-0.78). The best performing model included: ā‰„6 albuterol MDI, ā‰„3 ipratropium MDI, ā‰„1 outpatient ICD-9 code, ā‰„1 inpatient ICD-9 code, and age, achieving an AUC of 0.79 (95% CI 0.78-0.80). Conclusion: Commonly used definitions of COPD in observational studies misclassify the majority of patients as having COPD. Using multiple diagnostic codes in combination with pharmacy data improves the ability to accurately identify patients with COPD.Department of Veterans Affairs, Health Services Research and Development (DHA), American Lung Association (CI- 51755-N) awarded to DHA, the American Thoracic Society Fellow Career Development AwardPeer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/84155/1/Cooke - ICD9 validity in COPD.pd

    Chronic disease prevalence from Italian administrative databases in the VALORE project: a validation through comparison of population estimates with general practice databases and national survey

    Get PDF
    BACKGROUND: Administrative databases are widely available and have been extensively used to provide estimates of chronic disease prevalence for the purpose of surveillance of both geographical and temporal trends. There are, however, other sources of data available, such as medical records from primary care and national surveys. In this paper we compare disease prevalence estimates obtained from these three different data sources. METHODS: Data from general practitioners (GP) and administrative transactions for health services were collected from five Italian regions (Veneto, Emilia Romagna, Tuscany, Marche and Sicily) belonging to all the three macroareas of the country (North, Center, South). Crude prevalence estimates were calculated by data source and region for diabetes, ischaemic heart disease, heart failure and chronic obstructive pulmonary disease (COPD). For diabetes and COPD, prevalence estimates were also obtained from a national health survey. When necessary, estimates were adjusted for completeness of data ascertainment. RESULTS: Crude prevalence estimates of diabetes in administrative databases (range: from 4.8% to 7.1%) were lower than corresponding GP (6.2%-8.5%) and survey-based estimates (5.1%-7.5%). Geographical trends were similar in the three sources and estimates based on treatment were the same, while estimates adjusted for completeness of ascertainment (6.1%-8.8%) were slightly higher. For ischaemic heart disease administrative and GP data sources were fairly consistent, with prevalence ranging from 3.7% to 4.7% and from 3.3% to 4.9%, respectively. In the case of heart failure administrative estimates were consistently higher than GPs' estimates in all five regions, the highest difference being 1.4% vs 1.1%. For COPD the estimates from administrative data, ranging from 3.1% to 5.2%, fell into the confidence interval of the Survey estimates in four regions, but failed to detect the higher prevalence in the most Southern region (4.0% in administrative data vs 6.8% in survey data). The prevalence estimates for COPD from GP data were consistently higher than the corresponding estimates from the other two sources. CONCLUSION: This study supports the use of data from Italian administrative databases to estimate geographic differences in population prevalence of ischaemic heart disease, treated diabetes, diabetes mellitus and heart failure. The algorithm for COPD used in this study requires further refinement

    Stable reference genes for the measurement of transcript abundance during larval caste development in the honeybee

    Get PDF
    Many genes are differentially regulated by caste development in the honeybee. Identifying and understanding these differences is key to discovering the mechanisms underlying this process. To identify these gene expression differences requires robust methods to measure transcript abundance. RT-qPCR is currently the gold standard to measure gene expression, but requires stable reference genes to compare gene expression changes. Such reference genes have not been established for honeybee caste development. Here, we identify and test potential reference genes that have stable expression throughout larval development between the two female castes. In this study, 15 candidate reference genes were examined to identify the most stable reference genes. Three algorithms (GeNorm, Bestkeeper and NormFinder) were used to rank the candidate reference genes based on their stability between the castes throughout larval development. Of these genes Ndufa8 (the orthologue of a component of complex one of the mitochondrial electron transport chain) and Pros54 (orthologous to a component of the 26S proteasome) were identified as being the most stable. When these two genes were used to normalise expression of two target genes (previously found to be differentially expressed between queen and worker larvae by microarray analysis) they were able to more accurately detect differential expression than two previously used reference genes (awd and RpL12). The identification of these novel reference genes will be of benefit to future studies of caste development in the honeybee
    • ā€¦
    corecore