739 research outputs found

    A Machine Learning Trainable Model to Assess the Accuracy of Probabilistic Record Linkage

    Record linkage (RL) is the process of identifying and linking data that relate to the same physical entity across multiple heterogeneous data sources. Deterministic linkage methods rely on the presence of common uniquely identifying attributes across all sources, while probabilistic approaches use non-unique attributes and calculate similarity indexes for pairwise comparisons. A key component of record linkage is accuracy assessment, the process of manually verifying and validating matched pairs to further refine linkage parameters and increase overall effectiveness. This process, however, is time-consuming and impractical when applied to large administrative data sources where millions of records must be linked. Additionally, it is potentially biased, as the gold standard used is often the reviewer’s intuition. In this paper, we present an approach for assessing and refining the accuracy of probabilistic linkage based on different supervised machine learning methods (decision trees, naïve Bayes, logistic regression, random forest, linear support vector machines and gradient boosted trees). We used data sets extracted from large Brazilian socioeconomic and public health care data sources. These models were evaluated using receiver operating characteristic plots, sensitivity, specificity and positive predictive values collected from a 10-fold cross-validation method. Results show that logistic regression outperforms other classifiers and enables the creation of a generalized, very accurate model to validate linkage results.
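
As a rough illustration of the evaluation measures named in this abstract, the sketch below computes sensitivity, specificity and positive predictive value from confusion-matrix counts of matched pairs. The function name `linkage_metrics` and the counts are hypothetical and are not taken from the paper.

```python
def linkage_metrics(tp, fp, tn, fn):
    """Return (sensitivity, specificity, PPV) from confusion-matrix counts.

    tp: true matches classified as matches
    fp: non-matches classified as matches
    tn: non-matches classified as non-matches
    fn: true matches classified as non-matches
    """
    sensitivity = tp / (tp + fn)  # share of true matches that were found
    specificity = tn / (tn + fp)  # share of non-matches correctly rejected
    ppv = tp / (tp + fp)          # share of predicted matches that are real
    return sensitivity, specificity, ppv


# Hypothetical counts for one cross-validation fold (illustrative only).
sens, spec, ppv = linkage_metrics(tp=90, fp=5, tn=95, fn=10)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} ppv={ppv:.3f}")
```

In a 10-fold setting, one would presumably compute these per fold and then average them; the abstract does not state how the folds were aggregated.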

    Mining and analysis of audiology data to find significant factors associated with tinnitus masker

    Objectives: This research aims to identify the factors associated with tinnitus maskers, both from the literature and from the large amount of audiology data available from a large NHS (National Health Service, UK) hearing aid clinic. The factors evaluated were hearing impairment, age, gender, hearing aid type, mould and clinical comments. Design: The research comprises a literature survey of factors associated with tinnitus maskers and an analysis of audiology data using statistical and data mining techniques. Setting: The research uses a large audiology data set, although the data available for tinnitus were limited. Participants: It uses 1,316 records for tinnitus and other diagnoses, and 10,437 records of clinical comments from a hearing aid clinic. Primary and secondary outcome measures: The research looks for variables associated with tinnitus maskers; in future, these variables could be combined into a single model to develop a decision support system that makes predictions about tinnitus maskers for individual patients. Results: Tinnitus maskers are more likely to be fitted to individuals with milder forms of hearing loss, and the factors age, gender, type of hearing aid and mould were all found to be significantly associated with tinnitus maskers. In particular, patients aged 55 years or younger were more likely to wear a tinnitus masker, as were those with milder forms of hearing loss. ITE (in the ear) hearing aids were also found to be associated with tinnitus maskers. Feedback on the association between mould and tinnitus masker was obtained from a professional audiologist working in the NHS to better understand these results. The results were obtained with different accuracy for different techniques. For example, the chi-squared test results were obtained with 95% accuracy, while for Support and Confidence only those results with more than 1% Support and 80% Confidence were retained. Conclusions: The variables audiograms, age, gender, hearing aid type and mould were found to be associated with the choice of tinnitus masker, both in the literature and by using statistical and data mining techniques. Further work in this research would lead to the development of a decision support system for tinnitus maskers that explains how each decision was obtained.
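
The Support and Confidence thresholds mentioned in this abstract are the standard association-rule measures. A minimal sketch of how they are computed for a rule "antecedent implies consequent" follows; the toy records and item labels are invented for illustration and do not come from the study's data.

```python
def support_confidence(records, antecedent, consequent):
    """Support and confidence of the rule antecedent -> consequent.

    records: list of sets of items, one set per patient record
    antecedent, consequent: sets of items
    """
    n = len(records)
    # Records containing both sides of the rule.
    both = sum(1 for r in records if antecedent <= r and consequent <= r)
    # Records containing the antecedent at all.
    ante = sum(1 for r in records if antecedent <= r)
    support = both / n
    confidence = both / ante if ante else 0.0
    return support, confidence


# Hypothetical toy records (illustrative only, not the clinic's data).
records = [
    {"age<=55", "masker"},
    {"age<=55", "masker"},
    {"age<=55"},
    {"age>55"},
    {"age>55", "masker"},
]
support, confidence = support_confidence(records, {"age<=55"}, {"masker"})
print(f"support={support:.2f} confidence={confidence:.2f}")
```

Under the filter described in the abstract, a rule would be retained only if its support exceeded 0.01 and its confidence exceeded 0.80, so this toy rule would be discarded on confidence.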

    Statistical challenges in the development and evaluation of marker-based clinical tests

    Exciting new technologies for assessing markers in human specimens are now available to evaluate unprecedented types and numbers of variations in DNA, RNA, proteins, or biological structures such as chromosomes. These markers, whether viewed individually or collectively as a 'signature', have the potential to be useful for disease risk assessment, screening, early detection, prognosis, therapy selection, and monitoring for therapy effectiveness or disease recurrence. Successful translation from basic research findings to a clinically useful test requires basic, translational, and regulatory sciences and a collaborative effort among individuals with varied types of expertise, including laboratory scientists, technology developers, clinicians, statisticians, and bioinformaticians. The focus of this commentary is the many statistical challenges in translational marker research, specifically in the development and validation of marker-based tests that have clinical utility for therapeutic decision-making.

    Cough quality in children: a comparison of subjective vs. bronchoscopic findings

    BACKGROUND: Cough is the most common symptom presenting to doctors. The quality of cough (productive or wet vs dry) is used clinically as well as in epidemiology and clinical research. There are, however, no data on the validity of cough quality descriptors. The study aims were to compare (1) cough quality (wet/dry and brassy/non-brassy) to bronchoscopic findings of secretions and tracheomalacia respectively, and (2) parents' vs clinicians' evaluation of the cough quality (wet/dry). METHODS: Cough quality of children (without a known underlying respiratory disease) undergoing elective bronchoscopy was independently evaluated by clinicians and parents. A 'blinded' clinician scored the secretions seen at bronchoscopy on pre-determined criteria, graded 1 to 6. Kappa (K) statistics were used for agreement, and inter-rater and intra-rater agreement were examined on digitally recorded cough. A receiver operating characteristic (ROC) curve was used to determine if cough quality related to the amount of airway secretions present at bronchoscopy. RESULTS: Median age of the 106 children (62 boys, 44 girls) enrolled was 2.6 years (IQR 5.7). Parents' assessment of cough quality (wet/dry) agreed with clinicians' (K = 0.75, 95%CI 0.58–0.93). When compared to bronchoscopy (bronchoscopic secretion grade 4), clinicians' cough assessment had the highest sensitivity (0.75) and specificity (0.79) and was marginally better than the parents'. The area under the ROC curve was 0.85 (95%CI 0.77–0.92). Intra-observer (K = 1.0) and inter-clinician agreement for wet/dry cough (K = 0.88, 95%CI 0.82–0.94) was very good. Weighted K for inter-rater agreement for bronchoscopic secretion grades was 0.95 (95%CI 0.87–1). Sensitivity and specificity for brassy cough (for tracheomalacia) were 0.57 and 0.81 respectively. K for both intra- and inter-observer clinician agreement for brassy cough was 0.79 (95%CI 0.73–0.86). CONCLUSIONS: Dry and wet cough in children, as determined by clinicians and parents, has good clinical validity. Clinicians should, however, be cognisant that children with dry cough may have minimal to mild airway secretions. Brassy cough determined by respiratory physicians is highly specific for tracheomalacia.
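
The kappa (K) values reported above measure agreement beyond chance. The sketch below shows the unweighted Cohen's kappa for two raters, assuming that is the variant used for the wet/dry comparison (the abstract does not say). The ratings are invented for illustration and are not the study's data.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters labelling the same items."""
    n = len(rater_a)
    # Observed proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's label frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    # Undefined (division by zero) if both raters always use one same label.
    return (observed - expected) / (1 - expected)


# Hypothetical ratings of ten coughs (illustrative only).
parent = ["wet"] * 5 + ["dry"] * 5
clinician = ["wet"] * 4 + ["dry"] * 6
kappa = cohen_kappa(parent, clinician)
print(f"kappa={kappa:.2f}")
```

Here the raters agree on 9 of 10 items, chance agreement is 0.5, and kappa is therefore 0.8. The weighted kappa reported for secretion grades would additionally give partial credit for near-misses on the ordinal 1 to 6 scale.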

    Targeted physiotherapy for patellofemoral joint osteoarthritis: A protocol for a randomised, single-blind controlled trial

    Background: The patellofemoral joint (PFJ) is one compartment of the knee that is frequently affected by osteoarthritis (OA) and is a potent source of OA symptoms. However, there is a dearth of evidence for compartment-specific treatments for PFJ OA. Therefore, this project aims to evaluate whether a physiotherapy treatment, targeted to the PFJ, results in greater improvements in pain and physical function than a physiotherapy education intervention in people with symptomatic and radiographic PFJ OA. Methods: 90 people with PFJ OA (PFJ-specific history, signs and symptoms and radiographic evidence of PFJ OA) will be recruited from the community and randomly allocated into one of two treatments. A randomised controlled trial adhering to CONSORT guidelines will evaluate the efficacy of physiotherapy (8 individual sessions over 12 weeks, as well as a home exercise program 4 times/week) compared to a physiotherapist-delivered OA education control treatment (8 individual sessions over 12 weeks). Physiotherapy treatment will consist of (i) quadriceps muscle retraining; (ii) quadriceps and hip muscle strengthening; (iii) patellar taping; (iv) manual PFJ and soft tissue mobilisation; and (v) OA education. Resistance and dosage of exercises will be tailored to the participant's functional level and clinical state. Primary outcomes will be evaluated by a blinded examiner at baseline, 12 weeks and 9 months using validated and reliable pain, physical function and perceived global effect scales. All analyses will be conducted on an intention-to-treat basis using linear mixed regression models, including respective baseline scores as a covariate, subjects as a random effect, treatment condition as a fixed factor and the covariate by treatment interaction. Conclusion: This RCT is targeting PFJ OA, an important sub-group of knee OA patients, with a specifically designed conservative intervention. The project's outcome will influence PFJ OA rehabilitation, with the potential to reduce the personal and societal burden of this increasing public health problem. Trial Registration: Australia New Zealand Clinical Trials Registry ACTRN12608000288325

    The challenges faced in the design, conduct and analysis of surgical randomised controlled trials

    Randomised evaluations of surgical interventions are rare; some interventions have been widely adopted without rigorous evaluation. Unlike other medical areas, the randomised controlled trial (RCT) design has not become the default study design for the evaluation of surgical interventions. Surgical trials are difficult to successfully undertake and pose particular practical and methodological challenges. However, RCTs have played a role in the assessment of surgical innovations and there is scope and need for greater use. This article will consider the design, conduct and analysis of an RCT of a surgical intervention. The issues will be reviewed under three headings: the timing of the evaluation, defining the research question and trial design issues. Recommendations on the conduct of future surgical RCTs are made. Collaboration between research and surgical communities is needed to address the distinct issues raised by the assessment of surgical interventions and enable the conduct of appropriate and well-designed trials. The Health Services Research Unit is funded by the Scottish Government Health Directorates. Peer reviewed.

    Interactive decision support in hepatic surgery

    BACKGROUND: Hepatic surgery is characterized by complicated operations with a significant peri- and postoperative risk for the patient. We developed a web-based, highly granular research database for comprehensive documentation of all relevant variables to evaluate new surgical techniques. METHODS: To integrate this research system into the clinical setting, we designed an interactive decision support component. The objective is to provide relevant information for the surgeon and the patient to assess preoperatively the risk of a specific surgical procedure. Based on five established predictors of patient outcomes, the risk assessment tool searches for similar cases in the database and aggregates the information to estimate the risk for an individual patient. RESULTS: The physician can verify the analysis and manually exclude non-matching cases according to their expertise. The analysis is visualized by means of a Kaplan-Meier plot. To evaluate the decision support component, we analyzed data on 165 patients diagnosed with hepatocellular carcinoma (period 1996–2000). The similarity search yields a two-peak distribution, indicating that there are groups of similar patients as well as singular cases that differ considerably from the average. The results of the risk estimation are consistent with the observed survival data, but must be interpreted with caution because of the limited number of matching reference cases. CONCLUSION: Critical issues for the decision support system are clinical integration, a transparent and reliable knowledge base and user feedback.
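
The Kaplan-Meier plot mentioned in this abstract is built from the product-limit estimator. A minimal sketch of that estimator, applied to a handful of matched reference cases, is given below. The follow-up times and event flags are hypothetical, and the abstract does not describe how the system itself computes the curve.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times: follow-up durations (e.g. months)
    events: 1 if death was observed at that time, 0 if the case was censored
    Returns a list of (time, survival probability) at each death time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all cases that share the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        # Deaths and censored cases both leave the risk set.
        at_risk -= removed
    return curve


# Hypothetical matched reference cases (illustrative only).
curve = kaplan_meier([6, 12, 12, 20, 30], [1, 1, 0, 1, 0])
for t, s in curve:
    print(f"t={t:>2}  S(t)={s:.2f}")
```

With only five cases the estimate drops in large steps (0.80, 0.60, 0.30), which illustrates the abstract's caution about interpreting risk estimates drawn from a small number of matching reference cases.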

    Cave spiders choose optimal environmental factors with respect to the generated entropy when laying their cocoon

    For spiders, the choice of a suitable area in which to lay eggs is promoted in terms of Darwinian fitness. Despite its importance, the factors underlying this key decision are generally poorly understood. Here, we designed a multidisciplinary study based on both in-field data and laboratory experiments, focusing on the European cave spider Meta menardi (Araneae, Tetragnathidae) and aiming to understand the selective forces driving the female's choice of the depositional area. Our in-field data analysis demonstrated a major role of air velocity and distance from the cave entrance within a particular cave in driving the female's choice. This has been interpreted using a model based on the Entropy Generation Minimization (EGM) method, without invoking best-fit parameters and with the support of independent lab experiments, thus demonstrating that the female chooses the depositional area according to a minimal level of thermo-fluid-dynamic irreversibility. This methodology may pave the way to a novel approach to understanding evolutionary strategies for other living organisms.