94,116 research outputs found
Recommended from our members
Are there valid proxy measures of clinical behaviour?
Background: Accurate measures of health professionals' clinical practice are critically important to guide health policy decisions, as well as for professional self-evaluation and for research-based investigation of clinical practice and process of care. It is often not feasible or ethical to measure behaviour through direct observation, and rigorous behavioural measures are difficult and costly to use. The aim of this review was to identify the current evidence relating to the relationships between proxy measures and direct measures of clinical behaviour. In particular, the accuracy of medical record review, clinician self-reported and patient-reported behaviour was assessed relative to directly observed behaviour.
Methods: We searched: PsycINFO; MEDLINE; EMBASE; CINAHL; Cochrane Central Register of Controlled Trials; science/social science citation index; Current contents (social & behavioural med/clinical med); ISI conference proceedings; and Index to Theses. Inclusion criteria: empirical, quantitative studies; and examining clinical behaviours. An independent, direct measure of behaviour (by standardised patient, other trained observer or by video/audio recording) was considered the 'gold standard' for comparison. Proxy measures of behaviour included: retrospective self-report; patient-report; or chart-review. All titles, abstracts, and full text articles retrieved by electronic searching were screened for inclusion and abstracted independently by two reviewers. Disagreements were resolved by discussion with a third reviewer where necessary.
Results: Fifteen reports originating from 11 studies met the inclusion criteria. The method of direct measurement was by standardised patient in six reports, trained observer in three reports, and audio/video recording in six reports. Multiple proxy measures of behaviour were compared in five of 15 reports. Only four of 15 reports used appropriate statistical methods to compare measures. Some direct measures failed to meet our validity criteria. The accuracy of patient report and chart review as proxy measures varied considerably across a wide range of clinical actions. The evidence for clinician self-report was inconclusive.
Conclusion: Valid measures of clinical behaviour are of fundamental importance to accurately identify gaps in care delivery, improve quality of care, and ultimately to improve patient care. However, the evidence base for three commonly used proxy measures of clinicians' behaviour is very limited. Further research is needed to better establish the methods of development, application, and analysis for a range of both direct and proxy measures of behaviour
Improving utility of brain tumor confocal laser endomicroscopy: objective value assessment and diagnostic frame detection with convolutional neural networks
Confocal laser endomicroscopy (CLE), although capable of obtaining images at
cellular resolution during surgery of brain tumors in real time, creates as
many non-diagnostic as diagnostic images. Non-useful images are often distorted
due to relative motion between probe and brain or blood artifacts. Many images,
however, simply lack diagnostic features immediately informative to the
physician. Examining all the hundreds or thousands of images from a single case
to discriminate diagnostic images from nondiagnostic ones can be tedious.
Providing a real-time diagnostic value assessment of images (fast enough to be
used during the surgical acquisition process and accurate enough for the
pathologist to rely on) to automatically detect diagnostic frames would
streamline the analysis of images and filter useful images for the
pathologist/surgeon. We sought to automatically classify images as diagnostic
or non-diagnostic. AlexNet, a deep-learning architecture, was used in a 4-fold
cross validation manner. Our dataset includes 16,795 images (8572 nondiagnostic
and 8223 diagnostic) from 74 CLE-aided brain tumor surgery patients. The ground
truth for all the images is provided by the pathologist. Average model accuracy
on test data was 91% overall (90.79 % accuracy, 90.94 % sensitivity and 90.87 %
specificity). To evaluate the model reliability we also performed receiver
operating characteristic (ROC) analysis yielding 0.958 average for the area
under ROC curve (AUC). These results demonstrate that a deeply trained AlexNet
network can achieve a model that reliably and quickly recognizes diagnostic CLE
images.Comment: SPIE Medical Imaging: Computer-Aided Diagnosis 201
Evaluating tools to support a new practical classification of diabetes: excellent control may represent misdiagnosis and omission from disease registers is associated with worse control.
To conduct a service evaluation of usability and utility on-line clinical audit tools developed as part of a UK Classification of Diabetes project to improve the categorisation and ultimately management of diabetes
Recommended from our members
School-Based Telemedicine Interventions for Asthma: A Systematic Review.
BackgroundSchool health systems are increasingly investing in telemedicine platforms to address acute and chronic illnesses. Asthma, the most common chronic illness in childhood, is of particular interest given its high burden on school absenteeism.ObjectiveConduct a systematic review evaluating impact of school-based telemedicine programs on improving asthma-related outcomes.Data sourcesPubMed, Cochrane CENTRAL, CINAHL, ERIC, PsycINFO, Embase, and Google Scholar.Study eligibility criteriaOriginal research, including quasi-experimental studies, without restriction on the type of telemedicine.ParticipantsSchool-aged pediatric patients with asthma and their families.InterventionsSchool-based telemedicine.Study appraisal and synthesis methodsTwo authors independently screened each abstract, conducted full-text review, assessed study quality, and extracted information. A third author resolved disagreements.ResultsOf 371 articles identified, 7 were included for the review. Outcomes of interest were asthma symptom-free days, asthma symptom frequency, quality of life, health care utilization, school absences, and spirometry. Four of 7 studies reported significant increases in symptom-free days and/or decrease in symptom frequency. Five of 6 reported increases in at least one quality-of-life metric, 2 of 7 reported a decrease in at least 1 health care utilization metric, 1 of 3 showed reductions in school absences, and 1 of 2 reported improvements in spirometry measures.LimitationsVariability in intervention designs and outcome measures make comparisons and quantitative analyses across studies difficult. Only 2 of 7 studies were randomized controlled trials.Conclusions and implications of key findingsHigh-quality evidence supporting the use of school-based telemedicine programs to improve patient outcomes is limited. While available evidence suggests benefit, only 2 comparative trials were identified, and the contribution of telemedicine to these studies' results is unclear
Screening and assessment tools for gaming disorder: A comprehensive systematic review
The inclusion of gaming disorder (GD) as an official diagnosis in the ICD-11 was a significant milestone for the field. However, the optimal measurement approaches for GD are currently unclear. This comprehensive systematic review aimed to identify and evaluate all available English-language GD tools and their corresponding evidence. A search of PsychINFO, PsychArticles, ScienceDirect, Scopus, Web of Science, and Google Scholar identified 32 tools employed in 320 studies (N = 462,249 participants). The evaluation framework examined tools in relation to: (1) conceptual and practical considerations; (2) alignment with DSM-5 and ICD-11 criteria; (3) type and quantity of studies and samples; and (4) psychometric properties. The evaluation showed that GD instrumentation has proliferated, with 2.5 tools, on average, published annually since 2013. Coverage of DSM-5 and ICD-11 criteria was inconsistent, especially for the criterion of continued use despite harm. Tools converge on the importance of screening for impaired control over gaming and functional impairment. Overall, no single tool was found to be clearly superior, but the AICA-Sgaming, GAS-7, IGDT-10, IGDS9-SF, and Lemmens IGD-9 scales had greater evidential support for their psychometric properties. The GD field would benefit from a standard international tool to identify gaming-related harms across the spectrum of maladaptive gaming behaviors.Peer reviewedFinal Accepted Versio
The influence of patient and doctor gender on diagnosing coronary heart disease
Using novel methods, this paper explores sources of uncertainty and gender bias in primary care doctors’ diagnostic decision making about coronary heart disease (CHD). Claims about gendered consultation styles and quality of care are re-examined, along with the adequacy of CHD models for women.
Randomly selected doctors in the UK and the US (n=112, 56 per country, stratified by gender) were shown standardised videotaped vignettes of actors portraying patients with CHD. ‘Patients’ age, gender, ethnicity and social class were varied systematically. During interviews, doctors gave free-recall accounts of their decision making, which were analysed to determine patient and doctor gender effects.
We found differences in male and female doctors’ responses to different types of patient information. Female doctors recall more patient cues overall, particularly about history presentation, and particularly amongst women. Male doctors appear less affected by patient gender but both male and especially female doctors take more account of male patients’ age and consider more age-related disease possibilities for men than women. Findings highlight the need for better integration of knowledge about female presentations within accepted CHD risk models, and do not support the contention that women receive better quality care from female doctors
The prodrome of autism: early behavioral and biological signs, regression, peri- and post-natal development and genetics
Autism is one of the most heritable neurodevelopmental conditions and has an early onset, with symptoms being required to be present in the first 3 years of life in order to meet criteria for the ‘core’ disorder in the classification systems. As such, the focus on identifying a prodrome over the past 20 years has been on pre-clinical signs or indicators that will be present very early in life, certainly in infancy. A number of novel lines of investigation have been used to this end, including retrospective coding of home videos, prospective population screening and ‘high risk’ sibling studies; as well as the investigation of pre- and peri-natal, brain developmental and other biological factors. Whilst no single prodromal sign is expected to be present in all cases, a picture is emerging of indicative prodromal signs in infancy and initial studies are being undertaken to attempt to ameliorate the early presentation and even ‘prevent’ emergence of the full syndrome
Influence of study design on digital pathology image quality evaluation : the need to define a clinical task
Despite the current rapid advance in technologies for whole slide imaging, there is still no scientific consensus on the recommended methodology for image quality assessment of digital pathology slides. For medical images in general, it has been recommended to assess image quality in terms of doctors’ success rates in performing a specific clinical task while using the images (clinical image quality, cIQ). However, digital pathology is a new modality, and already identifying the appropriate task is difficult. In an alternative common approach, humans are asked to do a simpler task such as rating overall image quality (perceived image quality, pIQ), but that involves the risk of nonclinically relevant findings due to an unknown relationship between the pIQ and cIQ. In this study, we explored three different experimental protocols: (1) conducting a clinical task (detecting inclusion bodies), (2) rating image similarity and preference, and (3) rating the overall image quality. Additionally, within protocol 1, overall quality ratings were also collected (task-aware pIQ). The experiments were done by diagnostic veterinary pathologists in the context of evaluating the quality of hematoxylin and eosin-stained digital pathology slides of animal tissue samples under several common image alterations: additive noise, blurring, change in gamma, change in color saturation, and JPG compression. While the size of our experiments was small and prevents drawing strong conclusions, the results suggest the need to define a clinical task. Importantly, the pIQ data collected under protocols 2 and 3 did not always rank the image alterations the
same as their cIQ from protocol 1, warning against using conventional pIQ to predict cIQ. At the same time, there was a correlation between the cIQ and task-aware pIQ ratings from protocol 1, suggesting that the clinical experiment context (set by specifying the clinical task) may affect human visual attention and bring focus to their criteria of image quality. Further research is needed to assess whether and for which purposes (e.g., preclinical testing) task-aware pIQ ratings could substitute cIQ for a given clinical task
- …