41 research outputs found
Natural Language Processing Methods to Identify Oncology Patients at High Risk for Acute Care with Clinical Notes
Clinical notes are an essential component of a health record. This paper
evaluates how natural language processing (NLP) can be used to identify the
risk of acute care use (ACU) in oncology patients, once chemotherapy starts.
Risk prediction using structured health data (SHD) is now standard, but
predictions using free-text formats are complex. This paper explores the use of
free-text notes for the prediction of ACU instead of SHD. Deep Learning models
were compared to manually engineered language features. Results show that SHD
models minimally outperform NLP models; an l1-penalised logistic regression
with SHD achieved a C-statistic of 0.748 (95%-CI: 0.735, 0.762), while the same
model with language features achieved 0.730 (95%-CI: 0.717, 0.745) and a
transformer-based model achieved 0.702 (95%-CI: 0.688, 0.717). This paper shows
how language models can be used in clinical applications and underlines how
risk bias is different for diverse patient groups, even using only free-text
data.Comment: 11 pages, 6 figures, 2 table
Diagnosis Classification in the Emergency Room Using Natural Language Processing
Diagnosis classification in the emergency room (ER) is a complex task. We developed several natural language processing classification models, looking both at the full classification task of 132 diagnostic categories and at several clinically applicable samples consisting of two diagnoses that are hard to distinguish
Pseudocontinuous arterial spin labeling reveals dissociable effects of morphine and alcohol on regional cerebral blood flow
We have examined sensitivity and specificity of pseudocontinuous arterial spin labeling (PCASL) to detect global and regional changes in cerebral blood flow (CBF) in response to two different psychoactive drugs. We tested alcohol and morphine in a placebo-controlled, double-blind randomized study in 12 healthy young men. Drugs were administered intravenously. Validated pharmacokinetic protocols achieved minimal intersubject and intrasubject variance in plasma drug concentration. Permutation-based statistical testing of a mixed effect repeated measures model revealed a widespread increase in absolute CBF because of both morphine and alcohol. Conjunction analysis revealed overlapping effects of morphine and alcohol on absolute CBF in the left anterior cingulate, right hippocampus, right insula, and left primary sensorimotor areas. Effects of morphine and alcohol on relative CBF (obtained from z-normalization of absolute CBF maps) were significantly different in the left putamen, left frontoparietal network, cerebellum, and the brainstem. Corroborating previous PET results, our findings suggest that PCASL is a promising tool for central nervous system drug research
Predicting Depression Risk in Patients with Cancer Using Multimodal Data
When patients with cancer develop depression, it is often left untreated. We developed a prediction model for depression risk within the first month after starting cancer treatment using machine learning and Natural Language Processing (NLP) models. The LASSO logistic regression model based on structured data performed well, whereas the NLP model based on only clinician notes did poorly. After further validation, prediction models for depression risk could lead to earlier identification and treatment of vulnerable patients, ultimately improving cancer care and treatment adherence
Predicting Depression Risk in Patients With Cancer Using Multimodal Data: Algorithm Development Study
Background: Patients with cancer starting systemic treatment programs, such as chemotherapy, often develop depression. A prediction model may assist physicians and health care workers in the early identification of these vulnerable patients. Objective: This study aimed to develop a prediction model for depression risk within the first month of cancer treatment. Methods: We included 16,159 patients diagnosed with cancer starting chemo- or radiotherapy treatment between 2008 and 2021. Machine learning models (eg, least absolute shrinkage and selection operator [LASSO] logistic regression) and natural language processing models (Bidirectional Encoder Representations from Transformers [BERT]) were used to develop multimodal prediction models using both electronic health record data and unstructured text (patient emails and clinician notes). Model performance was assessed in an independent test set (n=5387, 33%) using area under the receiver operating characteristic curve (AUROC), calibration curves, and decision curve analysis to assess initial clinical impact use. Results: Among 16,159 patients, 437 (2.7%) received a depression diagnosis within the first month of treatment. The LASSO logistic regression models based on the structured data (AUROC 0.74, 95% CI 0.71-0.78) and structured data with email classification scores (AUROC 0.74, 95% CI 0.71-0.78) had the best discriminative performance. The BERT models based on clinician notes and structured data with email classification scores had AUROCs around 0.71. The logistic regression model based on email classification scores alone performed poorly (AUROC 0.54, 95% CI 0.52-0.56), and the model based solely on clinician notes had the worst performance (AUROC 0.50, 95% CI 0.49-0.52). Calibration was good for the logistic regression models, whereas the BERT models produced overly extreme risk estimates even after recalibration. There was a small range of decision thresholds for which the best-performing model showed promising clinical effectiveness use. The risks were underestimated for female and Black patients. Conclusions: The results demonstrated the potential and limitations of machine learning and multimodal models for predicting depression risk in patients with cancer. Future research is needed to further validate these models, refine the outcome label and predictors related to mental health, and address biases across subgroups
Brain Deep Medullary Veins on 7T MRI in Dutch-Type Hereditary Cerebral Amyloid Angiopathy
BACKGROUND: Deep medullary vein (DMV) changes occur in cerebral small vessel diseases (SVD) and in Alzheimer's disease. Cerebral amyloid angiopathy (CAA) is a common SVD that has a high co-morbidity with Alzheimer's disease. So far, DMVs have not been evaluated in CAA. OBJECTIVE: To evaluate DMVs in Dutch-type hereditary CAA (D-CAA) mutation carriers and controls, in relation to MRI markers associated with D-CAA. METHODS: Quantitative DMV parameters length, tortuosity, inhomogeneity, and density were quantified on 7 Tesla 3D susceptibility weighted MRI in pre-symptomatic D-CAA mutation carriers (nâ=â8), symptomatic D-CAA mutation carriers (nâ=â8), and controls (nâ=â25). Hemorrhagic MRI markers (cerebral microbleeds, intracerebral hemorrhages, cortical superficial siderosis, convexity subarachnoid hemorrhage), non-hemorrhagic MRI markers (white matter hyperintensities, enlarged perivascular spaces, lacunar infarcts, cortical microinfarcts), cortical grey matter perfusion, and diffusion tensor imaging parameters were assessed in D-CAA mutation carriers. Univariate general linear analysis was used to determine associations between DMV parameters and MRI markers. RESULTS: Quantitative DMV parameters length, tortuosity, inhomogeneity, and density did not differ between pre-symptomatic D-CAA mutation carriers, symptomatic D-CAA mutation carriers, and controls. No associations were found between DMV parameters and MRI markers associated with D-CAA. CONCLUSION: This study indicates that vascular amyloid-ÎČ deposition does not affect DMV parameters. In patients with CAA, DMVs do not seem to play a role in the pathogenesis of MRI markers associated with CAA
Availability of Evidence for Predictive Machine Learning Algorithms in Primary Care: A Systematic Review
Importance: The aging and multimorbid population and health personnel shortages pose a substantial burden on primary health care. While predictive machine learning (ML) algorithms have the potential to address these challenges, concerns include transparency and insufficient reporting of model validation and effectiveness of the implementation in the clinical workflow. Objectives: To systematically identify predictive ML algorithms implemented in primary care from peer-reviewed literature and US Food and Drug Administration (FDA) and Conformité Européene (CE) registration databases and to ascertain the public availability of evidence, including peer-reviewed literature, gray literature, and technical reports across the artificial intelligence (AI) life cycle. Evidence Review: PubMed, Embase, Web of Science, Cochrane Library, Emcare, Academic Search Premier, IEEE Xplore, ACM Digital Library, MathSciNet, AAAI.org (Association for the Advancement of Artificial Intelligence), arXiv, Epistemonikos, PsycINFO, and Google Scholar were searched for studies published between January 2000 and July 2023, with search terms that were related to AI, primary care, and implementation. The search extended to CE-marked or FDA-approved predictive ML algorithms obtained from relevant registration databases. Three reviewers gathered subsequent evidence involving strategies such as product searches, exploration of references, manufacturer website visits, and direct inquiries to authors and product owners. The extent to which the evidence for each predictive ML algorithm aligned with the Dutch AI predictive algorithm (AIPA) guideline requirements was assessed per AI life cycle phase, producing evidence availability scores. Findings: The systematic search identified 43 predictive ML algorithms, of which 25 were commercially available and CE-marked or FDA-approved. The predictive ML algorithms spanned multiple clinical domains, but most (27 [63%]) focused on cardiovascular diseases and diabetes. Most (35 [81%]) were published within the past 5 years. The availability of evidence varied across different phases of the predictive ML algorithm life cycle, with evidence being reported the least for phase 1 (preparation) and phase 5 (impact assessment) (19% and 30%, respectively). Twelve (28%) predictive ML algorithms achieved approximately half of their maximum individual evidence availability score. Overall, predictive ML algorithms from peer-reviewed literature showed higher evidence availability compared with those from FDA-approved or CE-marked databases (45% vs 29%). Conclusions and Relevance: The findings indicate an urgent need to improve the availability of evidence regarding the predictive ML algorithms' quality criteria. Adopting the Dutch AIPA guideline could facilitate transparent and consistent reporting of the quality criteria that could foster trust among end users and facilitating large-scale implementation
Plasma amyloidâbeta levels in a preâsymptomatic dutchâtype hereditary cerebral amyloid angiopathy pedigree: A crossâsectional and longitudinal investigation
Plasma amyloidâbeta (AÎČ) has long been investigated as a blood biomarker candidate for Cerebral Amyloid Angiopathy (CAA), however previous findings have been inconsistent which could be attributed to the use of less sensitive assays. This study investigates plasma AÎČ alterations between preâsymptomatic Dutchâtype hereditary CAA (DâCAA) mutationâcarriers (MC) and non-carriers (NC) using two AÎČ measurement platforms. Seventeen preâsymptomatic members of a Dâ CAA pedigree were assembled and followed up 3â4 years later (NC = 8;MC = 9). Plasma AÎČ1â40 and AÎČ1â42 were crossâsectionally and longitudinally analysed at baseline (T1) and followâup (T2) and were found to be lower in MCs compared to NCs, crossâsectionally after adjusting for covari-ates, at both T1(AÎČ1â40: p = 0.001; AÎČ1â42: p = 0.0004) and T2 (AÎČ1â40: p = 0.001; AÎČ1â42: p = 0.016) employing the Single Molecule Array (Simoa) platform, however no significant differences were observed using the xMAP platform. Further, pairwise longitudinal analyses of plasma AÎČ1â40 revealed decreased levels in MCs using data from the Simoa platform (p = 0.041) and pairwise longitudinal analyses of plasma AÎČ1â42 revealed decreased levels in MCs using data from the xMAP platform (p = 0.041). Findings from the Simoa platform suggest that plasma AÎČ may add value to a panel of biomarkers for the diagnosis of preâsymptomatic CAA, however, further validation studies in larger sample sets are required
Cerebral small vessel disease genomics and its implications across the lifespan
White matter hyperintensities (WMH) are the most common brain-imaging feature of cerebral small vessel disease (SVD), hypertension being the main known risk factor. Here, we identify 27 genome-wide loci for WMH-volume in a cohort of 50,970 older individuals, accounting for modification/confounding by hypertension. Aggregated WMH risk variants were associated with altered white matter integrity (pâ=â2.5Ă10-7) in brain images from 1,738 young healthy adults, providing insight into the lifetime impact of SVD genetic risk. Mendelian randomization suggested causal association of increasing WMH-volume with stroke, Alzheimer-type dementia, and of increasing blood pressure (BP) with larger WMH-volume, notably also in persons without clinical hypertension. Transcriptome-wide colocalization analyses showed association of WMH-volume with expression of 39 genes, of which four encode known drug targets. Finally, we provide insight into BP-independent biological pathways underlying SVD and suggest potential for genetic stratification of high-risk individuals and for genetically-informed prioritization of drug targets for prevention trials.Peer reviewe