25 research outputs found

    Kidney Age Index (KAI):A novel age-related biomarker to estimate kidney function in patients with diabetic kidney disease using machine learning

    Get PDF
    BACKGROUND AND OBJECTIVE: With aging, patients with diabetic kidney disease (DKD) show progressive decrease in kidney function. We investigated whether the deviation of biological age (BA) from the chronological age (CA) due to DKD can be used (denoted as Kidney Age Index; KAI) to quantify kidney function using machine learning algorithms. METHODS: Three large datasets were used in this study to develop KAI. The machine learning algorithms were trained on PREVEND dataset with healthy subjects (NĀ =Ā 7963) using 13 clinical markers to predict the CA. The trained model was then used to predict the BA of patients with DKD using RENAAL (NĀ =Ā 1451) and IDNT (NĀ =Ā 1706). The performance of four traditional machine learning algorithms were evaluated and the KAIĀ =Ā BA-CA was estimated for each patient. RESULTS: The neural network model achieved the best performance and predicted the CA of healthy subjects in PREVEND dataset with a mean absolute deviation (MAD)Ā =Ā 6.5Ā Ā±Ā 3.5 years and pearson correlationĀ =Ā 0.62. Patients with DKD showed a significant higher KAI of 15.4Ā Ā±Ā 11.8 years and 13.6Ā Ā±Ā 12.3 years in RENAAL and IDNT datasets, respectively. CONCLUSIONS: Our findings suggest that for a given CA, patients with DKD shows excess BA when compared to their healthy counterparts due to disease severity. With further improvement, the proposed KAI can be used as a complementary easy-to-interpret tool to give a more inclusive idea into disease state

    STQS:Interpretable multi-modal Spatial-Temporal-seQuential model for automatic Sleep scoring

    Get PDF
    Sleep scoring is an important step for the detection of sleep disorders and usually performed by visual analysis. Since manual sleep scoring is time consuming, machine-learning based approaches have been proposed. Though efficient, these algorithms are black-box in nature and difficult to interpret by clinicians. In this paper, we propose a deep learning architecture for multi-modal sleep scoring, investigate the model's decision making process, and compare the model's reasoning with the annotation guidelines in the AASM manual. Our architecture, called STQS, uses convolutional neural networks (CNN) to automatically extract spatio-temporal features from 3 modalities (EEG, EOG and EMG), a bidirectional long short-term memory (Bi-LSTM) to extract sequential information, and residual connections to combine spatio-temporal and sequential features. We evaluated our model on two large datasets, obtaining an accuracy of 85% and 77% and a macro F1 score of 79% and 73% on SHHS and an in-house dataset, respectively. We further quantify the contribution of various architectural components and conclude that adding LSTM layers improves performance over a spatio-temporal CNN, while adding residual connections does not. Our interpretability results show that the output of the model is well aligned with AASM guidelines, and therefore, the model's decisions correspond to domain knowledge. We also compare multi-modal models and single-channel models and suggest that future research should focus on improving multi-modal models

    A Classification Approach for Cancer Survivors from Those Cancer-Free, Based on Health Behaviors:Analysis of the Lifelines Cohort

    Get PDF
    Simple Summary Health behaviors affect health status in cancer survivors. We aimed to identify such key health behaviors using nonlinear algorithms and compare their classification performance with logistic regression, for distinguishing cancer survivors from those cancer-free in a population-based cohort. We used health behaviors and socioeconomic factors for analysis. Participants from the Lifelines population-based cohort were binary classified as cancer survivors or cancer-free using nonlinear algorithms or logistic regression. Data were collected for 107,624 cancer-free participants and 2760 cancer survivors. Using all variables, algorithms obtained an area under the receiver operator curve (AUC) of 0.75 +/- 0.01. Using only health behaviors, the algorithms differentiated cancer survivors from cancer-free participants with AUCs of 0.62 +/- 0.01 and 0.60 +/- 0.01, respectively. In the case-control analyses, both algorithms produced AUCs of 0.52 +/- 0.01. The main distinctive classifier was age. No key health behaviors were identified by linear and nonlinear algorithms to differentiate cancer survivors from cancer-free participants. Health behaviors affect health status in cancer survivors. We hypothesized that nonlinear algorithms would identify distinct key health behaviors compared to a linear algorithm and better classify cancer survivors. We aimed to use three nonlinear algorithms to identify such key health behaviors and compare their performances with that of a logistic regression for distinguishing cancer survivors from those without cancer in a population-based cohort study. We used six health behaviors and three socioeconomic factors for analysis. Participants from the Lifelines population-based cohort were binary classified into a cancer-survivors group and a cancer-free group using either nonlinear algorithms or logistic regression, and their performances were compared by the area under the curve (AUC). In addition, we performed case-control analyses (matched by age, sex, and education level) to evaluate classification performance only by health behaviors. Data were collected for 107,624 cancer free participants and 2760 cancer survivors. Using all variables resulted an AUC of 0.75 +/- 0.01, using only six health behaviors, the logistic regression and nonlinear algorithms differentiated cancer survivors from cancer-free participants with AUCs of 0.62 +/- 0.01 and 0.60 +/- 0.01, respectively. The main distinctive classifier was age. Though not relevant to classification, the main distinctive health behaviors were body mass index and alcohol consumption. In the case-control analyses, algorithms produced AUCs of 0.52 +/- 0.01. No key health behaviors were identified by linear and nonlinear algorithms to differentiate cancer survivors from cancer-free participants in this population-based cohort

    Predicting Deep Hypnotic State From Sleep Brain Rhythms Using Deep Learning:A Data-Repurposing Approach

    Get PDF
    BACKGROUND: Brain monitors tracking quantitative brain activities from electroencephalogram (EEG) to predict hypnotic levels have been proposed as a labor-saving alternative to behavioral assessments. Expensive clinical trials are required to validate any newly developed processed EEG monitor for every drug and combinations of drugs due to drug-specific EEG patterns. There is a need for an alternative, efficient, and economical method. METHODS: Using deep learning algorithms, we developed a novel data-repurposing framework to predict hypnotic levels from sleep brain rhythms. We used an online large sleep data set (5723 clinical EEGs) for training the deep learning algorithm and a clinical trial hypnotic data set (30 EEGs) for testing during dexmedetomidine infusion. Model performance was evaluated using accuracy and the area under the receiver operator characteristic curve (AUC). RESULTS: The deep learning model (a combination of a convolutional neural network and long short-term memory units) trained on sleep EEG predicted deep hypnotic level with an accuracy (95% confidence interval [CI]) = 81 (79.2-88.3)%, AUC (95% CI) = 0.89 (0.82-0.94) using dexmedetomidine as a prototype drug. We also demonstrate that EEG patterns during dexmedetomidine-induced deep hypnotic level are homologous to nonrapid eye movement stage 3 EEG sleep. CONCLUSIONS: We propose a novel method to develop hypnotic level monitors using large sleep EEG data, deep learning, and a data-repurposing approach, and for optimizing such a system for monitoring any given individual. We provide a novel data-repurposing framework to predict hypnosis levels using sleep EEG, eliminating the need for new clinical trials to develop hypnosis level monitors

    AI-Driven Model for Automatic Emphysema Detection in Low-Dose Computed Tomography Using Disease-Specific Augmentation

    Get PDF
    The objective of this study is to evaluate the feasibility of a disease-specific deep learning (DL) model based on minimum intensity projection (minIP) for automated emphysema detection in low-dose computed tomography (LDCT) scans. LDCT scans of 240 individuals from a population-based cohort in the Netherlands (ImaLife study, mean age Ā± SD = 57 Ā± 6 years) were retrospectively chosen for training and internal validation of the DL model. For independent testing, LDCT scans of 125 individuals from a lung cancer screening cohort in the USA (NLST study, mean age Ā± SD = 64 Ā± 5 years) were used. Dichotomous emphysema diagnosis based on radiologists' annotation was used to develop the model. The automated model included minIP processing (slab thickness range: 1 mm to 11 mm), classification, and detection maps generation. The data-split for the pipeline evaluation involved class-balanced and imbalanced settings. The proposed DL pipeline showed the highest performance (area under receiver operating characteristics curve) for 11 mm slab thickness in both the balanced (ImaLife = 0.90 Ā± 0.05) and the imbalanced dataset (NLST = 0.77 Ā± 0.06). For ImaLife subcohort, the variation in minIP slab thickness from 1 to 11 mm increased the DL model's sensitivity from 75 to 88% and decreased the number of false-negative predictions from 10 to 5. The minIP-based DL model can automatically detect emphysema in LDCTs. The performance of thicker minIP slabs was better than that of thinner slabs. LDCT can be leveraged for emphysema detection by applying disease specific augmentation

    Machine Learning based Early Prediction of End-stage Renal Disease in Patients with Diabetic Kidney Disease using Clinical Trials Data

    Get PDF
    AimTo predict endā€stage renal disease (ESRD) in patients with type 2 diabetes by using machineā€learning models with multiple baseline demographic and clinical characteristics.Materials and methodsIn total, 11ā€‰789 patients with type 2 diabetes and nephropathy from three clinical trials, RENAAL (n = 1513), IDNT (n = 1715) and ALTITUDE (n = 8561), were used in this study. Eighteen baseline demographic and clinical characteristics were used as predictors to train machineā€learning models to predict ESRD (doubling of serum creatinine and/or ESRD). We used the area under the receiver operator curve (AUC) to assess the prediction performance of models and compared this with traditional Cox proportional hazard regression and kidney failure risk equation models.ResultsThe feed forward neural network model predicted ESRD with an AUC of 0.82 (0.76ā€0.87), 0.81 (0.75ā€0.86) and 0.84 (0.79ā€0.90) in the RENAAL, IDNT and ALTITUDE trials, respectively. The feed forward neural network model selected urinary albumin to creatinine ratio, serum albumin, uric acid and serum creatinine as important predictors and obtained a stateā€ofā€theā€art performance for predicting longā€term ESRD.ConclusionsDespite large interā€patient variability, nonā€linear machineā€learning models can be used to predict longā€term ESRD in patients with type 2 diabetes and nephropathy using baseline demographic and clinical characteristics. The proposed method has the potential to create accurate and multiple outcome prediction automated models to identify highā€risk patients who could benefit from therapy in clinical practice.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/163629/2/dom14178.pdfhttp://deepblue.lib.umich.edu/bitstream/2027.42/163629/1/dom14178_am.pd

    Neonatal EEG classification using atomic decomposition

    Get PDF
    The electroencephalogram (EEG) is an important noninvasive tool used in the neonatal intensive care unit (NICU) for the neurologic evaluation of the sick newborn infant. It provides an excellent assessment of at-risk newborns and formulates a prognosis for long-term neurologic outcome.The automated analysis of neonatal EEG data in the NICU can provide valuable information to the clinician facilitating medical intervention. The aim of this thesis is to develop a system for automatic classification of neonatal EEG which can be mainly divided into two parts: (1) classification of neonatal EEG seizure from nonseizure, and (2) classifying neonatal background EEG into several grades based on the severity of the injury using atomic decomposition. Atomic decomposition techniques use redundant time-frequency dictionaries for sparse signal representations or approximations. The first novel contribution of this thesis is the development of a novel time-frequency dictionary coherent with the neonatal EEG seizure states. This dictionary was able to track the time-varying nature of the EEG signal. It was shown that by using atomic decomposition and the proposed novel dictionary, the neonatal EEG transition from nonseizure to seizure states could be detected efficiently. The second novel contribution of this thesis is the development of a neonatal seizure detection algorithm using several time-frequency features from the proposed novel dictionary. It was shown that the time-frequency features obtained from the atoms in the novel dictionary improved the seizure detection accuracy when compared to that obtained from the raw EEG signal. With the assistance of a supervised multiclass SVM classifier and several timefrequency features, several methods to automatically grade EEG were explored. In summary, the novel techniques proposed in this thesis contribute to the application of advanced signal processing techniques for automatic assessment of neonatal EEG recordings
    corecore