19 research outputs found
Deep transfer learning for improving single-EEG arousal detection
Datasets in sleep science present challenges for machine learning algorithms
due to differences in recording setups across clinics. We investigate two deep
transfer learning strategies for overcoming the channel mismatch problem for
cases where two datasets do not contain exactly the same setup leading to
degraded performance in single-EEG models. Specifically, we train a baseline
model on multivariate polysomnography data and subsequently replace the first
two layers to prepare the architecture for single-channel
electroencephalography data. Using a fine-tuning strategy, our model yields
similar performance to the baseline model (F1=0.682 and F1=0.694,
respectively), and was significantly better than a comparable single-channel
model. Our results are promising for researchers working with small databases
who wish to use deep learning models pre-trained on larger databases.Comment: Accepted for presentation at EMBC202
Deep residual networks for automatic sleep stage classification of raw polysomnographic waveforms
We have developed an automatic sleep stage classification algorithm based on
deep residual neural networks and raw polysomnogram signals. Briefly, the raw
data is passed through 50 convolutional layers before subsequent classification
into one of five sleep stages. Three model configurations were trained on 1850
polysomnogram recordings and subsequently tested on 230 independent recordings.
Our best performing model yielded an accuracy of 84.1% and a Cohen's kappa of
0.746, improving on previous reported results by other groups also using only
raw polysomnogram data. Most errors were made on non-REM stage 1 and 3
decisions, errors likely resulting from the definition of these stages. Further
testing on independent cohorts is needed to verify performance for clinical
use
Towards a Flexible Deep Learning Method for Automatic Detection of Clinically Relevant Multi-Modal Events in the Polysomnogram
Much attention has been given to automatic sleep staging algorithms in past
years, but the detection of discrete events in sleep studies is also crucial
for precise characterization of sleep patterns and possible diagnosis of sleep
disorders. We propose here a deep learning model for automatic detection and
annotation of arousals and leg movements. Both of these are commonly seen
during normal sleep, while an excessive amount of either is linked to disrupted
sleep patterns, excessive daytime sleepiness impacting quality of life, and
various sleep disorders. Our model was trained on 1,485 subjects and tested on
1,000 separate recordings of sleep. We tested two different experimental setups
and found optimal arousal detection was attained by including a recurrent
neural network module in our default model with a dynamic default event window
(F1 = 0.75), while optimal leg movement detection was attained using a static
event window (F1 = 0.65). Our work show promise while still allowing for
improvements. Specifically, future research will explore the proposed model as
a general-purpose sleep analysis model.Comment: Accepted for publication in 41st International Engineering in
Medicine and Biology Conference (EMBC), July 23-27, 201
Automatic sleep stage classification with deep residual networks in a mixed-cohort setting
Study Objectives: Sleep stage scoring is performed manually by sleep experts
and is prone to subjective interpretation of scoring rules with low intra- and
interscorer reliability. Many automatic systems rely on few small-scale
databases for developing models, and generalizability to new datasets is thus
unknown. We investigated a novel deep neural network to assess the
generalizability of several large-scale cohorts.
Methods: A deep neural network model was developed using 15684
polysomnography studies from five different cohorts. We applied four different
scenarios: 1) impact of varying time-scales in the model; 2) performance of a
single cohort on other cohorts of smaller, greater or equal size relative to
the performance of other cohorts on a single cohort; 3) varying the fraction of
mixed-cohort training data compared to using single-origin data; and 4)
comparing models trained on combinations of data from 2, 3, and 4 cohorts.
Results: Overall classification accuracy improved with increasing fractions
of training data (0.25: 0.782 0.097, 95 CI [0.777-0.787];
100: 0.869 0.064, 95 CI [0.864-0.872]), and with increasing
number of data sources (2: 0.788 0.102, 95 CI [0.787-0.790]; 3: 0.808
0.092, 95 CI [0.807-0.810]; 4: 0.821 0.085, 95 CI
[0.819-0.823]). Different cohorts show varying levels of generalization to
other cohorts.
Conclusions: Automatic sleep stage scoring systems based on deep learning
algorithms should consider as much data as possible from as many sources
available to ensure proper generalization. Public datasets for benchmarking
should be made available for future research.Comment: Author's original version. This article has been accepted for
publication in SLEEP published by Oxford University Pres
Automatic Detection of Cortical Arousals in Sleep and their Contribution to Daytime Sleepiness
Cortical arousals are transient events of disturbed sleep that occur
spontaneously or in response to stimuli such as apneic events. The gold
standard for arousal detection in human polysomnographic recordings (PSGs) is
manual annotation by expert human scorers, a method with significant
interscorer variability. In this study, we developed an automated method, the
Multimodal Arousal Detector (MAD), to detect arousals using deep learning
methods. The MAD was trained on 2,889 PSGs to detect both cortical arousals and
wakefulness in 1 second intervals. Furthermore, the relationship between
MAD-predicted labels on PSGs and next day mean sleep latency (MSL) on a
multiple sleep latency test (MSLT), a reflection of daytime sleepiness, was
analyzed in 1447 MSLT instances in 873 subjects. In a dataset of 1,026 PSGs,
the MAD achieved a F1 score of 0.76 for arousal detection, while wakefulness
was predicted with an accuracy of 0.95. In 60 PSGs scored by multiple human
expert technicians, the MAD significantly outperformed the average human scorer
for arousal detection with a difference in F1 score of 0.09. After controlling
for other known covariates, a doubling of the arousal index was associated with
an average decrease in MSL of 40 seconds ( = -0.67, p = 0.0075). The MAD
outperformed the average human expert and the MAD-predicted arousals were shown
to be significant predictors of MSL, which demonstrate clinical validity the
MAD.Comment: 40 pages, 13 figures, 9 table
A comparative study of methods for automatic detection of rapid eye movement abnormal muscular activity in narcolepsy
Objective: To evaluate rapid eye movement (REM) muscular activity in narcolepsy by applying five algorithms to electromyogram (EMG) recordings, and to investigate its value for narcolepsy diagnosis. Patients/methods: A modified version of phasic EMG metric (mPEM), muscle activity index (MAI), REM atonia index (RAI), supra-threshold REM EMG activit ymetric (STREAM), and Frandsen method (FR) were calculated from polysomnography recordings of 20 healthy controls, 18 clinic controls (subjects suspected with narcolepsy but finally diagnosed without any sleep abnormality), 16 narcolepsy type 1 without REM sleep behavior disorder (RBD), 9 narcolepsy type 1 with RBD, and 18 narcolepsy type 2. Diagnostic value of metrics in differentiating between groups was quantified by area under the receiver operating characteristic curve (AUC). Correlations among the metrics and cerebrospinal fluid hypocretin-1 (CSF-hcrt-1) values were calculated using linear models. Results: All metrics excluding STREAM found significantly higher muscular activity in narcolepsy 1 cases versus controls (p<0.05). Moreover, RAI showed high sensitivity in the detection of RBD. The mPEM achieved the highest AUC in differentiating healthy controls from narcoleptic subjects. The RAI best differentiated between narcolepsy 1 and 2. Lower CSF-hcrt-1 values correlated with high muscular activity quantified by mPEM, sMAI, lMAI, PEM and FR (p<0.05). Conclusions: This automatic analysis showed higher number of muscle activations in narcolepsy 1 compared to controls. This finding might play a supportive role in diagnosing narcolepsy and in discriminating narcolepsy subtypes. Moreover, the negative correlation between CSF-hcrt-1 level and REM muscular activity supported a role for hypocretin in the control of motor tone during REM sleep