96 research outputs found
Auxiliary Learning for Self-Supervised Video Representation via Similarity-based Knowledge Distillation
Despite the outstanding success of self-supervised pretraining methods for
video representation learning, they generalise poorly when the unlabeled
dataset for pretraining is small or the domain difference between unlabelled
data in source task (pretraining) and labeled data in target task (finetuning)
is significant. To mitigate these issues, we propose a novel approach to
complement self-supervised pretraining via an auxiliary pretraining phase,
based on knowledge similarity distillation, auxSKD, for better generalisation
with a significantly smaller amount of video data, e.g. Kinetics-100 rather
than Kinetics-400. Our method deploys a teacher network that iteratively
distills its knowledge to the student model by capturing the similarity
information between segments of unlabelled video data. The student model
meanwhile solves a pretext task by exploiting this prior knowledge. We also
introduce a novel pretext task, Video Segment Pace Prediction or VSPP, which
requires our model to predict the playback speed of a randomly selected segment
of the input video to provide more reliable self-supervised representations.
Our experimental results show superior results to the state of the art on both
UCF101 and HMDB51 datasets when pretraining on K100 in apple-to-apple
comparisons. Additionally, we show that our auxiliary pretraining, auxSKD, when
added as an extra pretraining phase to recent state of the art self-supervised
methods (i.e. VCOP, VideoPace, and RSPNet), improves their results on UCF101
and HMDB51. Our code is available at https://github.com/Plrbear/auxSKD
PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment
The limited availability of labelled data in Action Quality Assessment (AQA),
has forced previous works to fine-tune their models pretrained on large-scale
domain-general datasets. This common approach results in weak generalisation,
particularly when there is a significant domain shift. We propose a novel,
parameter efficient, continual pretraining framework, PECoP, to reduce such
domain shift via an additional pretraining stage. In PECoP, we introduce
3D-Adapters, inserted into the pretrained model, to learn spatiotemporal,
in-domain information via self-supervised learning where only the adapter
modules' parameters are updated. We demonstrate PECoP's ability to enhance the
performance of recent state-of-the-art methods (MUSDL, CoRe, and TSA) applied
to AQA, leading to considerable improvements on benchmark datasets, JIGSAWS
(), MTL-AQA (), and FineDiving
(). We also present a new Parkinson's Disease dataset, PD4T, of
real patients performing four various actions, where we surpass
() the state-of-the-art in comparison. Our code, pretrained
models, and the PD4T dataset are available at https://github.com/Plrbear/PECoP.Comment: Accepted to WACV 2024 (preprint
Multimodal Indoor Localisation for Measuring Mobility in Parkinson's Disease using Transformers
17 pages, 1 figure, 3 tablesPreprin
Comparison of Test Your Memory and Montreal Cognitive Assessment Measures in Parkinsonās Disease
Background. MoCA is widely used in Parkinsonās disease (PD) to assess cognition. The Test Your Memory (TYM) test is a cognitive screening tool that is self-administered. Objectives. We sought to determine (a) the optimal value of TYM to discriminate between PD patients with and without cognitive deficits on MoCA testing, (b) equivalent MoCA and TYM scores, and (c) interrater reliability in TYM testing. Methods. We assessed the discriminant ability of TYM and the equivalence between TYM and MoCA scores and measured the interrater reliability between three raters. Results. Of the 135 subjects that completed both tests, 55% had cognitive impairment according to MoCA. A MoCA score of 25 was equivalent to a TYM score of 43-44. The area under the receiver operator characteristic (ROC) curve for TYM to differentiate between PD-normal and PD-cognitive impairment was 0.82 (95% CI 0.75 to 0.89). The optimal cutoff to distinguish PD-cognitive impairment from PD-normal was ā¤45 (sensitivity 90.5%, specificity 59%) thereby correctly classifying 76.3% of patients with PD-cognitive impairment. Interrater agreement was high (0.97) and TYM was completed in under 7 minutes (interquartile range 5.33 to 8.52 minutes). Conclusions. The TYM test is a useful and less resource intensive screening test for cognitive deficits in PD
A Time Series Approach to Parkinson's Disease Classification from EEG
Firstly, we present a novel representation for EEG data, a 7-variate series
of band power coefficients, which enables the use of (previously inaccessible)
time series classification methods. Specifically, we implement the
multi-resolution representation-based time series classification method MrSQL.
This is deployed on a challenging early-stage Parkinson's dataset that includes
wakeful and sleep EEG. Initial results are promising with over 90% accuracy
achieved on all EEG data types used. Secondly, we present a framework that
enables high-importance data types and brain regions for classification to be
identified. Using our framework, we find that, across different EEG data types,
it is the Prefrontal brain region that has the most predictive power for the
presence of Parkinson's Disease. This outperformance was statistically
significant versus ten of the twelve other brain regions (not significant
versus adjacent Left Frontal and Right Frontal regions). The Prefrontal region
of the brain is important for higher-order cognitive processes and our results
align with studies that have shown neural dysfunction in the prefrontal cortex
in Parkinson's Disease
Multimodal Classification of Parkinson's Disease in Home Environments with Resiliency to Missing Modalities
Parkinsonās disease (PD) is a chronic neurodegenerative condition that affects a patientās everyday life. Authors have proposed that a machine learning and sensor-based approach that continuously monitors patients in naturalistic settings can provide constant evaluation of PD and objectively analyse its progression. In this paper, we make progress toward such PD evaluation by presenting a multimodal deep learning approach for discriminating between people with PD and without PD. Specifically, our proposed architecture, named MCPD-Net, uses two data modalities, acquired from vision and accelerometer sensors in a home environment to train variational autoencoder (VAE) models. These are modality-specific VAEs that predict effective representations of human movements to be fused and given to a classification module. During our end-to-end training, we minimise the difference between the latent spaces corresponding to the two data modalities. This makes our method capable of dealing with missing modalities during inference. We show that our proposed multimodal method outperforms unimodal and other multimodal approaches by an average increase in F1-score of 0.25 and 0.09, respectively, on a data set with real patients. We also show that our method still outperforms other approaches by an average increase in F1-score of 0.17 when a modality is missing during inference, demonstrating the benefit of training on multiple modalities
- ā¦