3,285 research outputs found
Automatic neonatal sleep stage classification:A comparative study
Sleep is an essential feature of living beings. For neonates, it is vital for their mental and physical development. Sleep stage cycling is an important parameter to assess neonatal brain and physical development. Therefore, it is crucial to administer newborn's sleep in the neonatal intensive care unit (NICU). Currently, Polysomnography (PSG) is used as a gold standard method for classifying neonatal sleep patterns, but it is expensive and requires a lot of human involvement. Over the last two decades, multiple researchers are working on automatic sleep stage classification algorithms using electroencephalography (EEG), electrocardiography (ECG), and video. In this study, we present a comprehensive review of existing algorithms for neonatal sleep, their limitations and future recommendations. Additionally, a brief comparison of the extracted features, classification algorithms and evaluation parameters is reported in the proposed study
Accuracy and Performance Comparison of Video Action Recognition Approaches
Over the past few years, there has been significant interest in video action
recognition systems and models. However, direct comparison of accuracy and
computational performance results remain clouded by differing training
environments, hardware specifications, hyperparameters, pipelines, and
inference methods. This article provides a direct comparison between fourteen
off-the-shelf and state-of-the-art models by ensuring consistency in these
training characteristics in order to provide readers with a meaningful
comparison across different types of video action recognition algorithms.
Accuracy of the models is evaluated using standard Top-1 and Top-5 accuracy
metrics in addition to a proposed new accuracy metric. Additionally, we compare
computational performance of distributed training from two to sixty-four GPUs
on a state-of-the-art HPC system.Comment: Accepted for publication at IEEE HPEC 202
Landmine detection using semi-supervised learning.
Landmine detection is imperative for the preservation of both military and civilian lives. While landmines are easy to place, they are relatively difficult to remove. The classic method of detecting landmines was by using metal-detectors. However, many present-day landmines are composed of little to no metal, necessitating the use of additional technologies. One of the most successful and widely employed technologies is Ground Penetrating Radar (GPR). In order to maximize efficiency of GPR-based landmine detection and minimize wasted effort caused by false alarms, intelligent detection methods such as machine learning are used. Many sophisticated algorithms are developed and employed to accomplish this. One such successful algorithm is K Nearest Neighbors (KNN) classification. Most of these algorithms, including KNN, are based on supervised learning, which requires labeling of known data. This process can be tedious. Semi-supervised learning leverages both labeled and unlabeled data in the training process, alleviating over-dependency on labeling. Semi-supervised learning has several advantages over supervised learning. For example, it applies well to large datasets because it uses the topology of unlabeled data to classify test data. Also, by allowing unlabeled data to influence classification, one set of training data can be adopted into varying test environments. In this thesis, we explore a graph-based learning method known as Label Propagation as an alternative classifier to KNN classification, and validate its use on vehicle-mounted and handheld GPR systems
DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning
We present DRLViz, a visual analytics interface to interpret the internal
memory of an agent (e.g. a robot) trained using deep reinforcement learning.
This memory is composed of large temporal vectors updated when the agent moves
in an environment and is not trivial to understand due to the number of
dimensions, dependencies to past vectors, spatial/temporal correlations, and
co-correlation between dimensions. It is often referred to as a black box as
only inputs (images) and outputs (actions) are intelligible for humans. Using
DRLViz, experts are assisted to interpret decisions using memory reduction
interactions, and to investigate the role of parts of the memory when errors
have been made (e.g. wrong direction). We report on DRLViz applied in the
context of video games simulators (ViZDoom) for a navigation scenario with item
gathering tasks. We also report on experts evaluation using DRLViz, and
applicability of DRLViz to other scenarios and navigation problems beyond
simulation games, as well as its contribution to black box models
interpretability and explainability in the field of visual analytics
The Utilization of Data Analysis Techniques in Predicting Student Performance in Massive Open Online Courses (MOOCs)
The growth of the Internet has enabled the popularity of open online learning platforms to increase over the years. This has led to the inception of Massive Open Online Courses (MOOCs) that enrol, millions of people, from all over the world. Such courses operate under the concept of open learning, where content does not have to be delivered via standard mechanisms that institutions employ, such as physically attending lectures. Instead learning occurs online via recorded lecture material and online tasks. This shift has allowed more people to gain access to education, regardless of their learning background. However, despite these advancements in delivering education, completion rates for MOOCs are low. In order to investigate this issue, the paper explores the impact that technology has on open learning and identifies how data about student performance can be captured to predict trend so that at risk students can be identified before they drop-out. In achieving this, subjects surrounding student engagement and performance in MOOCs and data analysis techniques are explored to investigate how technology can be used to address this issue. The paper is then concluded with our approach of predicting behaviour and a case study of the eRegister system, which has been developed to capture and analyse data.
Keywords: Open Learning; Prediction; Data Mining; Educational Systems; Massive Open Online Course; Data Analysi
- …