Search CORE

6,793 research outputs found

Human Action Recognition with RGB-D Sensors

Author
Publication venue: InTech Open
Publication date: 12/07/2017
Field of study

none3noHuman action recognition, also known as HAR, is at the foundation of many different applications related to behavioral analysis, surveillance, and safety, thus it has been a very active research area in the last years. The release of inexpensive RGB-D sensors fostered researchers working in this field because depth data simplify the processing of visual data that could be otherwise difficult using classic RGB devices. Furthermore, the availability of depth data allows to implement solutions that are unobtrusive and privacy preserving with respect to classic video-based analysis. In this scenario, the aim of this chapter is to review the most salient techniques for HAR based on depth signal processing, providing some details on a specific method based on temporal pyramid of key poses, evaluated on the well-known MSR Action3D dataset.Cippitelli, Enea; Gambi, Ennio; Spinsante, SusannaCippitelli, Enea; Gambi, Ennio; Spinsante, Susann

IRIS UniversitÃ Politecnica delle Marche

Human Action Recognition with RGB-D Sensors

Author: Cippitelli Enea
Gambi Ennio
Spinsante Susanna
Publication venue: 'IntechOpen'
Publication date: 01/01/2017
Field of study

Human action recognition, also known as HAR, is at the foundation of many different applications related to behavioral analysis, surveillance, and safety, thus it has been a very active research area in the last years. The release of inexpensive RGB-D sensors fostered researchers working in this field because depth data simplify the processing of visual data that could be otherwise difficult using classic RGB devices. Furthermore, the availability of depth data allows to implement solutions that are unobtrusive and privacy preserving with respect to classic video-based analysis. In this scenario, the aim of this chapter is to review the most salient techniques for HAR based on depth signal processing, providing some details on a specific method based on temporal pyramid of key poses, evaluated on the well-known MSR Action3D dataset

IntechOpen

Crossref

IRIS UniversitÃ Politecnica delle Marche

Home monitoring for frailty detection through sound and speaker diarization analysis

Author: Boudy Jérôme
Boutamine Sami
Chollet Gérard
Istrate Dan
Petitpont Frédéric
Tevissen Yannis
Zalc Vincent
Publication venue
Publication date: 17/08/2023
Field of study

As the French, European and worldwide populations are aging, there is a strong interest for new systems that guarantee a reliable and privacy preserving home monitoring for frailty prevention. This work is a part of a global environmental audio analysis system which aims to help identification of Activities of Daily Life (ADL) through human and everyday life sounds recognition, speech presence and number of speakers detection. The focus is made on the number of speakers detection. In this article, we present how recent advances in sound processing and speaker diarization can improve the existing embedded systems. We study the performances of two new methods and discuss the benefits of DNN based approaches which improve performances by about 100%.Comment: JETSAN, Jun 2023, Aubervilliers & Paris, Franc

arXiv.org e-Print Archive

Sound environment analysis in smart home

Author: Boudy Jérôme
Dorizzi Bernadette
Istrate Dan
Lecouteux Benjamin
Portet François
Sehili Mohamed El Amine
Vacher Michel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/11/2012
Field of study

International audienceThis study aims at providing audio-based interaction technology that lets the users have full control over their home environment, at detecting distress situations and at easing the social inclusion of the elderly and frail population. The paper presents the sound and speech analysis system evaluated thanks to a corpus of data acquired in a real smart home environment. The 4 steps of analysis are signal detection, speech/sound discrimination, sound classification and speech recognition. The results are presented for each step and globally. The very first experiments show promising results be it for the modules evaluated independently or for the whole system

Hal - Université Grenoble Alpes

Emerging technologies for learning (volume 1)

Author: Anderson Paul
Cych Leon
Philpott Michael
Sharpe Bill
Stead Geoff
Publication venue: British Educational Communications and Technology Agency (BECTA)
Publication date: 01/01/2006
Field of study

Collection of 5 articles on emerging technologies and trend

Digital Education Resource Archive

AudioPairBank: Towards A Large-Scale Tag-Pair-Based Audio Content Analysis

Author: Borth Damian
Elizalde Benjamin
Lane Ian
Raj Bhiksha
Sager Sebastian
Schulze Christian
Publication venue
Publication date: 08/01/2018
Field of study

Recently, sound recognition has been used to identify sounds, such as car and river. However, sounds have nuances that may be better described by adjective-noun pairs such as slow car, and verb-noun pairs such as flying insects, which are under explored. Therefore, in this work we investigate the relation between audio content and both adjective-noun pairs and verb-noun pairs. Due to the lack of datasets with these kinds of annotations, we collected and processed the AudioPairBank corpus consisting of a combined total of 1,123 pairs and over 33,000 audio files. One contribution is the previously unavailable documentation of the challenges and implications of collecting audio recordings with these type of labels. A second contribution is to show the degree of correlation between the audio content and the labels through sound recognition experiments, which yielded results of 70% accuracy, hence also providing a performance benchmark. The results and study in this paper encourage further exploration of the nuances in audio and are meant to complement similar research performed on images and text in multimedia analysis.Comment: This paper is a revised version of "AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis

arXiv.org e-Print Archive

Directory of Open Access Journals

An Analysis of Audio Features to Develop a Human Activity Recognition Model Using Genetic Algorithms, Random Forests, and Neural Networks

Author: Celaya Padilla José María
Delgado Contreras Juan Rubén
Galván Tejada Carlos Eric
Galván Tejada Jorge
Gamboa Rosales Hamurabi
Garza Veloz Idalia
López Hernández Yamilé
Magallanes Quintanar Rafael
Martínez Fierro Margarita de la Luz
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2016
Field of study

This work presents a human activity recognition (HAR) model based on audio features. The use of sound as an information source for HAR models represents a challenge because sound wave analyses generate very large amounts of data. However, feature selection techniques may reduce the amount of data required to represent an audio signal sample. Some of the audio features that were analyzed include Mel-frequency cepstral coefficients (MFCC). Although MFCC are commonly used in voice and instrument recognition, their utility within HAR models is yet to be confirmed, and this work validates their usefulness. Additionally, statistical features were extracted from the audio samples to generate the proposed HAR model. The size of the information is necessary to conform a HAR model impact directly on the accuracy of the model. This problem also was tackled in the present work; our results indicate that we are capable of recognizing a human activity with an accuracy of 85% using the HAR model proposed. This means that minimum computational costs are needed, thus allowing portable devices to identify human activities using audio as an information source

Directory of Open Access Journals

Caxcan Repositorio Institucional de la Universidad Autónoma de Zacatecas