CirdoX: an On/Off-line Multisource Speech and Sound Analysis Software
Vocal User Interfaces in domestic environments have recently gained interest in the speech processing community. This interest is due to the opportunity of using them in the framework of Ambient Assisted Living, both for home automation (vocal commands) and for calls for help in distress situations, e.g. after a fall. CIRDOX, a modular piece of software, is able to analyse the audio environment of a home online, to extract the uttered sentences, and then to process them with an ASR module. Moreover, the system performs non-speech audio event classification; in this case, specific models must be trained. The software is designed to be modular and to process the multichannel audio stream on-line. Some examples of studies in which CIRDOX was involved are described; they were carried out in a real environment, namely a living-lab environment. Keywords: audio and speech processing, natural language and multimodal interactions, Ambient Assisted Living (AAL)
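The front end of a pipeline like the one this abstract describes (segment the audio stream into events, then hand speech segments to ASR or non-speech segments to a sound classifier) can be sketched with simple frame-energy detection. This is a minimal illustrative sketch, not CIRDOX's actual algorithm; the frame length, threshold, and function names are assumptions.

```python
import numpy as np

def frame_energy(signal, frame_len=512):
    """Split a mono signal into fixed-size frames and compute per-frame mean energy."""
    n = len(signal) // frame_len
    frames = signal[: n * frame_len].reshape(n, frame_len)
    return (frames ** 2).mean(axis=1)

def detect_events(signal, frame_len=512, threshold=0.01):
    """Return (start, end) frame indices of segments whose energy exceeds
    the threshold -- a crude stand-in for an audio event detection stage.
    Each detected segment would then be routed to ASR (speech) or to a
    trained non-speech sound classifier."""
    active = frame_energy(signal, frame_len) > threshold
    events, start = [], None
    for i, is_active in enumerate(active):
        if is_active and start is None:
            start = i
        elif not is_active and start is not None:
            events.append((start, i))
            start = None
    if start is not None:
        events.append((start, len(active)))
    return events
```

A real on-line system would apply this per channel of the multichannel stream and use an adaptive noise floor rather than a fixed threshold.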
Tackling fear of falling through a peer sharing strategy
A large Irish research programme, Technology Research for Independent Living (TRIL [1]), worked closely with a fallers' expert panel of 7 older people (aged 70-84 yrs) to produce a series of short audio-visual discussion programmes on falling and fear of falling. These added to the daily broadcasts featured within a TRIL pilot study exploring the use of home-based advanced telephone technology to ameliorate social isolation (Wherton and Prendergast 2009 [2])
Meta-analyses support a taxonomic model for representations of different categories of audio-visual interaction events in the human brain
Our ability to perceive meaningful action events involving objects, people and other animate agents is characterized in part by an interplay of visual and auditory sensory processing and their cross-modal interactions. However, this multisensory ability can be altered or dysfunctional in some hearing and sighted individuals, and in some clinical populations. The present meta-analysis sought to test current hypotheses regarding neurobiological architectures that may mediate audio-visual multisensory processing. Reported coordinates from 82 neuroimaging studies (137 experiments) that revealed some form of audio-visual interaction in discrete brain regions were compiled, converted to a common coordinate space, and then organized along specific categorical dimensions to generate activation likelihood estimate (ALE) brain maps and various contrasts of those derived maps. The results revealed brain regions (cortical “hubs”) preferentially involved in multisensory processing along different stimulus category dimensions, including (1) living versus non-living audio-visual events, (2) audio-visual events involving vocalizations versus actions by living sources, (3) emotionally valent events, and (4) dynamic-visual versus static-visual audio-visual stimuli. These meta-analysis results are discussed in the context of neurocomputational theories of semantic knowledge representations and perception, and the brain volumes of interest are available for download to facilitate data interpretation for future neuroimaging studies
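The core of the activation likelihood estimation (ALE) method mentioned above is to model each study's reported foci as Gaussian probability distributions and combine the per-study maps as a probabilistic union. A toy 1-D sketch follows; real ALE operates on 3-D brain volumes with sample-size-dependent kernel widths, so the grid, sigma, and function names here are illustrative assumptions.

```python
import numpy as np

def ale_map(foci_per_study, grid, sigma=1.0):
    """Toy 1-D activation likelihood estimate.

    Each study's foci are modeled as Gaussians on the grid; the per-study
    modeled activation (MA) maps are combined as the probabilistic union
    ALE = 1 - prod_i(1 - MA_i), so a point supported by many studies
    approaches 1."""
    one_minus = np.ones_like(grid, dtype=float)
    for foci in foci_per_study:
        ma = np.zeros_like(grid, dtype=float)
        for focus in foci:
            gauss = np.exp(-((grid - focus) ** 2) / (2 * sigma ** 2))
            ma = np.maximum(ma, gauss)  # this study's modeled activation map
        one_minus *= (1.0 - ma)
    return 1.0 - one_minus
```

Contrasting ALE maps computed for different stimulus categories (e.g. living vs. non-living events) is what yields the category-preferential "hubs" the abstract reports.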
Multimodal Signal Processing and Learning Aspects of Human-Robot Interaction for an Assistive Bathing Robot
We explore new aspects of assistive living on smart human-robot interaction (HRI) that involve automatic recognition and online validation of speech and gestures in a natural interface, providing social features for HRI. We introduce a whole framework and resources of a real-life scenario for elderly subjects supported by an assistive bathing robot, addressing health and hygiene care issues. We contribute a new dataset and a suite of tools used for data acquisition, and a state-of-the-art pipeline for multimodal learning within the framework of the I-Support bathing robot, with emphasis on audio and RGB-D visual streams. We consider privacy issues by evaluating the depth visual stream along with the RGB, using Kinect sensors. The audio-gestural recognition task on this new dataset yields up to 84.5%, while the online validation of the I-Support system on elderly users accomplishes up to 84% when the two modalities are fused together. The results are promising enough to support further research in the area of multimodal recognition for assistive social HRI, considering the difficulties of the specific task. Upon acceptance of the paper, part of the data will be publicly available
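The reported gain when "the two modalities are fused together" is typically obtained by combining per-class scores from the audio and gesture recognizers. A common way to do this is weighted late fusion, sketched below; the class labels, scores, and weight are hypothetical examples, not values from the I-Support evaluation.

```python
def late_fusion(audio_scores, gesture_scores, w_audio=0.6):
    """Weighted late fusion of per-class posterior scores from two modalities.

    Fuses the score dictionaries linearly and returns the winning class
    label; w_audio controls how much the audio modality is trusted."""
    assert audio_scores.keys() == gesture_scores.keys()
    fused = {label: w_audio * audio_scores[label]
                    + (1.0 - w_audio) * gesture_scores[label]
             for label in audio_scores}
    return max(fused, key=fused.get)
```

With a scheme like this, a command that is ambiguous in one modality (e.g. a noisy utterance) can still be recognized correctly when the other modality scores it confidently.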
Living the life of the great Buster Keaton
Poem. "On this episode of The Missouri Review podcast, we'll be hearing 'Living the life of the great Buster Keaton,' by Douglas Collura, the winner of the Voice-only poetry category in our 2008 Audio/Video competition."--Publisher's Web site