Search CORE

2 research outputs found

A bag-of-features framework for incremental learning of speech invariants in unsegmented audio streams

Author: Filliat David
Mangin Olivier
Oudeyer Pierre-Yves
Publication venue: HAL CCSD
Publication date: 01/01/2010
Field of study

International audienceWe introduce a computational framework that allows a machine to bootstrap flexible autonomous learning of speech recognition skills. Technically, this framework shall en- able a robot to incrementally learn to recog- nize speech invariants from unsegmented au- dio streams and with no prior knowledge of phonetics. To achieve this, we import the bag-of-words/bag-of-features approach from recent research in computer vision, and adapt it to incremental developmental speech pro- cessing. We evaluate an implementation of this framework on a complex speech database

INRIA a CCSD electronic archive server

Towards unsupervised online word clustering

Author: Brandl Holger
Goerick Christian
Joublin Frank
Publication venue
Publication date: 01/01/2008
Field of study

Brandl H, Joublin F, Goerick C. Towards unsupervised online word clustering. In: IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, NV; 2008: 5073-5076.Understanding the bootstrapping process of speech representation in infants is one key issue towards systems which may provide human-like speech recognition abilities some day. Until now, almost all current speech recognition systems have failed to integrate learning into the recognition process. Here we propose a system for unsupervised word-clustering, which is able to recognize and learn the structure of speech online in a unified framework. To do so we’ve extended HMM-based filler-free keyword spotting with acoustic model acquisition. To evaluate and control the dynamics of the combined acquisition-recognition process we propose measures for model activity, model correlation and speech coverage

Publications at Bielefeld University