28 research outputs found

How does a dictation machine recognize speech?

Author: Bourlard
F Jelinek
H Bourlard
JA Bilmes
JW Picone
K Murphy
LR Rabiner
O Cappé
R Polikar
RO Duda
S Thorvaldsen
S Young
TK Moon
Publication venue: Centre du Parc, Rue Marconi 19, 1920 Martigny, Idiap
Publication date: 01/01/2009

There is magic (or is it witchcraft?) in a speech recognizer that transcribes continuous radio speech into text, even with a word accuracy of no more than 50%. The extreme difficulty of this task, though, is usually not perceived by the general public. This is because we are almost deaf to the infinite acoustic variations that accompany the production of vocal sounds, which arise from physiological constraints (co-articulation), but also from the acoustic environment (additive or convolutional noise, Lombard effect), or from the emotional state of the speaker (voice quality, speaking rate, hesitations, etc.). Our consciousness of speech is indeed not stimulated until after it has been processed by our brain to make it appear as a sequence of meaningful units: phonemes and words. In this chapter we will see how statistical pattern recognition and statistical sequence recognition techniques are currently used to try to mimic this extraordinary faculty of our mind (Section 4.1). We will follow, in Section 4.2, with a MATLAB-based proof of concept of word-based automatic speech recognition (ASR) based on Hidden Markov Models (HMMs), using a bigram model for modeling (syntactic-semantic) language constraints.

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Introduction to the Special Issue on New Computational Paradigms for Acoustic Modelling in Speech Recognition

Author: Bilmes JA
Russell Martin
Publication venue: 'Elsevier BV'
Publication date: 01/07/2003

Crossref

University of Birmingham Research Portal

Acoustic and device feature fusion for load recognition

Author: A Temko
CJC Burges
GW Hart
J Uteley
JA Bilmes
LK Norford
M Hazas
SR Shaw
Takekazu Kato
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2016

Appliance-specific Load Monitoring (LM) provides a possible solution to the problem of energy conservation, which is becoming increasingly challenging due to growing energy demands within offices and residential spaces. It is essential to perform automatic appliance recognition and monitoring for optimal resource utilization. In this paper, we study the use of non-intrusive LM methods that rely on steady-state appliance signatures for classifying the most commonly used office appliances, while demonstrating their limitation in accurately discerning low-power devices due to overlapping load signatures. We propose a multi-layer decision architecture that makes use of audio features derived from device sounds and fuses them with load signatures acquired from an energy meter. For the recognition of device sounds, we perform feature set selection by evaluating combinations of time-domain and FFT-based audio features on state-of-the-art machine learning algorithms. Further, we demonstrate that our proposed feature set, a concatenation of device audio features and load signatures, significantly improves device recognition accuracy in comparison to the use of steady-state load signatures only.

Deakin Research Online

Crossref

Enlighten

Gaussian mixture model–based path-synthesis accumulation imaging of guided wave for damage monitoring of aircraft composite structures under temperature variation

Author: Bilmes JA
Fang Fang
Goldberger J
Lei Qiu
Shenfang Yuan
Supreet K
Yuanqiang Ren
Zhao X
Publication venue: 'SAGE Publications'

Crossref

Clustering performance comparison using K

Author: Ahuja S
Bilmes JA
Jun Heo
Jung YG
Min Soo Kang
Shukla SK
Singh M
Yong Gyu Jung
Publication venue: 'Informa UK Limited'

Crossref

Multi-branch hidden Markov models for remaining useful life estimation of systems under multiple deterioration modes

Author: Bilmes JA
Bishop CM
Christophe Bérenguer
Florent Chatelain
Hoeting JA
Huynh KT
Kohavi R
Le TT
Thanh Trung Le
Wang W
Zhang X
Publication venue: 'SAGE Publications'

Crossref

Continuous Speech Recognition Using Dynamic Bayesian Networks: A Fast Decoding Algorithm

Author: AJ Viterbi
AP Dawid
G Zweig
JA Bilmes
K Daoudi
K-F Lee
M Deviren
N Friedman
O Cetin
U Kjaerulf
Publication venue: Springer Physica Verlag
Publication date: 01/01/2004

Book chapter. State-of-the-art automatic speech recognition systems are based on probabilistic modeling of the speech signal using Hidden Markov Models (HMMs). Recent work has focused on using the dynamic Bayesian network (DBN) framework to construct new acoustic models that overcome the limitations of HMM-based systems. In this line of research, we proposed a methodology to learn the conditional independence assertions of acoustic models based on structural learning of DBNs. In previous work, we evaluated this approach on simple isolated and connected digit recognition tasks. In this paper we evaluate our approach on a more complex task: continuous phoneme recognition. For this purpose, we propose a new decoding algorithm based on dynamic programming. The proposed algorithm decreases the computational complexity of decoding and hence enables the application of the approach to complex speech recognition tasks.

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

Continuous Speech Recognition Using Dynamic Bayesian Networks: A Fast Decoding Algorithm

Author: AJ Viterbi
AP Dawid
G Zweig
JA Bilmes
K Daoudi
K-F Lee
M Deviren
N Friedman
O Cetin
U Kjaerulf
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004

Crossref

Design and evaluation of nonverbal sound-based input for those with motor handicapped

Author: Atiwong Suchato
Bilmes JA
Chanjaradwichai S
Dai L
Gerdtman C
Hornof AJ
Igarashi T
Karimullah AS
Loewenich F
Proadpran Punyabukkana
Sears A
Sporka AJ
Supadaech Chanjaradwichai
Toni P
Publication venue: 'Informa UK Limited'

Crossref

A New Classifier for Facial Expression Recognition: Fuzzy Buried Markov Model

Author: Chuan-Jun Wen
D Kim
DM Tsai
JA Bilmes
Ke-Yang Cheng
M Pardàs
PS Aleksic
Y Normandin
Ya-Bi Chen
YF Zhu
Yong-Zhao Zhan
YZ Zhan
Publication venue: 'Springer Science and Business Media LLC'

Crossref