6,475 research outputs found
SVMs for Automatic Speech Recognition: a Survey
Hidden Markov Models (HMMs) are, undoubtedly, the most employed core technique for Automatic Speech Recognition (ASR). Nevertheless, we are still far from achieving high-performance ASR systems. Some alternative approaches, most of them based on Artificial Neural Networks (ANNs), were proposed during the late eighties and early nineties. Some of them tackled the ASR problem using predictive ANNs, while others proposed hybrid HMM/ANN systems. However, despite some achievements, nowadays, the preponderance of Markov Models is a fact.
During the last decade, however, a new tool appeared in the field of machine learning that has proved to be able to cope with hard classification problems in several fields of application: the Support Vector Machines (SVMs). The SVMs are effective discriminative classifiers with several outstanding characteristics, namely: their solution is that with maximum margin; they are capable to deal with samples of a very higher dimensionality; and their convergence to the minimum of the associated cost function is guaranteed.
These characteristics have made SVMs very popular and successful. In this chapter we discuss their strengths and weakness in the ASR context and make a review of the current state-of-the-art techniques. We organize the contributions in two parts: isolated-word recognition and continuous speech recognition. Within the first part we review several techniques to produce the fixed-dimension vectors needed for original SVMs. Afterwards we explore more sophisticated techniques based on the use of kernels capable to deal with sequences of different length. Among them is the DTAK kernel, simple and effective, which rescues an old technique of speech recognition: Dynamic Time Warping (DTW). Within the second part, we describe some recent approaches to tackle more complex tasks like connected digit recognition or continuous speech recognition using SVMs. Finally we draw some conclusions and outline several ongoing lines of research
Bayesian Nonparametric Hidden Semi-Markov Models
There is much interest in the Hierarchical Dirichlet Process Hidden Markov
Model (HDP-HMM) as a natural Bayesian nonparametric extension of the ubiquitous
Hidden Markov Model for learning from sequential and time-series data. However,
in many settings the HDP-HMM's strict Markovian constraints are undesirable,
particularly if we wish to learn or encode non-geometric state durations. We
can extend the HDP-HMM to capture such structure by drawing upon
explicit-duration semi-Markovianity, which has been developed mainly in the
parametric frequentist setting, to allow construction of highly interpretable
models that admit natural prior information on state durations.
In this paper we introduce the explicit-duration Hierarchical Dirichlet
Process Hidden semi-Markov Model (HDP-HSMM) and develop sampling algorithms for
efficient posterior inference. The methods we introduce also provide new
methods for sampling inference in the finite Bayesian HSMM. Our modular Gibbs
sampling methods can be embedded in samplers for larger hierarchical Bayesian
models, adding semi-Markov chain modeling as another tool in the Bayesian
inference toolbox. We demonstrate the utility of the HDP-HSMM and our inference
methods on both synthetic and real experiments
Complex sequencing rules of birdsong can be explained by simple hidden Markov processes
Complex sequencing rules observed in birdsongs provide an opportunity to
investigate the neural mechanism for generating complex sequential behaviors.
To relate the findings from studying birdsongs to other sequential behaviors,
it is crucial to characterize the statistical properties of the sequencing
rules in birdsongs. However, the properties of the sequencing rules in
birdsongs have not yet been fully addressed. In this study, we investigate the
statistical propertiesof the complex birdsong of the Bengalese finch (Lonchura
striata var. domestica). Based on manual-annotated syllable sequences, we first
show that there are significant higher-order context dependencies in Bengalese
finch songs, that is, which syllable appears next depends on more than one
previous syllable. This property is shared with other complex sequential
behaviors. We then analyze acoustic features of the song and show that
higher-order context dependencies can be explained using first-order hidden
state transition dynamics with redundant hidden states. This model corresponds
to hidden Markov models (HMMs), well known statistical models with a large
range of application for time series modeling. The song annotation with these
models with first-order hidden state dynamics agreed well with manual
annotation, the score was comparable to that of a second-order HMM, and
surpassed the zeroth-order model (the Gaussian mixture model (GMM)), which does
not use context information. Our results imply that the hierarchical
representation with hidden state dynamics may underlie the neural
implementation for generating complex sequences with higher-order dependencies
NILM techniques for intelligent home energy management and ambient assisted living: a review
The ongoing deployment of smart meters and different commercial devices has made electricity disaggregation feasible in buildings and households, based on a single measure of the current and, sometimes, of the voltage. Energy disaggregation is intended to separate the total power consumption into specific appliance loads, which can be achieved by applying Non-Intrusive Load Monitoring (NILM) techniques with a minimum invasion of privacy. NILM techniques are becoming more and more widespread in recent years, as a consequence of the interest companies and consumers have in efficient energy consumption and management. This work presents a detailed review of NILM methods, focusing particularly on recent proposals and their applications, particularly in the areas of Home Energy Management Systems (HEMS) and Ambient Assisted Living (AAL), where the ability to determine the on/off status of certain devices can provide key information for making further decisions. As well as complementing previous reviews on the NILM field and providing a discussion of the applications of NILM in HEMS and AAL, this paper provides guidelines for future research in these topics.Agência financiadora:
Programa Operacional Portugal 2020 and Programa Operacional Regional do Algarve
01/SAICT/2018/39578
Fundação para a Ciência e Tecnologia through IDMEC, under LAETA:
SFRH/BSAB/142998/2018
SFRH/BSAB/142997/2018
UID/EMS/50022/2019
Junta de Comunidades de Castilla-La-Mancha, Spain:
SBPLY/17/180501/000392
Spanish Ministry of Economy, Industry and Competitiveness (SOC-PLC project):
TEC2015-64835-C3-2-R MINECO/FEDERinfo:eu-repo/semantics/publishedVersio
- …