Search CORE

1,494 research outputs found

Models for evaluating interaction protocols in speech recognition

Author
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/1991
Field of study

Crossref

Querying and Efficiently Searching Large, Temporal Text Corpora

Author: Willkomm Jens
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 21/10/2021
Field of study

KITopen

An acoustic-phonetic approach in automatic Arabic speech recognition

Author: Marwan Al-Zabibi (7203125)
Publication venue
Publication date: 01/01/1990
Field of study

In a large vocabulary speech recognition system the broad phonetic classification technique is used instead of detailed phonetic analysis to overcome the variability in the acoustic realisation of utterances. The broad phonetic description of a word is used as a means of lexical access, where the lexicon is structured into sets of words sharing the same broad phonetic labelling. This approach has been applied to a large vocabulary isolated word Arabic speech recognition system. Statistical studies have been carried out on 10,000 Arabic words (converted to phonemic form) involving different combinations of broad phonetic classes. Some particular features of the Arabic language have been exploited. The results show that vowels represent about 43% of the total number of phonemes. They also show that about 38% of the words can uniquely be represented at this level by using eight broad phonetic classes. When introducing detailed vowel identification the percentage of uniquely specified words rises to 83%. These results suggest that a fully detailed phonetic analysis of the speech signal is perhaps unnecessary. In the adopted word recognition model, the consonants are classified into four broad phonetic classes, while the vowels are described by their phonemic form. A set of 100 words uttered by several speakers has been used to test the performance of the implemented approach. In the implemented recognition model, three procedures have been developed, namely voiced-unvoiced-silence segmentation, vowel detection and identification, and automatic spectral transition detection between phonemes within a word. The accuracy of both the V-UV-S and vowel recognition procedures is almost perfect. A broad phonetic segmentation procedure has been implemented, which exploits information from the above mentioned three procedures. Simple phonological constraints have been used to improve the accuracy of the segmentation process. The resultant sequence of labels are used for lexical access to retrieve the word or a small set of words sharing the same broad phonetic labelling. For the case of having more than one word-candidates, a verification procedure is used to choose the most likely one

Loughborough University Institutional Repository

Adaptation of reference patterns in word-based speech recognition

Author: McInnes Fergus Robert
Publication venue: The University of Edinburgh
Publication date: 01/01/1988
Field of study

Edinburgh Research Archive

The Production of Speech Corpora

Author: Baumann Angela
Draxler Christoph
Ellbogen Tania
Schiel Florian
Steffen Alexander
Publication venue
Publication date: 21/03/2012
Field of study

Open Access LMU

Recommended from our members

Error-correcting output codes : a general method for improving multiclass inductive learning programs

Author: Bakiri Ghulum
Dietterich Thomas G.
Publication venue: Oregon State University. Department of Computer Science
Publication date
Field of study

Multiclass learning problems involve finding a definition for an unknown function f(x) whose range is a discrete set containing k > 2 values (i.e., k "classes") . The definition is acquired by studying large collections of training examples of the form (xi, f(xi)) . Existing approaches to this problem include (a) direct application of multiclass algorithms such as the decision-tree algorithms ID3 and CART, (b) application of binary concept learning algorithms to learn individual binary functions for each of the k classes, and (c) application of binary concept learning algorithms with distributed output codes such as those employed by Sejnowski and Rosenberg in the NETtalk system. This paper compares these three approaches to a new technique in which BCH error-correcting codes are employed as a distributed output representation. We show that these output representations improve the performance of ID3 on the NETtalk task and of backpropagation on an isolated-letter speech-recognition task. These results demonstrate that error-correcting output codes provide a general-purpose method for improving the performance of inductive learning programs on multi- class problems

ScholarsArchive@OSU

Introduction to Quantum Information Processing

Author: Barnum H.
Dalvit D.
Dziarmaga J.
Gubernatis J.
Gurvits L.
Knill E.
Laflamme R.
Ortiz G.
Viola L.
Zurek W. H.
Publication venue
Publication date: 01/01/2002
Field of study

As a result of the capabilities of quantum information, the science of quantum information processing is now a prospering, interdisciplinary field focused on better understanding the possibilities and limitations of the underlying theory, on developing new applications of quantum information and on physically realizing controllable quantum devices. The purpose of this primer is to provide an elementary introduction to quantum information processing, and then to briefly explain how we hope to exploit the advantages of quantum information. These two sections can be read independently. For reference, we have included a glossary of the main terms of quantum information.Comment: 48 pages, to appear in LA Science. Hyperlinked PDF at http://www.c3.lanl.gov/~knill/qip/prhtml/prpdf.pdf, HTML at http://www.c3.lanl.gov/~knill/qip/prhtm

arXiv.org e-Print Archive

CiteSeerX