Edinburgh Research Archive

Crossref

Institutional Repository for Minnesota State University, Mankato

Prosodic modules for speech recognition and understanding in VERBMOBIL

Author: Batliner Anton
Hess Wolfgang
Kießling Andreas
Kompe Ralf
Nöth Elmar
Petzold Anja
Reyelt Matthias
Strom Volker
Publication venue: Other institutions. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1996

Within VERBMOBIL, a large project on spoken language research in Germany, two modules for detecting and recognizing prosodic events have been developed. One module operates on speech signal parameters and the word hypothesis graph, whereas the other module, designed for a novel, highly interactive architecture, uses only speech signal parameters as its input. Phrase boundaries, sentence modality, and accents are detected. In spontaneous dialogs, the recognition rates reach up to 82.5% for accents and up to 91.7% for phrase boundaries.

arXiv.org e-Print Archive

Parsing of Spoken Language under Time Constraints

Author: Menzel Wolfgang
Publication date: 01/01/1994

Spoken language applications in natural dialogue settings place serious requirements on the choice of processing architecture. Especially under adverse phonetic and acoustic conditions, parsing procedures have to be developed which not only analyse the incoming speech in a time-synchronous and incremental manner, but are also able to schedule their resources according to the varying conditions of the recognition process. Depending on the actual degree of local ambiguity, the parser has to select among the available constraints in order to narrow down the search space with as little effort as possible. A parsing approach based on constraint satisfaction techniques is discussed. It provides important characteristics of the desired real-time behaviour and attempts to mimic some of the attention focussing capabilities of the human speech comprehension mechanism.

Some experiments in speech act prediction

Author: Reithinger Norbert
Publication venue: Other institutions. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1994

In this paper, we present a statistical approach for speech act prediction in the dialogue component of the speech-to-speech translation system Verbmobil. The prediction algorithm is based on work known from language modelling and uses N-gram information computed from a training corpus. We demonstrate the performance of this method with 10 experiments. These experiments vary in two dimensions, namely whether the N-gram information is updated while processing, and whether deviations from the standard dialogue structure are processed. Six of the experiments use complete dialogues, while four process only the speech acts of one dialogue partner. The predictions are best when the update feature is used and deviations are not processed. Even the processing of incomplete dialogues then yields acceptable results. Another experiment shows that a training corpus size of about 40 dialogues is sufficient for the prediction task, and that the structure of the dialogues of the Verbmobil corpus we use differs remarkably with respect to the predictions.
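
To make the idea of N-gram based speech act prediction concrete, the following is a minimal sketch and not the method of the paper itself. It assumes a plain bigram model without smoothing or interpolation, and the speech act labels (`greet`, `suggest`, `accept`, `reject`, `bye`), the class `SpeechActPredictor`, and the function `hit_rate` are hypothetical names introduced only for illustration. The `update` flag loosely mirrors the abstract's distinction between updating and not updating the N-gram information while processing.

```python
from collections import Counter, defaultdict

START = "<start>"  # assumed marker for the beginning of a dialogue


class SpeechActPredictor:
    """Illustrative bigram predictor over speech act labels."""

    def __init__(self):
        # counts[prev][nxt] = how often `nxt` followed `prev`
        self.counts = defaultdict(Counter)

    def observe(self, prev, nxt):
        """Record one observed transition prev -> nxt."""
        self.counts[prev][nxt] += 1

    def train(self, dialogues):
        """Collect bigram counts from a list of speech act sequences."""
        for acts in dialogues:
            for prev, nxt in zip([START] + acts, acts):
                self.observe(prev, nxt)

    def predict(self, prev, k=3):
        """Return the k most frequent successors of `prev`."""
        successors = self.counts.get(prev, Counter())
        return [act for act, _ in successors.most_common(k)]


def hit_rate(predictor, acts, k=3, update=False):
    """Fraction of speech acts that appear among the top-k predictions.

    With update=True the counts are extended with each observed
    transition after it has been scored.
    """
    hits = 0
    prev = START
    for act in acts:
        if act in predictor.predict(prev, k):
            hits += 1
        if update:
            predictor.observe(prev, act)
        prev = act
    return hits / len(acts) if acts else 0.0


# Hypothetical toy corpus of two appointment-scheduling dialogues.
training = [
    ["greet", "suggest", "accept", "bye"],
    ["greet", "suggest", "reject", "suggest", "accept", "bye"],
]
predictor = SpeechActPredictor()
predictor.train(training)
best_after_suggest = predictor.predict("suggest", k=1)
rate = hit_rate(predictor, ["greet", "suggest", "accept", "bye"], k=1)
```

A real system would likely use higher-order N-grams and some form of smoothing for unseen transitions; the sketch omits both to keep the counting step visible.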

Semantic transfer in Verbmobil

Author: Copestake Ann
Publication venue: Other institutions. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1995

This paper is a detailed discussion of semantic transfer in the context of the Verbmobil Machine Translation project. The use of semantic transfer as a translation mechanism is introduced and justified by comparison with alternative approaches. Some criteria for the evaluation of transfer frameworks are discussed, and three different approaches to the representation of translation rules or equivalences are compared. This is followed by a discussion of how the application of transfer rules is controlled and how it interacts with a domain description and inference component.