40 research outputs found

Comparison of Acoustic Model Adaptation Techniques on Non-native Speech

Authors: Schultz Tanja; Waibel Alex; Wang Zhirong
Publication date: 13/06/2008

KITopen

Articulatory features for conversational speech recognition

Author: Metze Florian
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2005

KITopen

A Multi-Perspective Evaluation of the NESPOLE! Speech-to-Speech Translation System

Authors: Cattoni Roldano; Constantini Erica; Lavie Alon; Metze Florian
Publication date: 13/06/2008

KITopen

Rapid Development of an Afrikaans-English Speech-to-Speech Translator

Authors: Engelbrecht Hermann; Schultz Tanja
Publication date: 16/06/2008

KITopen

Hybrid discourse modeling and summarization for a speech-to-speech translation system

Author: Alexandersson Jan
Publication venue: Fakultät 6 - Naturwissenschaftlich-Technische Fakultät I. Fachrichtung 6.2 - Informatik
Publication date: 01/01/2003

The thesis discusses two parts of the speech-to-speech translation system VerbMobil: the dialogue model and one of its applications, multilingual summary generation. In connection with the dialogue model, two topics are of special interest: (a) the use of a default unification operation called overlay, formalized in the thesis, as the fundamental operation for dialogue management; and (b) an intentional model that is able to describe intentions in dialogue on five levels in a language-independent way. Besides the actual generation algorithm developed, we present a comprehensive evaluation of the summarization functionality. In addition to precision and recall, a new characterization, confabulation, is defined that provides a more precise understanding of the performance of complex natural language processing systems.

Universaar

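Hybride konnektionistische, statistische und regelbasierte Ansätze zur Verarbeitung natürlicher Sprache : Workshop auf der 21. Deutschen Jahrestagung für Künstliche Intelligenz, Freiburg, 9.-10. September 1997 [Hybrid Connectionist, Statistical, and Rule-Based Approaches to Natural Language Processing: Workshop at the 21st German Annual Conference on Artificial Intelligence, Freiburg, 9-10 September 1997]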

Publication date: 01/01/1998

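Hybride konnektionistische, statistische und regelbasierte Ansätze zur Verarbeitung natürlicher Sprache : Workshop auf der 21. Deutschen Jahrestagung für Künstliche Intelligenz, Freiburg, 9.-10. September 1997 [Hybrid Connectionist, Statistical, and Rule-Based Approaches to Natural Language Processing: Workshop at the 21st German Annual Conference on Artificial Intelligence, Freiburg, 9-10 September 1997]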

Publication venue: Sonstige Einrichtungen. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1998

Hybrid discourse modeling and summarization for a speech-to-speech translation system

Author: Alexandersson Jan
Publication date: 01/01/2003

Erkennen und Lernen neuer Wörter [Recognizing and Learning New Words]

Author: Schaaf Thomas
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2004

KITopen

Combining Spectral Representations for Large Vocabulary Continuous Speech Recognition

Authors: Garau Giulia; Renals Steve
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2008

In this paper we investigate the combination of complementary acoustic feature streams in large vocabulary continuous speech recognition (LVCSR). We have explored the use of acoustic features obtained using a pitch-synchronous analysis, STRAIGHT, in combination with conventional features such as mel frequency cepstral coefficients. Pitch-synchronous acoustic features are of particular interest when used with vocal tract length normalisation (VTLN), which is known to be affected by the fundamental frequency. We have combined these spectral representations directly at the acoustic feature level using heteroscedastic linear discriminant analysis (HLDA) and at the system level using ROVER. We evaluated this approach on three LVCSR tasks: dictated newspaper text (WSJCAM0), conversational telephone speech (CTS), and multiparty meeting transcription. The CTS and meeting transcription experiments were both evaluated using standard NIST test sets and evaluation protocols. Our results indicate that combining conventional and pitch-synchronous acoustic feature sets using HLDA results in a consistent, significant decrease in word error rate across all three tasks. Combining at the system level using ROVER resulted in a further significant decrease in word error rate.

CiteSeerX

Crossref

Edinburgh Research Archive

Edinburgh Research Explorer