Search CORE

15 research outputs found

Communicating Unknown Words in Machine Translation

Author: Eck Matthias
Vogel Stephan
Waibel Alex
Publication venue: Association for Computational Linguistics
Publication date: 03/01/2024
Field of study

A new approach to handle unknown words in machine translation is presented. The basic idea is to find definitions for the unknown words on the source language side and translate those definitions instead. Only monolingual resources are required, which generally offer a broader coverage than bilingual resources and are available for a large number of languages. In order to use this in a machine translation system definitions are extracted automatically from online dictionaries and encyclopedias. The translated definition is then inserted and clearly marked in the original hypothesis. This is shown to lead to significant improvements in (subjective) translation quality

KITopen

Developing Deployable Spoken Language Translation Systems given Limited Resources

Author: Eck Matthias
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2008
Field of study

Approaches are presented that support the deployment of spoken language translation systems. Newly developed methods allow low cost portability to new language pairs. Proposed translation model pruning techniques achieve a high translation performance even in low memory situations. The named entity and specialty vocabulary coverage, particularly on small and mobile devices, is targeted to an individual user by translation model personalization

KITopen

(Meta-) Evaluation of Machine Translation

Author: Callison-Burch Chris
Fordyce Cameron
Koehn Philipp
Monz Christof
Schroeder Josh
Publication venue
Publication date: 01/01/2007
Field of study

Edinburgh Research Explorer

N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination

Author: Khalilov Maxim
Rodríguez Fonollosa José Adrián
Publication venue
Publication date: 30/03/2009
Field of study

In this paper we compare and contrast two approaches to Machine Translation (MT): the CMU-UKA Syntax Augmented Machine Translation system (SAMT) and UPC-TALP N-gram-based Statistical Machine Translation (SMT). SAMT is a hierarchical syntax-driven translation system underlain by a phrase-based model and a target part parse tree. In N-gram-based SMT, the translation process is based on bilingual units related to word-to-word alignment and statistical modeling of the bilingual context following a maximumentropy framework. We provide a stepby- step comparison of the systems and report results in terms of automatic evaluation metrics and required computational resources for a smaller Arabic-to-English translation task (1.5M tokens in the training corpus). Human error analysis clarifies advantages and disadvantages of the systems under consideration. Finally, we combine the output of both systems to yield significant improvements in translation quality.Postprint (published version

UPCommons. Portal del coneixement obert de la UPC

The CMU TransTac 2007 Eyes-free and Hands-free Two-way Speech-to-Speech Translation System

Author: Alex Waibel Alex
Bach Nguyen
Black Alan W.
Charoenpornsawat Paisarn
Eck Matthias
Hsiao Roger
Köhler Thilo
Nguyen ThuyLinh
Schultz Tanja
Stüker Sebastian
Vogel Stephan
Publication venue: Trento
Publication date: 01/01/2007
Field of study

KITopen

Multilingual Speech Processing in the context of Under-resourced Languages

Author: Schultz Tanja
Publication venue
Publication date: 04/08/2008
Field of study

KITopen

Veröffentlichungen und Vorträge 2006 der Mitglieder der Fakultät für Informatik

Author: Fakultät für Informatik
Publication venue: Universität Karlsruhe (TH)
Publication date: 01/01/2007
Field of study

KITopen

A System for Simultaneous Translation of Lectures and Speeches

Author: Fügen Christian
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2009
Field of study

This thesis realizes the first existing automatic system for simultaneous speech-to-speech translation. The focus of this system is the automatic translation of (technical oriented) lectures and speeches from English to Spanish, but the different aspects described in this thesis will also be helpful for developing simultaneous translation systems for other domains or languages

KITopen

Robust automatic transcription of lectures

Author: Wölfel Matthias
Publication venue: KIT Scientific Publishing, Karlsruhe
Publication date: 01/01/2009
Field of study

Automatic transcription of lectures is becoming an important task. Possible applications can be found in the fields of automatic translation or summarization, information retrieval, digital libraries, education and communication research. Ideally those systems would operate on distant recordings, freeing the presenter from wearing body-mounted microphones. This task, however, is surpassingly difficult, given that the speech signal is severely degraded by background noise and reverberation

KITopen

Directory of Open Access Books (DOAB)

Robust Automatic Transcription of Lectures

Author: Wölfel Matthias
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2009
Field of study

Die automatische Transkription von Vorträgen, Vorlesungen und Präsentationen wird immer wichtiger und ermöglicht erst die Anwendungen der automatischen Übersetzung von Sprache, der automatischen Zusammenfassung von Sprache, der gezielten Informationssuche in Audiodaten und somit die leichtere Zugänglichkeit in digitalen Bibliotheken. Im Idealfall arbeitet ein solches System mit einem Mikrofon das den Vortragenden vom Tragen eines Mikrofons befreit was der Fokus dieser Arbeit ist

KITopen