Charles University

Biblio at Institute of Formal and Applied Linguistics

Not a member yet

506 research outputs found

Sort by

Computer Aided Translation Backed by Machine Translation

Author: Odcházel Ondřej
Bojar Ondřej
Publication venue
Publication date: 01/01/2009
Field of study

A number of tools to support translators (computer-aided translation, CAT) exist, as there are many systems of machine translation (MT). So far, the integration of the two system types was little or none. The aim of this paper is to examine a tighter coupling of MT and CAT. We introduce our web-based CAT tool implemented using the modern AJAX technology that communicates with Moses MT system on the server side to provide the translator with suggested translations of individual phrases of the source sentence as well as several options of the complete continuation of the output sentence. The suggested continuation is based on what has been already translated and what the user has already written as the output. Hopefully, the proposed user interface and the MT system at the back end will accelerate and simplify the process of translation

Jak se Mojžíš s Jozuem učili hindsky (aneb souboj s mizernými daty při překladu z angličtiny do hindštiny)

Author: Bojar Ondřej
Zeman Daniel
Straňák Pavel
Publication venue
Publication date: 01/01/2009
Field of study

Získávání paralelních textů z webu

Author: Ehrenberger Jan
Novák Michal
Bojar Ondřej
Fabian Peter
Klempová Hana
Publication venue
Publication date: 01/01/2009
Field of study

We examine methods for collecting parallel Czech-English corpora from the web. We propose and evaluate automatic methods for finding source web sites, language identification and most importantly the document alignment of obtained pages

Tectogrammatical Annotation of the Wall Street Journal

Author: Klimeš Václav
Šindlerová Jana
Mladová Lucie
Hajič Jan
Cinková Silvie
Žabokrtský Zdeněk
Tomšů Kristýna
Čermáková Kristýna
Toman Josef
Publication venue
Publication date: 01/01/2009
Field of study

This paper gives an overview of the current state of the Prague English Dependency Treebank project. It is an updated version of a draft text that was released along with a CD presenting the first 25\% of the PDT-like version of the Penn Treebank -- WSJ section (PEDT 1.0)

Jak se dělá strojový překlad

Author: Bojar Ondřej
Publication venue
Publication date: 01/01/2009
Field of study

A quick introduction to problems of machine translation for the participants of Pralinka corespondence series

Pokyny k překladu určené překladatelům, revizorům a korektorům textů z Wall Street Journal pro projekt PCEDT

Author: Mikulová Marie
Publication venue
Publication date: 01/01/2009
Field of study

PCEDT is planned to be a corpus of syntactically annotated parallel texts (in English and Czech) intended chiefly for machine translation experiments. The texts for PCEDT were taken from Penn Treebank, which means there are mostly economical articles from the Wall Street Journal. 2312 documents were used in PCEDT (approximately 49,000 sentences) that are manually annotated with constituent trees in Penn Treebank. For the Czech part of PCEDT, the English texts have to be translated into Czech. This book contents the guidelines for translators

58

full texts

506

metadata records

Updated in last 30 days.

Biblio at Institute of Formal and Applied Linguistics is based in Czechia

Access Repository Dashboard

Do you manage Open Research Online? Become a CORE Member to access insider analytics, issue reports and manage access to outputs from your repository in the CORE Repository Dashboard! 👇