Computer Aided Translation Backed by Machine Translation
A number of tools exist to support translators (computer-aided translation, CAT),
as do many machine translation (MT) systems. So far, the two types of system
have been integrated little or not at all. The aim of this paper is to examine a
tighter coupling of MT and CAT.
We introduce our web-based CAT tool, implemented using modern AJAX technology,
which communicates with the Moses MT system on the server side to provide the
translator with suggested translations of individual phrases of the source
sentence as well as several options for the complete continuation of the output
sentence. The suggested continuation is based on what has already been
translated and what the user has already written as the output. We hope that the
proposed user interface and the MT system at the back end will accelerate and
simplify the process of translation.
How Moses and Joshua Learned Hindi (or, the Struggle with Poor Data in English-to-Hindi Translation)
Acquiring Parallel Texts from the Web
We examine methods for collecting parallel Czech-English corpora from the web. We propose and evaluate automatic methods for finding source web sites, for language identification and, most importantly, for the document alignment of the obtained pages.
Tectogrammatical Annotation of the Wall Street Journal
This paper gives an overview of the current state of the Prague English Dependency Treebank project. It is an updated version of a draft text that was released along with a CD presenting the first 25% of the PDT-like version of the Penn Treebank -- WSJ section (PEDT 1.0).
How Machine Translation Is Done
A quick introduction to the problems of machine translation for the participants of the Pralinka correspondence series.
Translation Guidelines for Translators, Revisers and Proofreaders of Wall Street Journal Texts for the PCEDT Project
PCEDT is planned to be a corpus of syntactically annotated parallel texts (in English and Czech) intended chiefly for machine translation experiments. The texts for PCEDT were taken from the Penn Treebank, which means they are mostly economic articles from the Wall Street Journal. PCEDT uses 2312 documents (approximately 49,000 sentences) that are manually annotated with constituent trees in the Penn Treebank. For the Czech part of PCEDT, the English texts have to be translated into Czech. This book contains the guidelines for translators.