4 research outputs found

    Learning Translation Rules from Bilingual English - Filipino Corpus

    Get PDF
    PACLIC 19 / Taipei, taiwan / December 1-3, 200

    Learning translation rules from bilingual English - Filipino corpus

    No full text
    Most machine translators are implemented using example based, rule based, and statistical approaches. However, each of these paradigms has its drawbacks. Example based and statistical based approaches are domain specific and requires a large database of examples to produce accurate translation results. Although rule based approach is known to produce high quality translations, a linguist is necessary in deriving the set of rules to be used. To address these problems, we present an approach that uses the rule based approach in translating from English to Filipino text. It incorporates learning of rules based on the analysis of a bilingual corpus in an attempt to eliminate the need for a linguist. The learning algorithm is based on seeded version space learning algorithm as presented by Probst (2002). Implementation of the algorithm has been modified to allow learning of non-lexically aligned languages and to adapt to the complex free word order of the Filipino language

    TWIRL: Translation with rule learning

    No full text
    Machine translation (MT) is the automatic conversion of a source language to a target language using computers. The two most common paradigms for machine translation are the rule based and example based approaches. The problem with the example based approach is that it needs to be domain specific and a large database of examples is needed to produce accurate translation results. Rule based approaches are known to produce high quality translations however, a linguist is necessary in deriving the set of rules. To overcome the problems of both the example based and the pure rule based paradigms, TWiRL used the rule based approach with an integration of machine learning of rules to allow flexibility in translation. Since no rule learning has been explored in English to Filipino machine translation system, the focus of this research is on translating English to Filipino text. Keywords: Rule generalization, Compositionality, Bilingual Corpus, rule based, Machine Translation, Seed Rule Generation
    corecore