Skip to main content
Article thumbnail
Location of Repository

Aligning noisy parallel corpora across language groups: Word pair feature matching by dynamic time warping

By Pascale Fung and Kathleen Mckeown

Abstract

We propose a new algorithm, DK-vec, for aligning pairs of Asian/Indo-European noisy parallel texts without sentence boundaries. The algorithm uses frequency, position and recency information as features for pattern matching. Dynamic Time Warping is used as the matching technique between word pairs. This algorithm produces a small bilingual lexicon which provides anchor points for alignment

Year: 1994
OAI identifier: oai:CiteSeerX.psu:10.1.1.134.6869
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://www.ee.ust.hk/~pascale/... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.