3 research outputs found
Digital access for language and culture in First Nations communities
Digital access for language and culture in First Nations communitie
A unicode-based environment for creation and use of language resources.
GATE is a Unicode-aware architecture, development environment and framework for building systems that process human language. It is often thought that the character sets problem has been solved by the arrival of the Unicode standard. This standard is an important advance, but in practice the ability to process text in a large number of the World's languages is still limited. This paper describes work done in the context of the GATE project that makes use of Unicode and plugs some of the gaps for language processing R&D