Bruk av et norsk leksikon til tagging og andre språkteknologiske formål

Abstract

Norsk ordbank (the Norwegian Word Bank) is an electronic lexiconfor the two Norwegian written standards, Bokmål and Nynorsk. Itforms the basis of many, probably most, of the existing languagetechnology tools for Norwegian. The lexicon is based on the entriesand inflectional information found in the dictionaries Bokmålsordbokaand Nynorskordboka as well as word lists and inflectionalpatterns developed by IBM Norway. We present some backgroundinformation about the lexicon and show how it has been applied toa variety of language technology tools and various applications forend users. Since the lexicon was developed from resources meant foruse by human readers, much work has been devoted to modifyingthe lexicon to make it better suited for use in language technology,and the main focus of our paper is on this work

    Similar works