1,264 research outputs found

    Natural language processing for similar languages, varieties, and dialects: A survey

    Get PDF
    There has been a lot of recent interest in the natural language processing (NLP) community in the computational processing of language varieties and dialects, with the aim to improve the performance of applications such as machine translation, speech recognition, and dialogue systems. Here, we attempt to survey this growing field of research, with focus on computational methods for processing similar languages, varieties, and dialects. In particular, we discuss the most important challenges when dealing with diatopic language variation, and we present some of the available datasets, the process of data collection, and the most common data collection strategies used to compile datasets for similar languages, varieties, and dialects. We further present a number of studies on computational methods developed and/or adapted for preprocessing, normalization, part-of-speech tagging, and parsing similar languages, language varieties, and dialects. Finally, we discuss relevant applications such as language and dialect identification and machine translation for closely related languages, language varieties, and dialects.Non peer reviewe

    Parallels in romance nominal and clausal microvariation

    Get PDF
    This article explores parallels in the dimensions of microvariation characterizing the functional structure and organization of the Romance nominal and clausal groups. Within a parameter hierarchy approach it is argued that observed synchronic and diachronic variation across both domains can be readily captured in terms of a single set of higher- and above all lower-level parametric options. This parallelism constitutes a welcome finding in that it points to how the available parametric space can be constrained and defined in terms of a set of common transcategorial principles and options.This is the final version of the article. It first appeared from Editura Academiei Române via http://www.lingv.ro/index.php?option=com_content&view=article&id=137%3Arrlarhiva2015&catid=36%3Areviste-ilb&Itemid=9
    • …
    corecore