4 research outputs found

    Proceedings of the Conference on Natural Language Processing 2010

    Get PDF
    This book contains state-of-the-art contributions to the 10th conference on Natural Language Processing, KONVENS 2010 (Konferenz zur Verarbeitung natĂĽrlicher Sprache), with a focus on semantic processing. The KONVENS in general aims at offering a broad perspective on current research and developments within the interdisciplinary field of natural language processing. The central theme draws specific attention towards addressing linguistic aspects ofmeaning, covering deep as well as shallow approaches to semantic processing. The contributions address both knowledgebased and data-driven methods for modelling and acquiring semantic information, and discuss the role of semantic information in applications of language technology. The articles demonstrate the importance of semantic processing, and present novel and creative approaches to natural language processing in general. Some contributions put their focus on developing and improving NLP systems for tasks like Named Entity Recognition or Word Sense Disambiguation, or focus on semantic knowledge acquisition and exploitation with respect to collaboratively built ressources, or harvesting semantic information in virtual games. Others are set within the context of real-world applications, such as Authoring Aids, Text Summarisation and Information Retrieval. The collection highlights the importance of semantic processing for different areas and applications in Natural Language Processing, and provides the reader with an overview of current research in this field

    The Development of Collocations as Constructions in L2 Writing

    Get PDF
    Cross-sectional and longitudinal learner corpus studies utilizing phraseological, frequency, and association strength approaches to phraseological unit identification have shown how the use of phraseological units varies across proficiency levels and develops over time. However, these methods suffer from several limitations, such as a reliance on native speaker intuition, a limited focus on contiguous word sequences, and a neglect of part of speech information in association strength calculation. This study seeks to address these limitations by defining lexical collocations as constructions (henceforth “collconstructions”) within the framework of Construction Grammar and investigating their cross-sectional variation and longitudinal development in two corpora of L2 writing. The cross-sectional corpus consisted of beginner and intermediate EFL learner texts assessed for overall writing proficiency, while the longitudinal corpus contained freewrites produced by ESL learners over the course of one year. Contiguous and non-contiguous adjective-noun, verb-noun, and adverb-adjective collconstruction tokens were extracted from each learner text in the two learner corpora. Each learner text was assessed for multiple constructional and collostructional indices of collconstruction production. Constructional indices included type frequencies, token frequencies, and normalized entropy scores for each collconstruction category. Collostructional indices consisted of proportion scores for different categories of adjective-noun, adverb-adjective, and verb-noun collconstruction types and tokens based on covarying collexeme scores calculated using frequency information from an academic reference corpus. Variation across proficiency levels was evaluated both qualitatively and quantitatively. The qualitative analysis consisted of examining variation in the production of specific functional collconstruction subcategories from a Usage-based Second Language Acquisition perspective. The quantitative analysis consisted of the calculation of an ordinal logistic regression in order to determine whether any indices of collconstruction production were predictive of L2 writing quality. Longitudinal development at the group level was investigated through the use of linear mixed effects models. Development for individual learners was examined from a Dynamic Systems Theory perspective that focuses on the role of variability in language development as well as interconnected development for multiple indices of collconstruction production. This study has important implications for future research on L2 phraseology research and second language acquisition research as well as phraseology pedagogy
    corecore