Search CORE

74 research outputs found

A brain network for integration of tone and suffix

Author: Horne Merle
Roll Mikael
Söderström Pelle
Publication venue
Publication date: 01/01/2015
Field of study

Lund University Publications

DWS 2006: Proceedings of the fourth international workshop on dictionary writing systems, Tuesday 5th September 2006, Turin, Italy (Pre-EURALEX 2006)

Author: de Schryver Gilles-Maurice
Publication venue: (SF)2 Press
Publication date: 01/01/2006
Field of study

Ghent University Academic Bibliography

Recommended from our members

He done done or He’d undone? An investigation of second language listening processes with a particular focus on the processing of functional morphemes

Author: Schmidt-Renfree Nicola
Publication venue
Publication date: 09/12/2020
Field of study

This thesis contributes to the research on people’s abilities to listen to English as their second language. Language teaching research has suggested that language learners are not taught effectively how to listen in their second language (L2) and there is, additionally, an assumption that language learners will naturally develop their listening skills as their level of proficiency increases. The thesis aimed to show that the latter may happen but that it frequently does not, and there are many competent second language communicators whose listening skills are under-developed. The thesis further aimed to investigate whether specific training methods could facilitate improvements in listening abilities. The thesis investigated listening at the level of morpho-syntax, focusing on listeners’ abilities to recognise co-articulated and weakly stressed function words and functional morphemes. Seven studies were devised to test whether L2 listeners with higher levels of English proficiency have a deficit in recognising functional morphemes when listening, and whether deficits affect listeners’ abilities to produce reconstructions of the surface form of spoken sentences they have heard. L2 listeners’ results were compared with the results of a number of L1 participants. Levels of accuracy were significantly lower for the L2 listeners, even for those whose language levels were categorised as proficient (CEFR low C2). In the final two of the seven studies, L2 case study participants received training which drew their attention to co-articulations and weak stresses in spoken sentences. Training also included attention to a complex grammatical structure which had previously proved problematic for the listeners to produce. Post-training tests showed that training had had a positive effect on the L2 listeners’ abilities to recognise function words, functional morphemes, and the relevant complex grammatical structure. Implications of the training were discussed in the wider context of second language acquisition, language teaching, and psycholinguistic research

Sussex Research Online

Proceedings of the 17th Annual Conference of the European Association for Machine Translation

Author
Publication venue: Hrvatsko društvo za jezične tehnologije
Publication date: 01/01/2014
Field of study

Proceedings of the 17th Annual Conference of the European Association for Machine Translation (EAMT

Repozitorij Filozofskog fakulteta u Zagrebu' at University of Zagreb

Representation and Processing of Composition, Variation and Approximation in Language Resources and Tools

Author: Savary Agata
Publication venue: HAL CCSD
Publication date: 27/03/2014
Field of study

In my habilitation dissertation, meant to validate my capacity of and maturity for directingresearch activities, I present a panorama of several topics in computational linguistics, linguisticsand computer science.Over the past decade, I was notably concerned with the phenomena of compositionalityand variability of linguistic objects. I illustrate the advantages of a compositional approachto the language in the domain of emotion detection and I explain how some linguistic objects,most prominently multi-word expressions, defy the compositionality principles. I demonstratethat the complex properties of MWEs, notably variability, are partially regular and partiallyidiosyncratic. This fact places the MWEs on the frontiers between different levels of linguisticprocessing, such as lexicon and syntax.I show the highly heterogeneous nature of MWEs by citing their two existing taxonomies.After an extensive state-of-the art study of MWE description and processing, I summarizeMultiflex, a formalism and a tool for lexical high-quality morphosyntactic description of MWUs.It uses a graph-based approach in which the inflection of a MWU is expressed in function ofthe morphology of its components, and of morphosyntactic transformation patterns. Due tounification the inflection paradigms are represented compactly. Orthographic, inflectional andsyntactic variants are treated within the same framework. The proposal is multilingual: it hasbeen tested on six European languages of three different origins (Germanic, Romance and Slavic),I believe that many others can also be successfully covered. Multiflex proves interoperable. Itadapts to different morphological language models, token boundary definitions, and underlyingmodules for the morphology of single words. It has been applied to the creation and enrichmentof linguistic resources, as well as to morphosyntactic analysis and generation. It can be integratedinto other NLP applications requiring the conflation of different surface realizations of the sameconcept.Another chapter of my activity concerns named entities, most of which are particular types ofMWEs. Their rich semantic load turned them into a hot topic in the NLP community, which isdocumented in my state-of-the art survey. I present the main assumptions, processes and resultsissued from large annotation tasks at two levels (for named entities and for coreference), parts ofthe National Corpus of Polish construction. I have also contributed to the development of bothrule-based and probabilistic named entity recognition tools, and to an automated enrichment ofProlexbase, a large multilingual database of proper names, from open sources.With respect to multi-word expressions, named entities and coreference mentions, I pay aspecial attention to nested structures. This problem sheds new light on the treatment of complexlinguistic units in NLP. When these units start being modeled as trees (or, more generally, asacyclic graphs) rather than as flat sequences of tokens, long-distance dependencies, discontinu-ities, overlapping and other frequent linguistic properties become easier to represent. This callsfor more complex processing methods which control larger contexts than what usually happensin sequential processing. Thus, both named entity recognition and coreference resolution comesvery close to parsing, and named entities or mentions with their nested structures are analogous3to multi-word expressions with embedded complements.My parallel activity concerns finite-state methods for natural language and XML processing.My main contribution in this field, co-authored with 2 colleagues, is the first full-fledged methodfor tree-to-language correction, and more precisely for correcting XML documents with respectto a DTD. We have also produced interesting results in incremental finite-state algorithmics,particularly relevant to data evolution contexts such as dynamic vocabularies or user updates.Multilingualism is the leitmotif of my research. I have applied my methods to several naturallanguages, most importantly to Polish, Serbian, English and French. I have been among theinitiators of a highly multilingual European scientific network dedicated to parsing and multi-word expressions. I have used multilingual linguistic data in experimental studies. I believethat it is particularly worthwhile to design NLP solutions taking declension-rich (e.g. Slavic)languages into account, since this leads to more universal solutions, at least as far as nominalconstructions (MWUs, NEs, mentions) are concerned. For instance, when Multiflex had beendeveloped with Polish in mind it could be applied as such to French, English, Serbian and Greek.Also, a French-Serbian collaboration led to substantial modifications in morphological modelingin Prolexbase in its early development stages. This allowed for its later application to Polishwith very few adaptations of the existing model. Other researchers also stress the advantages ofNLP studies on highly inflected languages since their morphology encodes much more syntacticinformation than is the case e.g. in English.In this dissertation I am also supposed to demonstrate my ability of playing an active rolein shaping the scientific landscape, on a local, national and international scale. I describemy: (i) various scientific collaborations and supervision activities, (ii) roles in over 10 regional,national and international projects, (iii) responsibilities in collective bodies such as program andorganizing committees of conferences and workshops, PhD juries, and the National UniversityCouncil (CNU), (iv) activity as an evaluator and a reviewer of European collaborative projects.The issues addressed in this dissertation open interesting scientific perspectives, in whicha special impact is put on links among various domains and communities. These perspectivesinclude: (i) integrating fine-grained language data into the linked open data, (ii) deep parsingof multi-word expressions, (iii) modeling multi-word expression identification in a treebank as atree-to-language correction problem, and (iv) a taxonomy and an experimental benchmark fortree-to-language correction approaches

Thèses en Ligne

HAL Université de Tours

CLARIN

Author
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 30/01/2023
Field of study

The book provides a comprehensive overview of the Common Language Resources and Technology Infrastructure – CLARIN – for the humanities. It covers a broad range of CLARIN language resources and services, its underlying technological infrastructure, the achievements of national consortia, and challenges that CLARIN will tackle in the future. The book is published 10 years after establishing CLARIN as an Europ. Research Infrastructure Consortium

Directory of Open Access Books (DOAB)

CLARIN. The infrastructure for language resources

Author: Fišer Darja
Witt Andreas
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 17/10/2022
Field of study

CLARIN, the "Common Language Resources and Technology Infrastructure", has established itself as a major player in the field of research infrastructures for the humanities. This volume provides a comprehensive overview of the organization, its members, its goals and its functioning, as well as of the tools and resources hosted by the infrastructure. The many contributors representing various fields, from computer science to law to psychology, analyse a wide range of topics, such as the technology behind the CLARIN infrastructure, the use of CLARIN resources in diverse research projects, the achievements of selected national CLARIN consortia, and the challenges that CLARIN has faced and will face in the future. The book will be published in 2022, 10 years after the establishment of CLARIN as a European Research Infrastructure Consortium by the European Commission (Decision 2012/136/EU)

Publikationsserver des Instituts für Deutsche Sprache

Aspects of the Syntax, Production and Pragmatics of code-switching - with special reference to Cantonese-English

Author: Chan Brian Hok-shing
Publication venue: UCL (University College London)
Publication date: 01/01/1999
Field of study

This dissertation argues for the position that code-switching utterances are constrained by the same set of mechanisms as those which govern monolingual utterances. While this thesis is in line with more recent code-switching theories (e.g. Belazi et al. 1994, MacSwan 1997, Mahootian 1993), this dissertation differs from those works in making two specific claims: Firstly, functional categories and lexical categories exhibit different syntactic behaviour in code-switching. Secondly, codeswitching is subject to the same principles not only in syntax, but also in production and pragmatics. Chapter 2 presents a critical review of constraints and processing models previously proposed in the literature. It is suggested that in view of the vast variety of data, no existing model is completely adequate. Nevertheless, it is argued that a model which does not postulate syntactic constraints (along the lines of Mahootian 1993, MacSwan 1997) or production principles (along the lines of de Bot 1992) specific to code switching is to be preferred on cognitive and theoretical grounds. Chapter 3 concerns word order between lexical heads and their complements in code-switching. It is shown that the language of a lexical head (i.e. noun or verb) may or may not determine the word order of its complement. Chapter 4 investigates word order between functional heads and their complements in code-switching. Contrary to the case with lexical categories, the language of functional heads (e.g. D, I and C) is shown to determine the word order of their complements in code-switching. It is proposed that word order between heads (lexical or functional) and complements is governed by head-parameters, and the difference between lexical heads and functional heads is due to their differential processing and production in terms of Levelt's (1989) algorithm. Chapter 5 investigates the selection properties of functional categories in codeswitching, with special reference to Cantonese-English. Contrary to the Functional Head Constraint (Belazi et al. 1994), it is shown that code-switching can occur freely between functional heads and their complements, provided that the c-selection requirements of the functional heads are satisfied. Chapter 6 investigates the selection properties of lexical categories in code-switching, again with special reference to Cantonese-English. It is shown that "language-specific" c-selection properties need not be observed: a Cantonese verb may take an English DP whereas an English verb may take a Cantonese demonstrative phrase (DemP). Similar phenomena are drawn from other language-pairs involving a language with morphological case and a language without morphological case. The difference between functional categories and lexical categories in their selection properties is again explained in terms of the different production processes they undergo. Chapter 7 is devoted to prepositions which have been problematic in terms of their status as a functional category or a lexical category. Based on the behaviour of prepositions in code-switching, it is suggested that prepositions display a dual character. It is proposed that prepositions may well point to the fact that the conventional dichotomy between functional categories and lexical categories is not a primitive one in the lexicon. Chapter 8 looks at code-switching in a wider perspective. and explores the pragmatic determinants of code-switching in the light of Relevance Theory (Sperber and Wilson 1995). It is argued that many types of code-switching (e.g. repetitions, quotations, etc.) are motivated by the desire to optimize the "relevance" of a message, with "relevance" as defined in Relevance Theory

UCL Discovery

Proceedings of the 13th Linguistic Annotation Workshop, August 1, 2019, Florence, Italy

Author: Friedrich Annemarie
Hoek Jet
Zeyrek Deniz
Publication venue
Publication date: 07/07/2023
Field of study

OPUS Augsburg

Recent Trends in Computational Intelligence

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

Traditional models struggle to cope with complexity, noise, and the existence of a changing environment, while Computational Intelligence (CI) offers solutions to complicated problems as well as reverse problems. The main feature of CI is adaptability, spanning the fields of machine learning and computational neuroscience. CI also comprises biologically-inspired technologies such as the intellect of swarm as part of evolutionary computation and encompassing wider areas such as image processing, data collection, and natural language processing. This book aims to discuss the usage of CI for optimal solving of various applications proving its wide reach and relevance. Bounding of optimization methods and data mining strategies make a strong and reliable prediction tool for handling real-life applications

Directory of Open Access Books (DOAB)