152 research outputs found

    Promoting multiword expressions in A* TAG parsing

    Get PDF
    International audienceMultiword expressions (MWEs) are pervasive in natural languages and often have both idiomatic and compositional readings, which leads to high syntactic ambiguity. We show that for some MWE types idiomatic readings are usually the correct ones. We propose a heuristic for an A* parser for Tree Adjoining Grammars which benefits from this knowledge by promoting MWE-oriented analyses. This strategy leads to a substantial reduction in the parsing search space in case of true positive MWE occurrences, while avoiding parsing failures in case of false positives

    Representation and Processing of Composition, Variation and Approximation in Language Resources and Tools

    Get PDF
    In my habilitation dissertation, meant to validate my capacity of and maturity for directingresearch activities, I present a panorama of several topics in computational linguistics, linguisticsand computer science.Over the past decade, I was notably concerned with the phenomena of compositionalityand variability of linguistic objects. I illustrate the advantages of a compositional approachto the language in the domain of emotion detection and I explain how some linguistic objects,most prominently multi-word expressions, defy the compositionality principles. I demonstratethat the complex properties of MWEs, notably variability, are partially regular and partiallyidiosyncratic. This fact places the MWEs on the frontiers between different levels of linguisticprocessing, such as lexicon and syntax.I show the highly heterogeneous nature of MWEs by citing their two existing taxonomies.After an extensive state-of-the art study of MWE description and processing, I summarizeMultiflex, a formalism and a tool for lexical high-quality morphosyntactic description of MWUs.It uses a graph-based approach in which the inflection of a MWU is expressed in function ofthe morphology of its components, and of morphosyntactic transformation patterns. Due tounification the inflection paradigms are represented compactly. Orthographic, inflectional andsyntactic variants are treated within the same framework. The proposal is multilingual: it hasbeen tested on six European languages of three different origins (Germanic, Romance and Slavic),I believe that many others can also be successfully covered. Multiflex proves interoperable. Itadapts to different morphological language models, token boundary definitions, and underlyingmodules for the morphology of single words. It has been applied to the creation and enrichmentof linguistic resources, as well as to morphosyntactic analysis and generation. It can be integratedinto other NLP applications requiring the conflation of different surface realizations of the sameconcept.Another chapter of my activity concerns named entities, most of which are particular types ofMWEs. Their rich semantic load turned them into a hot topic in the NLP community, which isdocumented in my state-of-the art survey. I present the main assumptions, processes and resultsissued from large annotation tasks at two levels (for named entities and for coreference), parts ofthe National Corpus of Polish construction. I have also contributed to the development of bothrule-based and probabilistic named entity recognition tools, and to an automated enrichment ofProlexbase, a large multilingual database of proper names, from open sources.With respect to multi-word expressions, named entities and coreference mentions, I pay aspecial attention to nested structures. This problem sheds new light on the treatment of complexlinguistic units in NLP. When these units start being modeled as trees (or, more generally, asacyclic graphs) rather than as flat sequences of tokens, long-distance dependencies, discontinu-ities, overlapping and other frequent linguistic properties become easier to represent. This callsfor more complex processing methods which control larger contexts than what usually happensin sequential processing. Thus, both named entity recognition and coreference resolution comesvery close to parsing, and named entities or mentions with their nested structures are analogous3to multi-word expressions with embedded complements.My parallel activity concerns finite-state methods for natural language and XML processing.My main contribution in this field, co-authored with 2 colleagues, is the first full-fledged methodfor tree-to-language correction, and more precisely for correcting XML documents with respectto a DTD. We have also produced interesting results in incremental finite-state algorithmics,particularly relevant to data evolution contexts such as dynamic vocabularies or user updates.Multilingualism is the leitmotif of my research. I have applied my methods to several naturallanguages, most importantly to Polish, Serbian, English and French. I have been among theinitiators of a highly multilingual European scientific network dedicated to parsing and multi-word expressions. I have used multilingual linguistic data in experimental studies. I believethat it is particularly worthwhile to design NLP solutions taking declension-rich (e.g. Slavic)languages into account, since this leads to more universal solutions, at least as far as nominalconstructions (MWUs, NEs, mentions) are concerned. For instance, when Multiflex had beendeveloped with Polish in mind it could be applied as such to French, English, Serbian and Greek.Also, a French-Serbian collaboration led to substantial modifications in morphological modelingin Prolexbase in its early development stages. This allowed for its later application to Polishwith very few adaptations of the existing model. Other researchers also stress the advantages ofNLP studies on highly inflected languages since their morphology encodes much more syntacticinformation than is the case e.g. in English.In this dissertation I am also supposed to demonstrate my ability of playing an active rolein shaping the scientific landscape, on a local, national and international scale. I describemy: (i) various scientific collaborations and supervision activities, (ii) roles in over 10 regional,national and international projects, (iii) responsibilities in collective bodies such as program andorganizing committees of conferences and workshops, PhD juries, and the National UniversityCouncil (CNU), (iv) activity as an evaluator and a reviewer of European collaborative projects.The issues addressed in this dissertation open interesting scientific perspectives, in whicha special impact is put on links among various domains and communities. These perspectivesinclude: (i) integrating fine-grained language data into the linked open data, (ii) deep parsingof multi-word expressions, (iii) modeling multi-word expression identification in a treebank as atree-to-language correction problem, and (iv) a taxonomy and an experimental benchmark fortree-to-language correction approaches

    Power System Simulation, Control and Optimization

    Get PDF
    This Special Issue “Power System Simulation, Control and Optimization” offers valuable insights into the most recent research developments in these topics. The analysis, operation, and control of power systems are increasingly complex tasks that require advanced simulation models to analyze and control the effects of transformations concerning electricity grids today: Massive integration of renewable energies, progressive implementation of electric vehicles, development of intelligent networks, and progressive evolution of the applications of artificial intelligence

    Universal Logic and the Geography of Thought - Reflections on logical pluralism in the light of culture

    Get PDF
    The aim of this dissertation is to provide an analysis for those involved and interested in the interdisciplinary study of logic, particularly Universal Logic. While continuing to remain aware of the importance of the central issues of logic, we hope that the factor of culture is also given serious consideration. Universal Logic provides a general theory of logic to study the most general and abstract properties of the various possible logics. As well as elucidating the basic knowledge and necessary definitions, we would especially like to address the problems of motivation concerning logical investigations in different cultures. First of all, I begin by considering Universal Logic as understood by Jean-Yves BĂ©ziau, and examine the basic ideas underlying the Universal Logic project. The basic approach, as originally employed by Universal Logicians, is introduced, after which the relationship between algebras and logics at an abstract level is discussed, i.e., Universal Algebra and Universal Logic. Secondly,I focus on a discussion of the translation paradox , which will enable readers to become more familiar with the new subject of logical translation, and subsequently comprehensively summarize its development in the literature. Besides helping readers to become more acquainted with the concept of logical translation, the discussion here will also attempt to formulate a new direction in support of logical pluralism as identified by Ruldof Carnap (1934), JC Beall and Greg Restall (2005), respectively. Thirdly, I provide a discussion of logical pluralism. Logical pluralism can be traced back to the principle of tolerance raised by Ruldof Carnap (1934), and readers will gain a comprehensive understanding of this concept from the discussion. Moreover,an attempt will be made to clarify the real and important issues in the contemporary debate between pluralism and monism within the field of logic in general. Fourthly, I study the phenomena of cultural-difference as related to the geography of thought. Two general systems in the geography of thought are distinguished, which we here call thought-analytic and thought-holistic. They are proposed to analyze and challenge the universality assumption regarding cognitive processes. People from different cultures and backgrounds have many differences in diverse areas, and these differences, if taken for granted, have proven particularly problematic in understanding logical thinking across cultures. Interestingly, the universality of cognitive processes has been challenged, especially by Richard Nisbett s research in cultural psychology. With respect to these concepts, C-UniLog can also be considered in relation to empirical evidence obtained by Richard Nisbett et al. In the final stage of this dissertation, I will propose an interpretation of the concept of logical translation, i.e., translations between formal logical mode (as cognitive processes in the case of westerners) and dialectical logical mode (as cognitive processes in the case of Asians). From this, I will formulate a new interpretation of the principle of tolerance, as well as of logical pluralism

    E-Learning

    Get PDF
    E-learning enables students to pace their studies according to their needs, making learning accessible to (1) people who do not have enough free time for studying - they can program their lessons according to their available schedule; (2) those far from a school (geographical issues), or the ones unable to attend classes due to some physical or medical restriction. Therefore, cultural, geographical and physical obstructions can be removed, making it possible for students to select their path and time for the learning course. Students are then allowed to choose the main objectives they are suitable to fulfill. This book regards E-learning challenges, opening a way to understand and discuss questions related to long-distance and lifelong learning, E-learning for people with special needs and, lastly, presenting case study about the relationship between the quality of interaction and the quality of learning achieved in experiences of E-learning formation

    Unmet goals of tracking: within-track heterogeneity of students' expectations for

    Get PDF
    Educational systems are often characterized by some form(s) of ability grouping, like tracking. Although substantial variation in the implementation of these practices exists, it is always the aim to improve teaching efficiency by creating homogeneous groups of students in terms of capabilities and performances as well as expected pathways. If students’ expected pathways (university, graduate school, or working) are in line with the goals of tracking, one might presume that these expectations are rather homogeneous within tracks and heterogeneous between tracks. In Flanders (the northern region of Belgium), the educational system consists of four tracks. Many students start out in the most prestigious, academic track. If they fail to gain the necessary credentials, they move to the less esteemed technical and vocational tracks. Therefore, the educational system has been called a 'cascade system'. We presume that this cascade system creates homogeneous expectations in the academic track, though heterogeneous expectations in the technical and vocational tracks. We use data from the International Study of City Youth (ISCY), gathered during the 2013-2014 school year from 2354 pupils of the tenth grade across 30 secondary schools in the city of Ghent, Flanders. Preliminary results suggest that the technical and vocational tracks show more heterogeneity in student’s expectations than the academic track. If tracking does not fulfill the desired goals in some tracks, tracking practices should be questioned as tracking occurs along social and ethnic lines, causing social inequality

    Sustainable Production in Food and Agriculture Engineering

    Get PDF
    This book is a collection of original research and review papers that report on the state of the art and recent advancements in food and agriculture engineering, such as sustainable production and food technology. Encompassed within are applications in food and agriculture engineering, biosystem engineering, plant and animal production engineering, food and agricultural processing engineering, storing industry, economics and production management and agricultural farms management, agricultural machines and devices, and IT for agricultural engineering and ergonomics in agriculture

    Explanation Techniques using Markov Logic Networks

    Get PDF
    Explanation Techniques using Markov Logic Network
    • …
    corecore