47 research outputs found

    Automatic reconstruction of itineraries from descriptive texts

    This thesis is part of the PERDIDO project, whose objectives are the extraction and reconstruction of itineraries from textual documents. The work was carried out in collaboration between the LIUPPA laboratory of the Université de Pau et des Pays de l'Adour (France), the Advanced Information Systems group (IAAA) of the Universidad de Zaragoza, and the COGIT laboratory of IGN (France). The aim of the thesis is to design an automatic system that extracts movements from travel guides or itinerary descriptions and represents them on a map. We propose an approach for the automatic representation of itineraries described in natural language. Our proposal is divided into two main tasks. The first aims to identify and extract, from texts describing itineraries, information such as spatial entities and expressions of motion or perception. The second task is the reconstruction of the itinerary. Our proposal combines local information extracted through natural language processing with data from external geographic sources (for example, gazetteers). The spatial annotation stage uses an approach that combines part-of-speech tagging with lexico-syntactic patterns (a cascade of transducers) in order to annotate spatial named entities and expressions of motion and perception. A first contribution to the first task is toponym disambiguation, a problem that remains poorly solved within Named Entity Recognition (NER) and is essential to geographic information retrieval.
We propose an unsupervised georeferencing algorithm based on a clustering technique, which can both disambiguate toponyms found in external geographic resources and locate toponyms that are not referenced in them. We propose a generic graph model for the automatic reconstruction of itineraries, in which each node represents a place and each edge represents a path linking two places. The originality of our model is that, in addition to the usual elements (paths and waypoints), it can represent other elements involved in the description of an itinerary, such as visual landmarks. A minimum spanning tree is computed from a weighted graph to obtain an itinerary automatically in the form of a graph. Each edge of the initial graph is weighted using a multi-criteria analysis method that combines qualitative and quantitative criteria. The values of these criteria are determined from information extracted from the text and from external geographic resources. For example, information produced by natural language processing, such as spatial relations describing an orientation (e.g. heading south), is combined with the geographic coordinates of places found in the resources to determine the value of the "spatial relation" criterion. In addition, starting from the definition of the concept of itinerary and from the information used in language to describe an itinerary, we modelled a spatial information annotation language suited to the description of movements, building on the recommendations of the TEI (Text Encoding Initiative) consortium.
Finally, the different stages of our approach were implemented and evaluated on a multilingual corpus of descriptions of trails and hikes (French, Spanish, Italian).
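
The reconstruction step described in this abstract (a minimum spanning tree over a graph whose edges are weighted by several criteria) can be illustrated with a minimal sketch. It is not the thesis's implementation: the place names, coordinates, the two criteria (distance and a textual link), the penalty value and the weights are all hypothetical, and Kruskal's algorithm is used here only as one standard way to compute a minimum spanning tree.

```python
from itertools import combinations
from math import hypot

# Hypothetical places with planar coordinates (illustrative values only).
places = {"A": (0.0, 0.0), "B": (1.0, 0.0), "C": (1.0, 1.0), "D": (5.0, 5.0)}

# Hypothetical qualitative criterion: pairs of places that the text links
# with a motion expression.
linked_in_text = {frozenset(("A", "B")), frozenset(("B", "C")), frozenset(("C", "D"))}


def edge_cost(u, v, w_dist=0.5, w_text=0.5):
    """Combine a quantitative criterion (distance) with a qualitative one (textual link)."""
    (x1, y1), (x2, y2) = places[u], places[v]
    distance = hypot(x2 - x1, y2 - y1)
    # Assumed penalty for pairs that the text never connects.
    text_penalty = 0.0 if frozenset((u, v)) in linked_in_text else 10.0
    return w_dist * distance + w_text * text_penalty


def minimum_spanning_tree(nodes, cost):
    """Kruskal's algorithm over the complete graph on `nodes`, using union-find."""
    parent = {n: n for n in nodes}

    def find(n):
        # Follow parent links to the root, shortening the path on the way.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    tree = []
    candidate_edges = sorted(combinations(sorted(nodes), 2), key=lambda e: cost(*e))
    for u, v in candidate_edges:
        root_u, root_v = find(u), find(v)
        if root_u != root_v:  # the edge joins two components, so keep it
            parent[root_u] = root_v
            tree.append((u, v))
    return tree


itinerary_edges = minimum_spanning_tree(places, edge_cost)
```

With these made-up inputs the retained edges form the chain A-B, B-C, C-D, because the pairs linked in the text carry no penalty and therefore have the lowest costs.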

    Proceedings of the VIIth GSCP International Conference

    The 7th International Conference of the Gruppo di Studi sulla Comunicazione Parlata, dedicated to the memory of Claire Blanche-Benveniste, chose Speech and Corpora as its main theme. The wide international origin of the 235 authors, from 21 countries and 95 institutions, led to papers on many different languages. The 89 papers of this volume reflect the themes of the conference: spoken corpora compilation and annotation, together with the related technological fields; the relation between prosody and pragmatics; speech pathologies; and various papers on phonetics, speech and linguistic analysis, pragmatics and sociolinguistics. Many papers are also dedicated to speech and second language studies. The online publication with FUP allows direct access to sound and video linked to papers (when downloaded).

    Governance for Sustainability in Forest Communities: Case Study of the Antoine-Labelle Regional County Municipality in Quebec

    Significant progress towards sustainability has been made in recent decades. However, persistent and deepening unsustainable trajectories underline that mere continuation of current strategies will not suffice in reversing unsustainable trends. Governance for sustainability emerged as a field of inquiry aiming at prescribing decision-making structures and processes to support progress towards sustainability. Governance for sustainability in a complex and dynamic world poses a perplexing challenge. Goals are ambiguous; uncertainty is inherent; power is distributed and knowledge is incomplete (Leach, Scoones, & Stirling, 2010; Newig, Voß, & Monstadt, 2008). In this emerging context, the objective of governance is not to steer society towards perspicuous preferences but to iteratively and collectively negotiate and redefine actions and goals (Leach, Scoones, & Stirling, 2010; Meadowcroft, 2008) in order to influence and better cope with social-ecological system change. Core principles of sustainability serve as an ethical compass for deliberations and decision-making and need to be specified according to the different contexts, needs and options available. Sustainability assessment overlaps with, and plays a role in governance for sustainability. In the field of sustainability, different approaches such as transition management, resilience and transformation in social-ecological systems, and future studies are directing attention towards understanding change (and resistance to change) and how it can be influenced intentionally in a complex and dynamic world. This dissertation integrates lessons on how to generate intentional change into Gibson’s (2017, 2005) sustainability assessment framework and specifies sustainability requirements for their application to forest communities. 
Forest communities in Canada and in many parts of the world are faced with the challenge of responding to a recent forestry crisis that has revealed their vulnerability to narrow economic dependence on global paper and lumber markets while also taking part in the broader quest to reverse deepening unsustainable societal trajectories that are common to many rural and remote communities. Two recent local governance initiatives in the Antoine-Labelle region in Quebec – a Vision exercise and The Bourdon project, a forest community part of the Canadian Forest Communities Program – serve as ground for exploration and application. The Antoine-Labelle region was greatly impacted by the Canadian forestry crisis that struck most dramatically early in this century. Employment within the forest sector was reduced by more than 50 percent between 2004 and 2007 and the industrial structure was significantly modified. A qualitative approach assessing results according to three categories (i.e., significant and minor contributions to sustainability, and unmet criteria) was used to gain insights into the areas of strengths and weaknesses of the initiatives and identifying routes for improvement in sustainability contributions. The assessment led to three major findings. First, the assessment uncovered a need for increased capacities for environmental stewardship and for local governance actors and citizens to be involved in forest governance. This finding responds to a long-standing problem, identified in the historical account of the evolution of forest management policies in Canada, which illustrates that forest communities, including First Nation communities, were repeatedly excluded from forest management decision-making. The second major finding concerns path dependence and the marginalization of forest communities by provincial policies. 
While both initiatives were disruptive in opening-up forest governance and local governance to other stakeholders and First Nation communities, progress remains tentative. Findings indicate that while the forestry crisis brought reasons to examine broader options for forest community futures, it also increased the pressure to narrow considerations and prioritize economic imperatives. The third finding relates to the importance of lack of political will as a significant impasse to governance for sustainability. Gaining political support in a time of crisis and when funds are available is a relatively simple task. Moving beyond ad hoc initiatives and following through with organizational and institutional change is where the real challenge lies. Lasting contributions towards sustainability remain tentative and highly dependent upon the willingness of local actors to continue building on acquired capacities, knowledge and experience. This dissertation contributes to sustainability assessment literature and recent attempts to integrate resilience and transition management lessons into sustainability assessment criteria. Further insights into future studies were drawn by putting forward key requirements for scenario building and vision exercises and by highlighting the differences in how different approaches frame the use of future studies. Another contribution pertains to the originality of the specified assessment framework for forest communities and its complementarity to other assessment frameworks. In addition, the case study of the Antoine-Labelle region and the assessment of two local governance initiatives that were put forward represent a substantive contribution to understanding possible pathways forward as well as the region’s strengths and weaknesses based on a sustainability perspective

    Clitics in the wild

    This collective monograph is the first data-oriented, empirical, in-depth study of the system of clitics in Bosnian, Croatian and Serbian. It fills the gap between the theoretical and normative literature by including solid data on variation found in dialects and spoken language, obtained from massive web corpora and speakers' acceptability judgements. The authors investigate three primary sources of variation: inventory, placement and morphonological processes. A separate part of the book is dedicated to the phenomenon of clitic climbing, a major challenge for any syntactic theory. The theory of complexity serves as the explanation for the very diverse constraints on clitic climbing established in the empirical studies. It allows the construction of a series of hierarchies in which the factors relevant for predicting clitic climbing interact with each other. Thus, the study pushes our understanding of clitics away from fine-grained descriptions and syntactic generalisations towards a probabilistic modelling of syntax.

    Analyse syntaxique robuste du français : concilier méthodes statistiques et connaissances linguistiques dans l'outil Talismane

    In this thesis we explore robust statistical syntax analysis for French. Our main concern is to explore methods whereby the linguist can inject linguistic knowledge and/or resources into the robust statistical engine in order to improve results for specific phenomena. We first explore the dependency annotation schema for French, concentrating on certain phenomena. Next, we look into the various algorithms capable of producing this annotation, and in particular the transition-based parsing algorithm used in the rest of this thesis. After exploring supervised machine learning algorithms for NLP classification problems, we present the Talismane toolkit for syntax analysis, built within the framework of this thesis, including four statistical modules (sentence boundary detection, tokenisation, pos-tagging and parsing) as well as the various linguistic resources used for the baseline model, including corpora, lexicons and feature sets. Our first experiments attempt various machine learning configurations in order to identify the best baseline. We then look into improvements made possible by beam search and beam propagation. Next, we present a series of experiments aimed at correcting errors related to specific linguistic phenomena, using targeted features. One of our innovations is the introduction of rules that can impose or prohibit certain decisions locally, thus bypassing the statistical model. We explore the usage of rules for errors that the features are unable to correct. Finally, we look into the enhancement of targeted features by large-scale linguistic resources, and in particular a semi-supervised approach using a distributional semantic resource.
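
The idea of rules that impose or prohibit a local decision, bypassing the statistical model, can be illustrated with a minimal sketch. It does not reproduce Talismane's rule syntax or API: the function `choose_transition`, the transition names, the scores, the state fields and the example rule are all hypothetical.

```python
def choose_transition(scores, state, rules):
    """Pick a parser transition, letting rules override the statistical scores.

    scores: dict mapping transition name -> model probability (hypothetical).
    state:  dict describing the current parser configuration (hypothetical).
    rules:  list of (condition, kind, transition), kind in {"impose", "prohibit"}.
    """
    allowed = dict(scores)
    for condition, kind, transition in rules:
        if not condition(state):
            continue
        if kind == "impose":
            return transition  # bypass the model entirely
        if kind == "prohibit":
            allowed.pop(transition, None)
    if not allowed:
        # Every candidate was prohibited; fall back to the unconstrained scores.
        allowed = scores
    return max(allowed, key=allowed.get)


# Hypothetical rule: prohibit LEFT-ARC when a determiner is on top of the stack
# and a verb is next in the buffer.
rules = [
    (lambda s: s["stack_pos"] == "DET" and s["buffer_pos"] == "V", "prohibit", "LEFT-ARC"),
]
scores = {"SHIFT": 0.35, "LEFT-ARC": 0.60, "RIGHT-ARC": 0.05}
state = {"stack_pos": "DET", "buffer_pos": "V"}

chosen = choose_transition(scores, state, rules)
unconstrained = choose_transition(scores, state, [])
```

In this made-up configuration the model alone would pick LEFT-ARC, while the prohibiting rule forces the next-best transition, SHIFT.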

    Technology, culture and HRM : a 'neo-contingency' Anglo-French comparison

    The objective of this research is to provide a new perspective on how technology, culture and human resources management policies and practices (HRMPP) are approached by firms located in France and England. This thesis offers a new perspective, the 'neo-contingency' approach, which analyses both the contingency and the divergence theories. The contingency variable chosen in this study is technology. The divergence theory is specifically illustrated by the national education approach. The HRMPP studied are: recruitment and selection, organisation, training and compensation. One hundred and sixty-three high-, mid- and low-tech firms answered a questionnaire, which is analysed along with twenty-five semi-structured interviews in eight firms located in both France and England. The technological aspect of the neo-contingency approach is validated in two factors: 1) technological profile and 2) long-term approach to training. The cultural aspect is notable in two cases: 1) long-term approach to training, and 2) compensation based on performance. A key implication of the findings in this thesis is that employees working in intensive technology firms need a creative and adaptive HR management approach, which would better enable them to cope with the challenge presented by the business environment. The results that this thesis reports are illustrated by the strong influence of educational systems on managers' behaviour. French firms seem to prefer higher levels of control and formalisation in recruitment and selection, training and compensation than the UK, which could be interpreted as a reflection of the French educational system. It is important to highlight that the culture and technology factors cannot be the total 'determinants' of organisations. At most, they are important features that, along with others, 'influence' organisations' internal operation.
Future 'neo-contingency' studies using other organisational characteristics as moderators and a large sample are needed in order to gain insights into the neo-contingency approach proposed in this thesis.
EThOS - Electronic Theses Online Service, GB, United Kingdom

    First International Workshop on Lexical Resources

    Lexical resources are one of the main sources of linguistic information for research and applications in Natural Language Processing and related fields. In recent years advances have been achieved in both symbolic aspects of lexical resource development (lexical formalisms, rule-based tools) and statistical techniques for the acquisition and enrichment of lexical resources, both monolingual and multilingual. The latter have allowed for faster development of large-scale morphological, syntactic and/or semantic resources, for widely-used as well as resource-scarce languages. Moreover, the notion of dynamic lexicon is used increasingly for taking into account the fact that the lexicon undergoes a permanent evolution. This workshop aims at sketching a large picture of the state of the art in the domain of lexical resource modeling and development. It is also dedicated to research on the application of lexical resources for improving corpus-based studies and language processing tools, both in NLP and in other language-related fields, such as linguistics, translation studies, and didactics.

    Tools for linguistic variation

    Contents:
    - Introducción a los problemas y métodos según los principios de la Escuela Dialectométrica de Salzburgo (con ejemplos sacados del “Atlante Italo-Svizzero”, AIS) (Hans Goebl)
    - Some further dialectometrical steps (John Nerbonne, Jelena Prokic, Martijn Wieling and Charlotte Gooskens)
    - Tools for dialect syntax: the case of CORDIAL-SIN (an annotated corpus of Portuguese dialects) (Ernestina Carrilho)
    - Le projet Vivaldi: présentation d’un atlas linguistique parlant virtuel (Roland Bauer)
    - Le Thesaurus Occitan: une base de données multimedia consacrée aux dialectes occitans (Guylaine Brun-Trigaud)
    - The Thesaurus Occitan: a multimedia database dedicated to Occitan dialects; presentation of its morphosyntax module (Pierre-Aurélien Georges)
    - New methods for the study of grammatical variation and the Audible Corpus of Spoken Rural Spanish (Inés Fernández Ordóñez)
    - The application of speech synthesis and speech recognition techniques in dialectal studies (María Pilar Perea)
    - Relevancia del análisis lingüístico en el tratamiento cuantitativo de la variación dialectal (Esteve Clua)
    - El procesamiento informático de los materiales del Atlas Lingüístico de la Península Ibérica de Tomás Navarro Tomás (Pilar García Mouton)
    - Un retrato del artículo vasco en el año 1895 mediante el programa VDM (Ekaitz Santazilia)
    - Technology for prosodic variation (Gotzon Aurrekoetxea and Aitor Iglesias)

    The lexeme in descriptive and theoretical morphology

    After dominating the field for about a century, since its invention by Baudouin de Courtenay at the end of the nineteenth century, the morpheme is increasingly being replaced by the lexeme in contemporary descriptive and theoretical morphology. The notion of a lexeme is usually associated with the work of P. H. Matthews (1972, 1974), who characterizes it as a lexical entity abstracting over individual inflected words. Over the last three decades, the lexeme has become a cornerstone of much work in both inflectional morphology and word formation (or, as it is increasingly being called, lexeme formation). The papers in the present volume take stock of the descriptive and theoretical usefulness of the lexeme, but also address many of the challenges met by classical lexeme-based theories of morphology.