Search CORE

316 research outputs found

An exploratory data-driven analysis for describing discourse organization

Author: Ho-Dac Lydia-Mai
Publication venue: Frankfurt/Berlin: Peter Lang
Publication date: 01/01/2010
Field of study

International audienceThis paper focuses on the role of elements placed in the initial position i.e. elements fulfilling the role of Theme in discourse organisation. The large-scale corpus study proposes a new methodology based on automatic tagging and quantitative analysis of the discourse roles of sentence-initial elements. The theoretically-based hypothesis is that initial position has an important function in discourse organisation. Initial position, defined as the starting point of the message, is composed of the first elements that the reader perceives. The analysis of the distribution and the use in discourse of these elements gives us a great overview on the textual organisation of different types of text

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

Discourse organisation through Theme position

Author: Ho-Dac Lydia-Mai
Publication venue: Fakultät 4 - Philosophische Fakultät II. Fachrichtung 4.6 - Angewandte Sprachwissenschaft sowie Übersetzen und Dolmetschen
Publication date: 01/01/2008
Field of study

This paper focuses on the role of elements placed in the initial position i.e. elements fulfilling the role of Theme in discourse organisation. The large-scale corpus study proposes a new methodology based on quantitative analysis of the discourse roles of sentence-initial elements. The theoretically-based hypothesis is that Theme position has an important function in discourse organisation. Theme, defined as the starting point of the message, is composed of the first elements that the reader perceives. The analysis of the distribution and the use in discourse of these elements gives us a great overview on the textual organisation of different types of text

La subjectivité à travers les médias : étude comparée de les médias participatifs et de la presse traditionnelle

Author: Ho-Dac Lydia-Mai
Küppers Anne
Publication venue: Bases, Corpus, Langage - UMR 7320
Publication date: 01/01/2011
Field of study

National audienceSubjectivity in mass media: comparing participatory and traditional journalese This paper investigates linguistic differences and similarities in a traditional newspaper and participatory on-line media in order to examine to what extent the on-line production has an impact on the language use. Our method of investigation is based on corpus analysis. We analyze a large-scale corpus (8,000,000 words) composed of datasets representing different steps on a graduate scale from traditional printed newspaper to online citizen press.Cette étude propose une analyse des différences et similitudes linguistiques dans la presse écrite traditionnelle et les médias participatifs en ligne afin d'évaluer dans quelle mesure la production et la diffusion en ligne peuvent modifier nos usages linguistiques. Les analyses effectuées se basent sur un large corpus (8 millions de mots) qui représente des modes d'expression et des degrés de subjectivité a priori différents

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

OpenEdition

Private State in Public Media: Subjectivity in French Traditional and Online News

Author: Ho-Dac Lydia-Mai
Küppers A.
Publication venue: HAL CCSD
Publication date: 17/08/2010
Field of study

International audienceThis paper reports on ongoing work dealing with the linguistic impact of putting the news on-line. In this framework, we investigate differences in one traditional newspaper and two forms of alternative on-line media with respect to the expression of authorial stance. Our research is based on a comparable large-scale corpus of articles published on the websites of the three respective media and aims at answering the question to what extent the presence of the author varies in the different media. - Is it a matter of amount and mode of the author's presence? - Is it a matter of lexical choice and diversity? - If this were the case, what expressions are used in the respective media? Our endeavour will be a methodological one. We firstly present our data, and thus describe the different news media included in our analysis and the diverse computer aided and manual production steps we performed in order to build up the corpus. Secondly, we outline our working hypotheses that are linked to the chosen types of media and describe the theoretical framework within which they are situated. Thirdly, we present our research method as well as some first results and insights gained throughout the pilot study of our data

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

Les discussions Wikipedia : un corpus pour caractériser le genre « discussion »

Author: Ho-Dac Lydia-Mai
Laippala Veronika
Publication venue: HAL CCSD
Publication date: 23/10/2015
Field of study

International audienceCette présentation propose une description des caractéristiques intra-linguistiques des discussions Wikipedia, forum de discussion associé à chaque article de l'encyclopédie Wikipedia. Après un exposé des propriétés qui font de ces textes un objet d'étude particulièrement intéressant pour les linguistiques de corpus, nous présenterons la procédure de constitution du corpus de discussion et une première description quantitative du corpus constitué. Nous finirons sur une présentation rapide d'un ensemble d'études linguistiques envisagées sur ce corpus

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

A kinematic study of coarticulation of Cantonese fricative /s/ using electromagnetic articulography (EMA)

Author: Lai Ho-ning, Lydia
黎浩寧
Publication venue: The University of Hong Kong (Pokfulam, Hong Kong)
Publication date: 01/01/2009
Field of study

Includes bibliographical references (p. 25-29).Thesis (B.Sc)--University of Hong Kong, 2009."A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, June 30, 2009."published_or_final_versionSpeech and Hearing SciencesBachelorBachelor of Science in Speech and Hearing Science

HKU Scholars Hub

A kinematic study of coarticulation of Cantonese fricative /s/ using electromagnetic articulography (EMA)

Author: Lai Ho-ning, Lydia
黎浩寧
Publication venue: The University of Hong Kong (Pokfulam, Hong Kong)
Publication date: 01/01/2009
Field of study

Revistes Catalanes amb Accés Obert

Repositori Institucional URV

HKU Scholars Hub

Hemeroteca Cientifica Catalana

L'anticorrecteur : outil d'évaluation positive de l'orthographe et de la grammaire

Author: Delbar Valentine
Ho-Dac Lydia-Mai
Muller Sophie
Publication venue: HAL CCSD
Publication date: 04/07/2016
Field of study

International audienceThis study aims at testing out a new form of evaluation for spell and grammar checking. A new tool, called "anti-correcteur", was integrated in Cordial, a French spell and grammar checker, for measuring success rates in common spelling difficulties defined according to literature in French language teaching and corpus-based analysis. This module proposes to assess spelling skills not only against errors, but also by taking successes into account. This paper presents a first experiment of such a positive evaluation by exploring results given by the "anti-correcteur" applied on a diversified corpus in terms of level of literacy and genre.L'objectif de cette étude est d'expérimenter l'intégration d'une nouvelle forme d'évaluation dans un correcteur orthographique et grammatical. L'« anti-correcteur » a pour objet de mesurer le taux de réussites orthographiques et grammaticales d'un texte sur certains points jugés difficiles selon la littérature et une observation d'erreurs en corpus. L'évaluation du niveau d'écriture ne se base plus uniquement sur les erreurs commises, mais également sur les réussites réalisées. Une version bêta de ce nouveau mode d'évaluation positive a été intégré dans le correcteur Cordial. Cet article a pour but de discuter de l'intérêt de ce nouveau rapport à l'orthographe et de présenter quelques premiers éléments d'analyse résultant de l'application de l'anti-correcteur sur un corpus de productions variées en matière de niveau d'écriture et genre discursif

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

Annotation des structures discursives : l'expérience ANNODIS

Author: Ho-Dac Lydia-Mai
Péry-Woodley Marie-Paule
Publication venue: 'EDP Sciences'
Publication date: 01/01/2014
Field of study

International audienceLa ressource ANNODIS est un corpus diversifié de français écrit enrichi d'annotations concernant le niveau discursif. Son originalité réside dans sa mutualisation de deux approches complémentaires qui permettent, par leur oppositions et rapprochements, de poser un certain nombre de questions concernant l'annotation de structures discursives. cet article propose de revenir sur les enjeux principaux qui ont motivés les membres du projet ANNODIS : 1) stabiliser un certain nombre de définition linguistique de phénomènes discursives ciblées et 2) confronter aux données réelles une certaine modélisation de la construction de la cohérence discursive. Ce double objectif est révélateur des deux approches mises à l'épreuve dans l'expérience ANNODIS. Cet article revient sur les enjeux de cette ressource en terme à la fois de structures discursives et de campagne d'annotation. Un regard particulier sera porté sur la question du devenir des annotations, notamment dans un domaine encore peu stabilisé

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

EDP Sciences OAI-PMH repository (1.2.0)

Directory of Open Access Journals

HAL Descartes

ANNODIS : une ressource pour l'identification de systèmes de marqueurs du discours

Author: Ho-Dac Lydia-Mai
Péry-Woodley Marie-Paule
Publication venue: HAL CCSD
Publication date: 11/05/2012
Field of study

National audienceA la recherche des "marqueurs" impliqués dans la signalisation de l'organisation discursive, et des interactions ou jeux de contraintes entre différents systèmes de marqueurs, de nombreux travaux visent à définir des combinaisons ou faisceaux d'indices discursifs. L'étude que nous présentons s'inscrit dans cette lignée, mais de manière descriptive et empirique à travers l'application de techniques de fouille à un corpus annoté manuellement. Nous décrivons brièvement ce corpus, puis la méthode qui nous permet de passer d'abord des traits (pré-marqués automatiquement) aux indices (annotés manuellement), puis des indices aux combinaisons que nous appelons "cuesets"

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes