33 research outputs found

    Automatic dictionary and rule-based systems for extracting information from text

    Get PDF
    The paper offers a general introduction to the use of meta-information in a text mining perspective. The aim is to build a meta-dictionary as an available linguistic resource useful for different applications. The procedure is based on the use of a hybrid system. The suggested algorithm employs, conjointly and in a recursive way, dictionaries and rules, the latter both lexical and textual. An application on a corpus of diaries from the Time Use Survey (TUS) by Istat is illustrate

    Il riconoscimento automatico di locuzioni verbali con l’ausilio del software Taltac2

    No full text
    This paper concerns the use of a linguistic resource constituted by verbal locutions derived from the GRADIT, to be applied – via an algorithm implemented in the TaLTaC2 software – for analysing any text. Some of the quantitative characteristics of the resource and the construction of the local grammar of a locution are illustrated. This is done, following the logic of the linguistics of corpora, by applying the resource to a huge corpus of newspaper articles. The calculation of the frequency of these locutions at the level of lemmas provides knowledge of their use in the press, to be exploited for the analysis of specific corpora. Moreover, the key algorithm of automatic recognition is described, as well as the application of the resource to Obama’s speech at the University of Cairo

    Déclarations et répliques gouvernementales dans le discours parlementaire italien, deux genres discoursifs.

    No full text
    analisi statistica di discorsi parlamentari di insediamento dei governi della prima repubblic

    Statistica testuale e text mining: alcuni paradigmi applicativi

    No full text
    In this paper, after reconstructing some essential phases in the evolution of automatic analysis of texts, the steps of an ideal strategy for the statistical analysis of textual data are defined. The characteristics of lexical and textual analysis are described, as well as some techniques of information extraction, that employ resources which are endogenous and exogenous with respect to the texts to be examined. In order to show the potential of textual statistics and of the most recent Text Mining applications, some relevant case studies concerning statistical survey and document analysis are illustrated

    Déclarations et répliques gouvernementales dans le discours parlementaire italien, deux genres discursifs

    No full text
    Bolasco Sergio. Déclarations et répliques gouvernementales dans le discours parlementaire italien, deux genres discursifs. In: Mots, n°64, décembre 2000. Autour d'une crise franco-australienne. Stéréotypies xénophiles et xénophobes, sous la direction de Albane Cain, Christine Develotte et Pierre Fiala. pp. 97-112
    corecore