
2,047 research outputs found

Morphonette: a morphological network of French

Author: Hathout Nabil
Publication date: 29/04/2010

This paper describes in detail the first version of Morphonette, a new French morphological resource, and a new, radically lexeme-based method of morphological analysis. This research is grounded in a paradigmatic conception of derivational morphology in which the morphological structure is a structure of the entire lexicon and not one of the individual words it contains. The discovery of this structure relies on a measure of morphological similarity between words, on formal analogy, and on the properties of two morphological paradigms.

arXiv.org e-Print Archive

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

Constraint Logic Programming for Natural Language Processing

Author: Blache Philippe
Hathout Nabil
Publication date: 01/01/1995

This paper evaluates the adequacy of the constraint logic programming (CLP) paradigm for natural language processing. Theoretical aspects of this question have been discussed in several works. We adopt here a pragmatic point of view, and our argument relies on concrete solutions. Using actual constraints (in the CLP sense) is neither easy nor direct. However, CLP can improve parsing techniques in several respects, such as concision, control, efficiency, and direct representation of linguistic formalisms. This discussion is illustrated by several examples and the presentation of an HPSG parser. Comment: 15 pages, uuencoded and compressed postscript, to appear in Proceedings of the 5th Int. Workshop on Natural Language Understanding and Logic Programming. Lisbon, Portugal. 199

arXiv.org e-Print Archive

CiteSeerX

Saving energy through heat loss survey of residential areas of Winnipeg

Author: Hathout Salah
Publication venue: Institute of Urban Studies
Publication date: 01/01/1980

1 v. (various pagings) : ill. 6 diagrams (31x236 cm.) in envelope. (available in hard copy at Institute of Urban Studies Library)

WinnSpace Repository

Acquisition morphologique à partir d'un dictionnaire informatisé

Author: Hathout Nabil
Publication venue: HAL CCSD
Publication date: 10/02/2009

10 pages. The paper presents a linguistic and computational model that aims to make the morphological structure of the lexicon emerge from the formal and semantic regularities of the words it contains. The model is word-based. The proposed morphological structure consists of (1) binary relations that connect each headword with morphologically related words, especially the members of its morphological family and its derivational series, and (2) the analogies that hold between the words. The model has been tested on the lexicon of French using the TLFi machine-readable dictionary.

The article proposes a linguistic and computational model that makes the derivational morphological structure of the lexicon emerge from the semantic and formal regularities of the words it contains. This model is radically lexeme-based. The morphological structure consists of the relations that each word holds with the other units of the lexicon, in particular with the words of its morphological family and its derivational series. These relations form analogical paradigms. The model was tested on the French lexicon using the TLFi machine-readable dictionary.

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

Exurban housing development in the Winnipeg-Selkirk corridor

Author: Barber Josh
Hathout Salah
Publication venue: Institute of Urban Studies
Publication date: 01/01/1977

82 leaves : ill. ; 28 cm

WinnSpace Repository

WEBAFFIX : une boîte à outils d'acquisition lexicale à partir du Web

Author: Hathout Nabil
Tanguy Ludovic
Publication venue: Université du Québec à Montréal - Département de Didactique
Publication date: 01/01/2005

This paper deals with the design and use of Webaffix, a tool for semi-automatically detecting new word forms on the World Wide Web. We focus mainly on new derived words, i.e. words coined from other lexemes through suffixation and/or prefixation. We describe the techniques and methods used in Webaffix, along with a sample of results obtained in several studies on French. Resources such as those created with Webaffix are useful not only for natural language processing and information retrieval tasks, but also for the linguistic study of word creation.

We present Webaffix, a tool and a methodology for semi-automatically building and enriching lexical data using the Web as a corpus. Our approach focuses specifically on detecting and analyzing lexical units formed by suffixation or prefixation. We present the methods and techniques used by Webaffix, describing the different modes of use that we have considered and put into practice, along with sample results produced by several usage campaigns. The data collected in this way are useful as resources for various natural language processing applications, and also make it possible to study lexical creation phenomena on a large scale.

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

Hal-Diderot

Looking for French deverbal nouns in an evolving Web (a short history of WAC)

Author: Hathout Nabil
Sajous Franck
Tanguy Ludovic
Publication venue: HAL CCSD
Publication date: 07/09/2009

This paper describes an 8-year-long research effort to automatically collect new French deverbal nouns on the Web. The goal has remained the same: building an extensive and cumulative list of noun-verb pairs where the noun denotes the action expressed by the verb (e.g. production - produce). This list is used both for linguistic research and for NLP applications. The initial method took advantage of the former Altavista search engine, which allowed direct access to unknown word forms. The second technique led us to develop a specific crawler, which raised a number of technical difficulties. In the third experiment, we used a collection of web pages made available to us by a commercial search engine. Through all these stages, the general method has remained the same, and the results are similar and cumulative, although the technical environment has greatly evolved.

Scientific Publications of the University of Toulouse II Le Mirail

HAL Descartes

Ne jetons pas le Wiktionnaire avec l'oripeau du Web ! Études et réalisations fondées sur le dictionnaire collaboratif

Author: Calderone Basilio
Hathout Nabil
Sajous Franck
Publication venue: HAL CCSD
Publication date: 01/01/2014

Wiktionnaire is the French edition of Wiktionary, the free multilingual dictionary available online. A satellite of Wikipedia, of which it is the "lexical companion", the dictionary project remains in the shadow of the encyclopedia. Based, like Wikipedia, on the wiki principle, it can be expanded and edited by any Internet user, with changes published immediately. While the encyclopedic resource has been used extensively in some disciplines, the collaborative dictionary seems to have received less attention from the scientific community. This lesser interest may stem from unfamiliarity or from an a priori rejection of the amateurism readily associated with contributions made by non-experts. In this article we present some characteristics of Wiktionnaire, along with resources built from it. This work aims to illustrate the possibilities offered by this unusual dictionary and to help decide whether or not its exploitation is worthwhile, and for what uses. More precisely, we question the legitimacy of crowdsourced resources, and we study to what extent Wiktionnaire can, through its specific features, complement existing dictionary resources in linguistic studies and, in addition, serve as a starting point for building an electronic lexicon for fields such as natural language processing and psycholinguistics. Our contribution to the characterization of Wiktionnaire is accompanied by the release of two lexicons built from the collaborative dictionary. The first is a morphophonological lexicon with very wide coverage. It is intended in particular for NLP applications, and we give examples of possible uses in tool-assisted linguistics. The second is a lexicon oriented toward psycholinguistics. Derived from the first, it contains fewer entries but includes, for each one, a set of information commonly used in that discipline. Both lexicons can be downloaded and queried online.

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

EDP Sciences OAI-PMH repository (1.2.0)

HAL Descartes