46 research outputs found
Tools for Terminology Processing
International audienceAutomatic terminology processing appeared 10 years ago when electronic corpora became widely available. Such processing may be statistically or linguistically based and produces terminology resources that can be used in a number of applications : indexing, information retrieval, technology watch, etc. We present the tools that have been developed in the IRIN Institute. They all take as input texts (or collection of texts) and reflect different states of terminology processing: term acquisition, term recognition and term structuring
Coping with Alternate Formulations of Questions and Answers
We present in this chapter the QALC system which has participated in the four TREC QA evaluations. We focus here on the problem of linguistic variation in order to be able to relate questions and answers. We present first, variation at the term level which consists in retrieving questions terms in document sentences even if morphologic, syntactic or semantic variations alter them. Our second subject matter concerns variation at the sentence level that we handle as different partial reformulations of questions. Questions are associated with extraction patterns based on the question syntactic type and the object that is under query. We present the whole system thus allowing situating how QALC deals with variation, and different evaluations
Cross-language Ontology Learning: Incorporating and Exploiting Cross-language Data in the Ontology Learning Process
Hans Hjelm. Cross-language Ontology Learning:
Incorporating and Exploiting Cross-language Data in the Ontology Learning Process.
NEALT Monograph Series, Vol. 1 (2009), 159 pages.
© 2009 Hans Hjelm.
Published by
Northern European Association for Language
Technology (NEALT)
http://omilia.uio.no/nealt .
Electronically published at
Tartu University Library (Estonia)
http://hdl.handle.net/10062/10126
Introduction - Permanence et changements
Avec le web, les moteurs de recherche, les blogues et les wikis, la relation Ă lâinformation sâest transformĂ©e au point oĂč les repĂšres habituels sâĂ©moussent et doivent ĂȘtre redĂ©finis de fond en comble. Du coup, le travail des archivistes et des bibliothĂ©caires doit lâĂȘtre tout autant. Riches dâune solide tradition et conscientes des dĂ©fis posĂ©s par la modernitĂ© la plus radicale, les sciences de lâinformation se sont Ă©largies. Mais il ne sâagit plus seulement de conserver et de diffuser le savoir, il sâagit dâen repenser le traitement et lâaccĂšs.
Conçu par lâĂcole de bibliothĂ©conomie et des sciences de lâinformation (EBSI) de lâUniversitĂ© de MontrĂ©al, cet ouvrage se situe au carrefour de deux grandes traditions, amĂ©ricaine et française, et a pour ambition de fournir les clĂ©s du monde des sciences de lâinformation en se fondant sur des savoirs pratiques et concrets. Les auteurs prĂ©sentent ici un savoir Ă la fine pointe des sciences de lâinformation pour rĂ©pondre Ă la complexitĂ© des enjeux actuels et futurs.
Introduction, bibliographie, index et table des matiĂšres de lâouvrage
TerminoloĆĄka naÄela in oblikoslovnoskladenjske terminoloĆĄke variacije
The article deals with terminological principles and terminological variability, with an emphasis on morphosyntactic terminological variations. Examples of such variations were empirically observed in a specialized corpus of texts on computer and information sciences. Based on the examples retrieved, the article presents a typology for describing morphosyntactic terminological variations in Slovenian. It also highlights the logical and conceptual relations established through such terminological variations.Älanek obravnava tematiko terminoloĆĄkih naÄel in terminoloĆĄke variabilnosti, s poudarkom na oblikoslovno-skladenjskih terminoloĆĄkih variacijah. Primeri tovrstnih terminoloĆĄkih variacij so bili empiriÄno opazovani v specializiranem korpusu besedil s podroÄja raÄunalniĆĄtva in informatike. Na podlagi odkritih primerov Älanek ponuja tipologijo za opis oblikoslovno-skladenjskih variacij v slovenĆĄÄini ter obenem ponazarja logiÄne in pojmovne odnose, ki se vzpostavljajo s takĆĄnimi variacijami
Term-driven E-Commerce
Die Arbeit nimmt sich der textuellen Dimension des E-Commerce an. Grundlegende Hypothese ist die textuelle Gebundenheit von Information und Transaktion im Bereich des elektronischen Handels. Ăberall dort, wo Produkte und Dienstleistungen angeboten, nachgefragt, wahrgenommen und bewertet werden, kommen natĂŒrlichsprachige AusdrĂŒcke zum Einsatz. Daraus resultiert ist zum einen, wie bedeutsam es ist, die Varianz textueller Beschreibungen im E-Commerce zu erfassen, zum anderen können die umfangreichen textuellen Ressourcen, die bei E-Commerce-Interaktionen anfallen, im Hinblick auf ein besseres VerstĂ€ndnis natĂŒrlicher Sprache herangezogen werden