3 research outputs found

From Terminology Extraction to Terminology Validation: An Approach Adapted to Log Files

Author: Stéphane Bonniol
Pascal Poncelet
Mathieu Roche
Hassan Saneifar
Publication venue: Graz University of Technology, Institut für Informationssysteme und Computer Medien
Publication date: 01/01/2015

Abstract: Log files generated by computational systems contain relevant and essential information. In some application areas like the design of integrated circuits, log files generated by design tools contain information which can be used in management information systems to evaluate the final products. However, the complexity of such textual data raises some challenges concerning the extraction of information from log files. Log files are usually multi-source, multi-format, and have a heterogeneous and evolving structure. Moreover, they usually do not respect natural language grammar and structures even though they are written in English. Classical methods of information extraction such as terminology extraction methods are particularly irrelevant to this context. In this paper, we introduce our approach Exterlog to extract terminology from log files. We detail how it deals with the specific features of such textual data. The performance is emphasized by favoring the most relevant terms of the domain based on a scoring function which uses a Web- and context-based measure. The experiments show that Exterlog is a well-adapted approach for terminology extraction from log files.

ZENODO

HAL Descartes

Agritrop

ARPHA OAI-PMH Endpoint

ARPHA Preprints

HAL-CIRAD

From Terminology Extraction to Terminology Validation: An Approach Adapted to Log Files

Author: Hassan Saneifar
Mathieu Roche
Pascal Poncelet
Stéphane Bonniol
Publication date: 03/04/2020

Abstract: Log files generated by computational systems contain relevant and essential information. In some application areas like the design of integrated circuits, log files generated by design tools contain information which can be used in management information systems to evaluate the final products. However, the complexity of such textual data raises some challenges concerning the extraction of information from log files. Log files are usually multi-source, multi-format, and have a heterogeneous and evolving structure. Moreover, they usually do not respect natural language grammar and structures even though they are written in English. Classical methods of information extraction such as terminology extraction methods are particularly irrelevant to this context. In this paper, we introduce our approach Exterlog to extract terminology from log files. We detail how it deals with the specific features of such textual data. The performance is emphasized by favoring the most relevant terms of the domain based on a scoring function which uses a Web- and context-based measure. The experiments show that Exterlog is a well-adapted approach for terminology extraction from log files.

CiteSeerX

Enhancing passage retrieval in log files by query expansion based on explicit and pseudo relevance feedback

Author: Hassan Saneifar
Mathieu Roche
Pascal Poncelet
Stéphane Bonniol
Publication venue: Elsevier BV

Crossref