494 research outputs found
NLP-based Metadata Extraction for Legal Text Consolidation
The paper describes a system for the automatic consolidation of Italian legislative texts to be used as a support of an editorial consolidating activity and dealing with the following typology of textual amendments: repeal, substitution and integration. The focus of the paper is on the semantic analysis of the textual amendment provisions and the formalized representation of the amendments in terms of metadata. The proposed approach to consolidation is metadata- oriented and based on Natural Language Processing (NLP) techniques: we use XML-based standards for metadata annotation of legislative acts and a flexible NLP architecture for extracting metadata from parsed texts. An evaluation of achieved results is also provided
Automatic extraction of knowledge from web documents
A large amount of digital information available is written as text documents in the form of web pages, reports, papers, emails, etc. Extracting the knowledge of interest from such documents from multiple sources in a timely fashion is therefore crucial. This paper provides an update on the Artequakt system which uses natural language tools to automatically extract knowledge about artists from multiple documents based on a predefined ontology. The ontology represents the type and form of knowledge to extract. This knowledge is then used to generate tailored biographies. The information extraction process of Artequakt is detailed and evaluated in this paper
Design of a Controlled Language for Critical Infrastructures Protection
We describe a project for the construction of controlled language for critical infrastructures protection (CIP). This project originates
from the need to coordinate and categorize the communications on CIP at the European level. These communications can be physically
represented by official documents, reports on incidents, informal communications and plain e-mail. We explore the application of
traditional library science tools for the construction of controlled languages in order to achieve our goal. Our starting point is an
analogous work done during the sixties in the field of nuclear science known as the Euratom Thesaurus.JRC.G.6-Security technology assessmen
Final FLaReNet deliverable: Language Resources for the Future - The Future of Language Resources
Language Technologies (LT), together with their backbone, Language Resources (LR), provide an essential support to the challenge of Multilingualism and ICT of the future. The main task of language technologies is to bridge language barriers and to help creating a new environment where information flows smoothly across frontiers and languages, no matter the country, and the language, of origin. To achieve this goal, all players involved need to act as a community able to join forces on a set of shared priorities. However, until now the field of Language Resources and Technology has long suffered from an excess of individuality and fragmentation, with a lack of coherence concerning the priorities for the field, the direction to move, not to mention a common timeframe. The context encountered by the FLaReNet project was thus represented by an active field needing a coherence that can only be given by sharing common priorities and endeavours. FLaReNet has contributed to the creation of this coherence by gathering a wide community of experts and making them participate in the definition of an exhaustive set of recommendations
A Survey on Linked Data and the Social Web as facilitators for TEL recommender systems
Personalisation, adaptation and recommendation are central features
of TEL environments. In this context, information retrieval techniques are applied
as part of TEL recommender systems to filter and recommend learning resources
or peer learners according to user preferences and requirements. However,
the suitability and scope of possible recommendations is fundamentally
dependent on the quality and quantity of available data, for instance, metadata
about TEL resources as well as users. On the other hand, throughout the last
years, the Linked Data (LD) movement has succeeded to provide a vast body of
well-interlinked and publicly accessible Web data. This in particular includes
Linked Data of explicit or implicit educational nature. The potential of LD to
facilitate TEL recommender systems research and practice is discussed in this
paper. In particular, an overview of most relevant LD sources and techniques is
provided, together with a discussion of their potential for the TEL domain in
general and TEL recommender systems in particular. Results from highly related
European projects are presented and discussed together with an analysis of
prevailing challenges and preliminary solutions.LinkedU
- âŠ