4,153 research outputs found

    IMAGACT: Deriving an Action Ontology from Spoken Corpora

    This paper presents the IMAGACT annotation infrastructure, which uses both corpus-based and competence-based methods for the simultaneous extraction of a language-independent action ontology from English and Italian spontaneous speech corpora. The infrastructure relies on an innovative methodology based on images of prototypical scenes and identifies high-frequency action concepts in everyday life, suitable for implementation in an open set of languages.

    Can human association norms evaluate latent semantic analysis?

    This paper compares word association norms created in a psycholinguistic experiment with association lists generated by algorithms operating on text corpora. We compare lists generated by the Church and Hanks algorithm with lists generated by the LSA algorithm, and discuss how well those automatically generated lists reflect real semantic relations.
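    The LSA side of such a comparison can be sketched in a few lines: factor a term-document count matrix with a truncated SVD, then rank candidate associates of a cue word by similarity in the latent space. The toy vocabulary and counts below are illustrative, not taken from the paper.

```python
import numpy as np

# Toy term-document matrix (rows = words, columns = documents).
# Counts are made up for illustration.
words = ["doctor", "nurse", "bread", "butter"]
X = np.array([
    [4, 3, 0, 0],
    [3, 4, 0, 0],
    [0, 0, 5, 2],
    [0, 1, 2, 4],
], dtype=float)

# LSA: a truncated SVD projects words into a low-rank latent space.
U, s, Vt = np.linalg.svd(X, full_matrices=False)
k = 2
word_vecs = U[:, :k] * s[:k]  # word coordinates in the latent space

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Associates of a cue word, ranked by latent-space similarity --
# the kind of list that gets compared against human association norms.
cue = words.index("doctor")
ranked = sorted(
    (w for w in words if w != "doctor"),
    key=lambda w: cosine(word_vecs[cue], word_vecs[words.index(w)]),
    reverse=True,
)
print(ranked)  # "nurse" ranks above the unrelated words
```

    The Church and Hanks approach ranks associates differently, by pointwise mutual information over co-occurrence counts rather than by latent-space similarity, which is exactly why comparing both lists to human norms is informative.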

    Towards a FrameNet Resource for the Legal Domain

    In the AI&Law community, the importance of frame-based ontologies has been acknowledged since the early 90's, with Van Kralingen's proposal of a frame language for legal knowledge representation. This still appears to be a strongly felt need within the community. In this paper, we propose to address this need by developing a FrameNet resource for the legal domain based on Fillmore's Frame Semantics, whose final outcome will include a frame-based lexical ontology and a legal corpus annotated with frame information. In particular, the paper focuses on methodological and design issues, ranging from the customization and extension of the general FrameNet for the legal domain to the linking of the developed resource with already existing legal ontologies.
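    The core data shapes involved, a frame with its frame elements and a sentence annotated against it, can be sketched as plain records. The frame name, element inventory, and example sentence below are hypothetical stand-ins, not entries from the resource the paper describes.

```python
from dataclasses import dataclass

# Sketch of a frame-based entry and a frame-annotated sentence,
# with made-up legal-domain names for illustration only.

@dataclass
class Frame:
    name: str
    core_elements: list   # frame elements licensed by this frame
    lexical_units: list   # words that evoke the frame

@dataclass
class Annotation:
    sentence: str
    frame: str
    spans: dict           # frame element -> text span in the sentence

criminal_process = Frame(
    name="Criminal_process",
    core_elements=["Defendant", "Charges", "Court"],
    lexical_units=["try.v", "prosecute.v", "indict.v"],
)

ann = Annotation(
    sentence="The court tried the defendant for fraud.",
    frame=criminal_process.name,
    spans={
        "Court": "The court",
        "Defendant": "the defendant",
        "Charges": "for fraud",
    },
)

# Validate: every annotated element must belong to the frame's inventory.
assert set(ann.spans) <= set(criminal_process.core_elements)
print(ann.frame, sorted(ann.spans))
```

    Linking such frames to existing legal ontologies then amounts to mapping frame elements and lexical units onto ontology classes and properties, which is the design issue the paper concentrates on.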

    NewsReader: Using knowledge resources in a cross-lingual reading machine to generate more knowledge from massive streams of news

    In this article, we describe a system that reads news articles in four different languages and detects what happened, who is involved, where, and when. This event-centric information is represented as episodic situational knowledge on individuals in an interoperable RDF format that allows for reasoning on the implications of the events. Our system covers the complete path from unstructured text to structured knowledge, for which we defined a formal model that links interpreted textual mentions of things to their representation as instances. The model forms the skeleton for interoperable interpretation across different sources and languages. The real content, however, is defined using multilingual and cross-lingual knowledge resources, both semantic and episodic. We explain how these knowledge resources are used for the processing of text and ultimately define the actual content of the episodic situational knowledge that is reported in the news. The knowledge and model in our system can be seen as an example of how the Semantic Web helps NLP. However, our system also generates massive episodic knowledge of the same type as the Semantic Web is built on. We thus envision a cycle of knowledge acquisition and NLP improvement on a massive scale. This article reports on the details of the system as well as on the performance of various high-level components. We demonstrate that our system performs at state-of-the-art level for various subtasks in the four languages of the project, and we also consider the full integration of these tasks in an overall system whose purpose is reading text. We applied our system to millions of news articles, generating billions of triples expressing formal semantic properties. This shows the capacity of the system to perform at an unprecedented scale.
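    The event-centric RDF the abstract describes can be sketched as a handful of triples in the style of the Simple Event Model (SEM) vocabulary, an event with an actor, a place, and a time. The instance URIs and the exact modelling choices below are assumptions for illustration, not the project's actual output.

```python
# Minimal sketch: serialising one detected news event as N-Triples,
# using SEM-style event properties. All instance URIs are invented.

SEM = "http://semanticweb.cs.vu.nl/2009/11/sem/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.org/news/"

def triple(s, p, o):
    """Render one subject-predicate-object statement as an N-Triples line."""
    return f"<{s}> <{p}> <{o}> ."

event = EX + "event/acquisition-1"
triples = [
    triple(event, RDF_TYPE, SEM + "Event"),
    triple(event, SEM + "hasActor", EX + "entity/Acme_Corp"),
    triple(event, SEM + "hasPlace", EX + "entity/Brussels"),
    triple(event, SEM + "hasTime", EX + "time/2014-03-01"),
]
print("\n".join(triples))
```

    Because every event, actor, place, and time is a URI-identified instance rather than a text mention, triples extracted from different articles and languages about the same event can be merged, which is what makes cross-lingual reading and reasoning over billions of triples feasible.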

    Models to represent linguistic linked data

    As the interest of the Semantic Web and computational linguistics communities in linguistic linked data (LLD) keeps increasing and the number of contributions that dwell on LLD rapidly grows, scholars (and linguists in particular) interested in the development of LLD resources sometimes find it difficult to determine which mechanism is suitable for their needs and which challenges have already been addressed. This review seeks to present the state of the art on the models, ontologies and their extensions to represent language resources as LLD, focusing on the nature of the linguistic content they aim to encode. Four basic groups of models are distinguished in this work: models to represent the main elements of lexical resources (group 1), vocabularies developed as extensions to models in group 1 and ontologies that provide more granularity on specific levels of linguistic analysis (group 2), catalogues of linguistic data categories (group 3), and other models such as corpora models or service-oriented ones (group 4). Contributions encompassed in these four groups are described, highlighting their reuse by the community and the modelling challenges that are still to be faced.
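    For the group-1 kind of model, a lexical entry in the widely reused OntoLex-Lemon style looks roughly as follows, here emitted as Turtle by plain string building. The entry itself and its DBpedia sense link are illustrative examples, not taken from the review.

```python
# Sketch of an OntoLex-Lemon lexical entry serialised as Turtle.
# The entry "bank" and its ontology reference are invented examples.

def ontolex_entry(entry_id, written_form, sense_ref):
    return "\n".join([
        "@prefix ontolex: <http://www.w3.org/ns/lemon/ontolex#> .",
        f"<{entry_id}> a ontolex:LexicalEntry ;",
        f'    ontolex:canonicalForm [ ontolex:writtenRep "{written_form}"@en ] ;',
        f"    ontolex:sense [ ontolex:reference <{sense_ref}> ] .",
    ])

ttl = ontolex_entry(
    "http://example.org/lexicon/bank-n",
    "bank",
    "http://dbpedia.org/resource/Bank",  # sense grounded in an ontology
)
print(ttl)
```

    The characteristic move of these models is visible even in this toy entry: the lexical form lives in the lexicon, while the meaning is delegated to an ontology via `ontolex:reference`, which is what makes the same lexicon linkable to multiple knowledge bases.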

    Ontologies across disciplines
