The Verbmobil semantic database

Author: Heinecke Johannes
Worm Karsten
Publication venue: Sonstige Einrichtungen. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1996

The distributed development of the modules of a large natural language processing system at different sites makes interface definitions a vital issue. It becomes even more urgent when several modules with the same intended functionality are developed in parallel and should be indistinguishable with respect to their input-output behaviour. Another important issue is the acquisition and maintenance of lexical information, which should be stored independently of an application to make it (re)usable for different purposes. This paper describes the design and use of the Verbmobil Semantic Database, which we developed in order to deal with these issues in the area of lexical semantics in Verbmobil.

CiteSeerX

Universaar

Détecter le potentiel d'ambiguïté d'une requête - le cas des recherches portant sur l'actualité

Author: Fabre Cécile
Heinecke Johannes
Lalleman Fanny
Publication venue: HAL CCSD
Publication date: 01/01/2012

The aim of the work presented here is to examine the notion of ambiguity through the study of queries submitted to an information retrieval (IR) system, Orange's 2424actu.fr site, which was in operation from 1/10/2009 to 1/09/2011. The system processes a collection of documents on French news, a particularly fast-moving domain and therefore well suited to examining the question of ambiguity. We seek to determine the nature of query ambiguity by examining the available query logs and comparing them with various contextual cues that enrich our picture of the semantic variability of the query terms.

Scientific Publications of the University of Toulouse II Le Mirail

EDP Sciences OAI-PMH repository (1.2.0)

HAL Descartes

MaskParse@Deskin at SemEval-2019 Task 1: Cross-lingual UCCA Semantic Parsing using Recursive Masked Sequence Tagging

Author: Damnati Geraldine
Heinecke Johannes
Marzinotto Gabriel
Publication venue: HAL CCSD
Publication date: 06/06/2019

This paper describes our recursive system for SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA. Each recursive step consists of two parts. We first perform semantic parsing using a sequence tagger to estimate the probabilities of the UCCA categories in the sentence. Then, we apply a decoding policy which interprets these probabilities and builds the graph nodes. Parsing is done recursively: we perform a first inference on the sentence to extract the main scenes and links, and then we recursively apply our model on the sentence using a masking feature that reflects the decisions made in previous steps. The process continues until the terminal nodes are reached. We chose a standard neural tagger and focused on our recursive parsing strategy and on the cross-lingual transfer problem to develop a robust model for the French language, using only a few training samples.

arXiv.org e-Print Archive

HAL AMU

Knowledge-based semantic annotation and retrieval of multimedia content

Author: Akrivas Giorgos
Douze Matthijs
Heinecke Johannes
O'Connor Noel E.
Papadopoulos Georgios Th.
Saathoff Carsten
Waddington Simon
Publication venue: CEUR-Workshop Proceedings
Publication date: 01/12/2007

aceMedia is a four-year, EC part-funded FP6 Integrated Project, ending in December 2007. The project has developed tools to enable users to manage and share both personal and purchased content across PC, STB and mobile platforms. Knowledge-based analysis and ontologies have been successfully exploited in an end-to-end system to enable automated semantic annotation and retrieval of multimedia content. The paper briefly describes the objectives of aceMedia and the application of knowledge-based analysis in the project.

DCU Online Research Access Service

CALOR-Frame : un corpus de textes encyclopédiques annoté en cadres sémantiques

Author: Béchet Frédéric
Damnati Géraldine
Heinecke Johannes
Marzinotto Gabriel
Nasr Alexis
Publication venue: HAL CCSD
Publication date: 26/06/2017

CALOR-Frame: a corpus of encyclopedic texts annotated with semantic frames. CALOR-Frame is a corpus of encyclopedic History texts annotated with semantic frames, jointly produced by Aix-Marseille University and Orange Labs. The resource was built in the broader context of Information Retrieval, with the aim of improving access to knowledge content. Structuring text into semantic frames enables advanced search functionalities beyond keyword search. This article presents the annotation process that was set up, which uses a tool to validate automatically generated annotations in an optimized way. The selection of texts and semantic frames is also motivated. Keywords: semantic frame, corpus, active learning, sequence labeling.

HAL AMU

CALOR-QUEST : un corpus d'entraînement et d'évaluation pour la compréhension automatique de textes

Author: Aloui Cindy
Béchet Frédéric
Charlet Delphine
Damnati Geraldine
Heinecke Johannes
Herledan Frédéric
Nasr Alexis
Publication venue: HAL CCSD
Publication date: 01/07/2019

Machine reading comprehension is a task related to question answering in which questions are not generic in scope but are tied to a particular document. Recently, very large corpora (SQuAD, MS MARCO) containing (document, question, answer) triples were made available to the scientific community to develop supervised methods based on deep neural networks, with promising results. These methods need very large training corpora to be effective; however, such data currently exists only for English. The purpose of this study is to enable the development of such resources for other languages at lower cost by proposing a method that generates questions semi-automatically from a semantic frame analysis. The collection of natural questions is reduced to a validation/test set. We applied this method to the French CALOR-Frame corpus in order to develop the CALOR-QUEST resource presented in this paper.
Keywords: machine reading comprehension, question answering, semantic frame analysis, question generation

Klimaschutz in finanzschwachen Kommunen

Author: Altenburg Corinna
Heinbach Katharina
Heinecke Sabrina
Krone Elisabeth
Reiß Philipp
Rupp Johannes
Scheller Henrik
Walker Benedikt
Walter Jan
Publication venue: Institut für Ökologische Wirtschaftsforschung
Publication date

Digitale Landesbibliothek Berlin (Zentral- und Landesbibliothek Berlin)