Search CORE

6,291 research outputs found

A Supervised Approach to Extractive Summarisation of Scientific Papers

Author: Augenstein Isabelle
Collins Ed
Riedel Sebastian
Publication venue
Publication date: 01/01/2017
Field of study

Automatic summarisation is a popular approach to reduce a document to its main arguments. Recent research in the area has focused on neural approaches to summarisation, which can be very data-hungry. However, few large datasets exist and none for the traditionally popular domain of scientific publications, which opens up challenging research avenues centered on encoding large, complex documents. In this paper, we introduce a new dataset for summarisation of computer science publications by exploiting a large resource of author provided summaries and show straightforward ways of extending it further. We develop models on the dataset making use of both neural sentence encoding and traditionally used summarisation features and show that models which encode sentences as well as their local and global context perform best, significantly outperforming well-established baseline methods.Comment: 11 pages, 6 figure

arXiv.org e-Print Archive

Crossref

UCL Discovery

Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing

Author: Barzilay Regina
Globerson Amir
Ram Ori
Schuster Tal
Publication venue
Publication date: 01/01/2019
Field of study

We introduce a novel method for multilingual transfer that utilizes deep contextual embeddings, pretrained in an unsupervised fashion. While contextual embeddings have been shown to yield richer representations of meaning compared to their static counterparts, aligning them poses a challenge due to their dynamic nature. To this end, we construct context-independent variants of the original monolingual spaces and utilize their mapping to derive an alignment for the context-dependent spaces. This mapping readily supports processing of a target language, improving transfer by context-aware embeddings. Our experimental results demonstrate the effectiveness of this approach for zero-shot and few-shot learning of dependency parsing. Specifically, our method consistently outperforms the previous state-of-the-art on 6 tested languages, yielding an improvement of 6.8 LAS points on average.Comment: NAACL 201

arXiv.org e-Print Archive

DSpace@MIT

Crossref

Writing Rangers

Author: Wehling Katharine
Publication venue: ValpoScholar
Publication date: 01/01/2016
Field of study

Valparaiso University

Atlas.txt : Linking Geo-referenced Data to Text for NLG

Author: Sripada Gowri Somayajulu
Thomas Kavita
Publication venue
Publication date: 01/01/2007
Field of study

Peer reviewedPreprin

Aberdeen University Research

Access to recorded interviews: A research agenda

Author: Heeren W.F.L.
Jong F.M.G. de
Oard D.W.
Ordelman R.J.F.
Publication venue: ACM
Publication date: 01/01/2008
Field of study

Recorded interviews form a rich basis for scholarly inquiry. Examples include oral histories, community memory projects, and interviews conducted for broadcast media. Emerging technologies offer the potential to radically transform the way in which recorded interviews are made accessible, but this vision will demand substantial investments from a broad range of research communities. This article reviews the present state of practice for making recorded interviews available and the state-of-the-art for key component technologies. A large number of important research issues are identified, and from that set of issues, a coherent research agenda is proposed

University of Twente Research Information

A reproducible approach with R markdown to automatic classification of medical certificates in French

Author: Beghini Federica
Di Nunzio Giorgio Maria
Henrot Genevi\ue8ve
Vezzani Federica
Publication venue: CEUR-WS
Publication date: 01/01/2017
Field of study

In this paper, we report the ongoing developments of our first participation to the Cross-Language Evaluation Forum (CLEF) eHealth Task 1: “Multilingual Information Extraction - ICD10 coding” (Névéol et al., 2017). The task consists in labelling death certificates, in French with international standard codes. In particular, we wanted to accomplish the goal of the ‘Replication track’ of this Task which promotes the sharing of tools and the dissemination of solid, reproducible results.In questo articolo presentiamo gli sviluppi del lavoro iniziato con la partecipazione al Laboratorio CrossLanguage Evaluation Forum (CLEF) eHealth denominato: “Multilingual Information Extraction - ICD10 coding” (Névéol et al., 2017) che ha come obiettivo quello di classificare certificati di morte in lingua francese con dei codici standard internazionali. In particolare, abbiamo come obiettivo quello proposto dalla ‘Replication track’ di questo Task, che promuove la condivisione di strumenti e la diffusione di risultati riproducibili

Archivio istituzionale della ricerca - Università di Padova

A Preliminary Evaluation of a Privacy-Preserving Dialogue System

Author
Publication venue: CEUR-WS.org
Publication date: 28/11/2021
Field of study

open3noopenBettina Fazzinga, Andrea Galassi, Paolo TorroniBettina Fazzinga, Andrea Galassi, Paolo Torron

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna