Search CORE

125 research outputs found

Controlling Output Length in Neural Encoder-Decoders

Author: Kikuchi Yuta
Neubig Graham
Okumura Manabu
Sasano Ryohei
Takamura Hiroya
Publication venue
Publication date: 01/01/2016
Field of study

Neural encoder-decoder models have shown great success in many sequence generation tasks. However, previous work has not investigated situations in which we would like to control the length of encoder-decoder outputs. This capability is crucial for applications such as text summarization, in which we have to generate concise summaries with a desired length. In this paper, we propose methods for controlling the output sequence length for neural encoder-decoder models: two decoding-based methods and two learning-based methods. Results show that our learning-based methods have the capability to control length without degrading summary quality in a summarization task.Comment: 11 pages. To appear in EMNLP 201

arXiv.org e-Print Archive

Crossref

Hypothesis Only Baselines in Natural Language Inference

Author: Haldar Aparajita
Naradowsky Jason
Poliak Adam
Rudinger Rachel
Van Durme Benjamin
Publication venue
Publication date: 01/01/2018
Field of study

We propose a hypothesis only baseline for diagnosing Natural Language Inference (NLI). Especially when an NLI dataset assumes inference is occurring based purely on the relationship between a context and a hypothesis, it follows that assessing entailment relations while ignoring the provided context is a degenerate solution. Yet, through experiments on ten distinct NLI datasets, we find that this approach, which we refer to as a hypothesis-only model, is able to significantly outperform a majority class baseline across a number of NLI datasets. Our analysis suggests that statistical irregularities may allow a model to perform NLI in some datasets beyond what should be achievable without access to the context.Comment: Accepted at *SEM 2018 as long paper. 12 page

arXiv.org e-Print Archive

Crossref

Scholarship, Research, and Creative Work at Bryn Mawr College | Bryn Mawr College Research

A Neural Attention Model for Abstractive Sentence Summarization

Author: Chopra Sumit
Rush Alexander M.
Weston Jason
Publication venue
Publication date: 01/01/2015
Field of study

Summarization based on text extraction is inherently limited, but generation-style abstractive methods have proven challenging to build. In this work, we propose a fully data-driven approach to abstractive sentence summarization. Our method utilizes a local attention-based model that generates each word of the summary conditioned on the input sentence. While the model is structurally simple, it can easily be trained end-to-end and scales to a large amount of training data. The model shows significant performance gains on the DUC-2004 shared task compared with several strong baselines.Comment: Proceedings of EMNLP 201

arXiv.org e-Print Archive

Crossref

Data2Game: Towards an Integrated Demonstrator

Author: Attema Ymko
Bakkes Sander
de Groot Thomas
de Heer Johan
de Jong Ton
Heylen Dirk
Hrynkiewicz Rafal
Mavromoustakos Blom Paris
Oortwijn Tije
Spronck Pieter
Steinrücke Johannes
Theune Mariët
van Stegeren Judith
Veldkamp Bernard
Publication venue
Publication date: 01/01/2021
Field of study

The Data2Game project investigates how the efficacy of computerized training games can be enhanced by tailoring training scenarios to the individual player. The research is centered around three research innovations: (1) techniques for the automated modelling of players’ affective states, based on exhibited social signals, (2) techniques for the automated generation of in-game narratives tailored to the learning needs of the player, and (3) validated studies on the relation of the player behavior and game properties to learning performance. This paper describes the integration of the main results into a joint prototype

University of Twente Research Information

Tilburg University Repository