Automatic Generation of Titles for a Corpus of Questions

Author: Cardeñosa Jesús
Carolina Carolina
Publication venue: Institute of Information Theories and Applications FOI ITHEA
Publication date: 01/01/2008

This paper describes the methodology followed to automatically generate titles for a corpus of questions that belong to sociological opinion polls. Titles for questions have a twofold function: (1) they are the input of user searches and (2) they inform about the whole contents of the question and its possible answer options. Thus, generation of titles can be considered a case of automatic summarization. However, the fact that summarization had to be performed over very short texts, together with the aforementioned quality conditions imposed on newly generated titles, led the authors to follow knowledge-rich and domain-dependent strategies for summarization, disregarding the more frequent extractive techniques for summarization.

Bulgarian Digital Mathematics Library at IMI-BAS

Automatic Repair of Real Bugs: An Experience Report on the Defects4J Dataset

Author: Durieux Thomas
Martinez Matias
Monperrus Martin
Sommerard Romain
Xuan Jifeng
Publication date: 09/06/2015

Defects4J is a large, peer-reviewed, structured dataset of real-world Java bugs. Each bug in Defects4J is provided with a test suite and at least one failing test case that triggers the bug. In this paper, we report on an experiment to explore the effectiveness of automatic repair on Defects4J. The result of our experiment shows that 47 bugs of the Defects4J dataset can be automatically repaired by state-of-the-art repair. This sets a baseline for future research on automatic repair for Java. We have manually analyzed 84 different patches to assess their real correctness. In total, 9 real Java bugs can be correctly fixed with test-suite based repair. This analysis shows that test-suite based repair suffers from under-specified bugs, for which trivial and incorrect patches still pass the test suite. With respect to practical applicability, it takes on average 14.8 minutes to find a patch. The experiment was done on a scientific grid, totaling 17.6 days of computation time. All our systems and experimental results are publicly available on GitHub in order to facilitate future research on automatic repair.

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot
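The Defects4J abstract above notes that test-suite based repair suffers from under-specified bugs, where a trivial and incorrect patch still passes the test suite. The following is a minimal Python sketch of that effect. Every function name and test input here is hypothetical and is not taken from Defects4J or from the authors' repair tools.

```python
def max_of_buggy(values):
    # Hypothetical bug: returns the first element instead of the maximum.
    return values[0]


def max_of_trivial_patch(values):
    # Hypothetical overfitting patch: returns the last element.
    # It is still wrong in general, but it satisfies the weak test suite below.
    return values[-1]


def max_of_correct(values):
    # The intended behaviour.
    return max(values)


def passes_test_suite(fn):
    # An under-specified test suite: both inputs happen to have
    # their maximum in the last position.
    return fn([1, 2, 3]) == 3 and fn([5]) == 5


def passes_held_out_check(fn):
    # An extra input that the test suite does not cover.
    return fn([3, 1, 2]) == 3
```

In this sketch the buggy version fails the test suite, while both the trivial patch and the correct version pass it. Only the held-out input separates the two, which is one way to see why the authors analyzed patches manually rather than trusting the test suite alone.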

An evaluation of pedagogically informed parameterised questions for self assessment

Author: Davis Hugh
Gilbert Lester
Sitthisak Onjira
Publication venue: Informa UK Limited
Publication date: 01/09/2008

Self-assessment is a crucial component of learning. Learners can learn by asking themselves questions and attempting to answer them. However, creating effective questions is time-consuming because it may require considerable resources and the skill of critical thinking. Questions need careful construction to accurately represent the intended learning outcome and the subject matter involved. There are very few systems currently available which generate questions automatically, and these are confined to specific domains. This paper presents a system for automatically generating questions from a competency framework, based on a sound pedagogical and technological approach. This makes it possible to guide learners in developing questions for themselves, and to provide authoring templates which speed up the creation of new questions for self-assessment. This novel design and implementation involves an ontological database that represents the intended learning outcome to be assessed across a number of dimensions, including level of cognitive ability and subject matter. The system generates a list of all the questions that are possible from a given learning outcome, which may then be used to test for understanding, and so could determine the degree to which learners actually acquire the desired knowledge. The way in which the system has been designed and evaluated is discussed, along with its educational benefits.

Southampton (e-Prints Soton)

Rethinking the Agreement in Human Evaluation Tasks

Author: Amidei Jacopo
Piwek Paul
Willis Alistair
Publication date: 01/01/2018

Human evaluations are broadly thought to be more valuable the higher the inter-annotator agreement. In this paper we examine this idea. We describe our experiments and analysis within the area of Automatic Question Generation. Our experiments show how annotators diverge in language annotation tasks due to a range of ineliminable factors. For this reason, we believe that annotation schemes for natural language generation tasks that are aimed at evaluating language quality need to be treated with great care. In particular, an unchecked focus on reducing disagreement among annotators runs the danger of creating generation goals that reward output that is more distant from, rather than closer to, natural human-like language. We conclude the paper by suggesting a new approach to the use of agreement metrics in natural language generation evaluation tasks.

Open Research Online (The Open University)
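The abstract above discusses inter-annotator agreement metrics without naming a specific one. As an illustration only, the sketch below computes Cohen's kappa, a common chance-corrected agreement measure for two annotators. Choosing this metric is an assumption made for this example, and the function name and labels are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected agreement by chance, from each annotator's label frequencies.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        # Both annotators used one identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)
```

For example, with annotator A giving `["good", "good", "bad", "bad"]` and annotator B giving `["good", "bad", "bad", "bad"]`, observed agreement is 0.75, chance agreement is 0.5, and kappa is 0.5.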

Self-Supervised and Controlled Multi-Document Opinion Summarization

Author: Coavoux Maximin
Elsahar Hady
Gallé Matthias
Rozen Jos
Publication date: 30/04/2020

We address the problem of unsupervised abstractive summarization of collections of user-generated reviews with self-supervision and control. We propose a self-supervised setup that considers an individual document as a target summary for a set of similar documents. This setting makes training simpler than previous approaches by relying only on standard log-likelihood loss. We address the problem of hallucinations through the use of control codes, to steer the generation towards more coherent and relevant summaries. Finally, we extend the Transformer architecture to allow for multiple reviews as input. Our benchmarks on two datasets against graph-based and recent neural abstractive unsupervised models show that our proposed method generates summaries with superior quality and relevance. This is confirmed in our human evaluation, which focuses explicitly on the faithfulness of generated summaries. We also provide an ablation study, which shows the importance of the control setup in controlling hallucinations and achieving high sentiment and topic alignment of the summaries with the input reviews. Comment: 18 pages including 5 pages appendix

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

Android app for automatic music composition

Author: Barbancho-Perez Ana Maria
Barbancho-Perez Isabel
Nieto Marina S.
Tardon-Garcia Lorenzo Jose
Publication date: 01/10/2018

In this contribution, we present an Android App that automatically generates music replicating a specific contemporary genre, namely rock music. The composition system is based on the analysis of a set of songs of the desired genre. The analysis results required by the automatic composition algorithm are exported to a JSON file that can be readily imported and parsed. The Android App creates a symbolic representation of the music generated for drums, bass and rhythm guitar, which can be exported to a MIDI file or an audio waveform. The evaluation results show that the automatically generated compositions cannot be distinguished from other excerpts created by human composers. Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tech. Ministerio de Economía y Competitividad, Gobierno de España, Proyecto TIN2016-75866-C3-2-R

Repositorio Institucional Universidad de Málaga