Using the Annotated Bibliography as a Resource for Indicative Summarization
We report on a language resource consisting of 2000 annotated bibliography
entries, which is being analyzed as part of our research on indicative document
summarization. We show how annotated bibliographies cover certain aspects of
summarization that have not been well-covered by other summary corpora, and
motivate why they constitute an important form to study for information
retrieval. We detail our methodology for collecting the corpus, and overview
our document feature markup that we introduced to facilitate summary analysis.
We present the characteristics of the corpus, methods of collection, and show
its use in finding the distribution of types of information included in
indicative summaries and their relative ordering within the summaries. Comment: 8 pages, 3 figures
The Intellectual Impact of Agricultural Economists
agricultural economists, Agribusiness, Agricultural Finance, Labor and Human Capital, Production Economics
Ocular-based automatic summarization of documents: is re-reading informative about the importance of a sentence?
Automatic document summarization (ADS) has been introduced as a viable solution for reducing the time and effort needed to read the ever-increasing volume of textual content that is disseminated. However, a successful universal ADS algorithm has not yet been developed. Also, despite progress in the field, many ADS techniques do not take into account the needs of different readers, producing summaries that lack internal consistency and so force the reader to re-read the original document. The present study aimed to investigate the usefulness of eye tracking for increasing the quality of ADS. The general idea was to find ocular behavioural indicators that could be easily implemented in ADS algorithms. For instance, the time spent re-reading a sentence might reflect the relative importance of that sentence, thus providing a hint for the selection of text contributing to the summary. We tested this hypothesis by comparing metrics based on the analysis of the eye movements of 30 readers with the highlights they made afterward. Results showed that the time spent reading a sentence was not significantly related to its subjective value, which did not support our hypothesis. Results also showed that sentence length is an unavoidable confound, because longer sentences both have the highest probability of containing units of text judged as important and receive more fixations and re-fixations.
Automatic Repair of Real Bugs: An Experience Report on the Defects4J Dataset
Defects4J is a large, peer-reviewed, structured dataset of real-world Java
bugs. Each bug in Defects4J is provided with a test suite and at least one
failing test case that triggers the bug. In this paper, we report on an
experiment to explore the effectiveness of automatic repair on Defects4J. The
result of our experiment shows that 47 bugs of the Defects4J dataset can be
automatically repaired by state-of-the-art repair. This sets a baseline for
future research on automatic repair for Java. We have manually analyzed 84
different patches to assess their real correctness. In total, 9 real Java bugs
can be correctly fixed with test-suite based repair. This analysis shows that
test-suite based repair suffers from under-specified bugs, for which trivial
and incorrect patches still pass the test suite. With respect to practical
applicability, it takes on average 14.8 minutes to find a patch. The experiment
was done on a scientific grid, totaling 17.6 days of computation time. All
our systems and experimental results are publicly available on GitHub in
order to facilitate future research on automatic repair.
Engineering polymer informatics: Towards the computer-aided design of polymers
The computer-aided design of polymers is one of the holy grails of modern chemical
informatics and of significant interest for a number of communities in polymer
science. The paper outlines a vision for the in silico design of polymers and presents
an information model for polymers based on modern semantic web technologies, thus
laying the foundations for achieving the vision.
Text Inspector corpus linguistics tool on trial: Checking accuracy for students' writings assessment
Digital tools are increasingly present in education, not only to enhance teaching but also to assist educators with lesson planning and student assessment. This undergraduate dissertation defends the use of corpus linguistics tools by language teachers to carry out their work more efficiently. Its main objective is to test one of these applications, Text Inspector, to find out whether English teachers could use it to evaluate the accuracy of students' writing. To this end, corpora compiled from undergraduate dissertation abstracts of students in Engineering, Business Administration and Early Childhood Teaching at the University of Valladolid (UVa) were entered into the software, which automatically determines the Common European Framework of Reference for Languages (CEFR) level of each group. Some metrics were then applied to the data to validate the reliability of the tool, revealing some limitations.
Departamento de Filología Inglesa. Grado en Estudios Ingleses
Perception and Acceptance of an Autonomous Refactoring Bot
The use of autonomous bots for automatic support in software development
tasks is increasing. In the past, however, they were not always perceived
positively and sometimes experienced a negative bias compared to their human
counterparts. We conducted a qualitative study in which we deployed an
autonomous refactoring bot for 41 days in a student software development
project. In between and at the end, we conducted semi-structured interviews to
find out how developers perceive the bot and whether they are more or less
critical when reviewing the contributions of a bot compared to human
contributions. Our findings show that the bot was perceived as a useful and
unobtrusive contributor, and developers were no more critical of it than they
were of their human colleagues, but only a few team members felt responsible
for the bot. Comment: 8 pages, 2 figures. To be published at the 12th International Conference
on Agents and Artificial Intelligence (ICAART 2020)