Search CORE

108 research outputs found

Summarising News Stories for Children

Author: Macdonald Iain
Siddharthan Advaith
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2016
Field of study

This paper proposes a system to automatically summarise news articles in a manner suitable for children by deriving and combining statistical ratings for how important, positively oriented and easy to read each sentence is. Our results demonstrate that this approach succeeds in generating summaries that are suitable for children, and that there is further scope for combining this extractive approach with abstractive methods used in text implification

Aberdeen University Research

Crossref

Open Research Online (The Open University)

Text Simplification using Typed Dependencies : A Comparision of the Robustness of Different Generation Strategies

Author: Siddharthan Advaith
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2011
Field of study

This work was supported by the Economic and Social Research Council (Grant Number RES-000-22- 3272).Publisher PD

Aberdeen University Research

CiteSeerX

SaferDrive: an NLG-based Behaviour Change Support System for Drivers

Author: Braun Daniel
Reiter Ehud
Siddharthan Advaith
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 19/02/2018
Field of study

Despite the long history of Natural Language Generation (NLG) research, the potential for influencing real world behaviour through automatically generated texts has not received much attention. In this paper, we present SaferDrive, a behaviour change support system that uses NLG and telematic data in order to create weekly textual feedback for automobile drivers, which is delivered through a smartphone application. Usage-based car insurances use sensors to track driver behaviour. Although the data collected by such insurances could provide detailed feedback about the driving style, they are typically withheld from the driver and used only to calculate insurance premiums. SaferDrive instead provides detailed textual feedback about the driving style, with the intent to help drivers improve their driving habits. We evaluate the system with real drivers and report that the textual feedback generated by our system does have a positive influence on driving habits, especially with regard to speeding

Aberdeen University Research

Crossref

Open Research Online (The Open University)

Recognizing cited facts and principles in legal judgements

Author: Shulayeva Olga
Siddharthan Advaith
Wyner Adam
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

In common law jurisdictions, legal professionals cite facts and legal principles from precedent cases to support their arguments before the court for their intended outcome in a current case. This practice stems from the doctrine of stare decisis, where cases that have similar facts should receive similar decisions with respect to the principles. It is essential for legal professionals to identify such facts and principles in precedent cases, though this is a highly time intensive task. In this paper, we present studies that demonstrate that human annotators can achieve reasonable agreement on which sentences in legal judgements contain cited facts and principles (respectively, κ=0.65 and κ=0.95 for inter- and intra-annotator agreement). We further demonstrate that it is feasible to automatically annotate sentences containing such legal facts and principles in a supervised machine learning framework based on linguistic features, reporting per category precision and recall figures of between 0.79 and 0.89 for classifying sentences in legal judgements as cited facts, principles or neither using a Bayesian classifier, with an overall κ of 0.72 with the human-annotated gold standard

Aberdeen University Research

Crossref

Springer - Publisher Connector

Open Research Online (The Open University)

Cronfa at Swansea University

Recommended from our members

Columbia University at MSE 2005

Author: Evans Dave
Siddharthan Advaith
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2005
Field of study

We describe our participation in the Multilingual Summarization Evaluation 2005

Columbia University Academic Commons

Whose idea was this, and why does it matter? Attributing scientific work to citations

Author: Siddharthan Advaith
Teufel Simone
Publication venue
Publication date: 01/01/2007
Field of study

Scientific papers revolve around citations, and for many discourse level tasks one needs to know whose work is being talked about at any point in the discourse. In this paper, we introduce the scientific attribution task, which links different linguistic expressions to citations. We discuss the suitability of different evaluation metrics and evaluate our classification approach to deciding attribution both intrinsically and in an extrinsic evaluation where information about scientific attribution is shown to improve performance on Argumentative Zoning, a rhetorical classification task

CiteSeerX

Open Research Online (The Open University)

Generating Referring Expressions in Open Domains

Author: Copestake Ann
Siddharthan Advaith
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2004
Field of study

We present an algorithm for generating referring expressions in open domains. Existing algorithms work at the semantic level and assume the availability of a classification for attributes, which is only feasible for restricted domains. Our alternative works at the realisation level, relies on Word-Net synonym and antonym sets, and gives equivalent results on the examples cited in the literature and improved results for examples that prior approaches cannot handle. We believe that ours is also the first algorithm that allows for the incremental incorporation of relations. We present a novel corpus-evaluation using referring expressions from the Penn Wall Street Journal Treebank

CiteSeerX

Crossref

Columbia University Academic Commons

Open Research Online (The Open University)

Incorporating Constraints into Matrix Factorization for Clothes Package Recommendation

Author: Lin Chenghua
Masthoff Judith
Siddharthan Advaith
Wibowa Agung
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 11/07/2018
Field of study

Recommender systems have been widely applied in the literature to suggest individual items to users. In this paper, we consider the harder problem of package recommendation, where items are recommended together as a package. We focus on the clothing domain, where a package recommendation involves a combination of a "top'' (e.g. a shirt) and a "bottom'' (e.g. a pair of trousers). The novelty in this work is that we combined matrix factorisation methods for collaborative filtering with hand-crafted and learnt fashion constraints on combining item features such as colour, formality and patterns. Finally, to better understand where the algorithms are underperforming, we conducted focus groups, which lead to deeper insights into how to use constraints to improve package recommendation in this domain

Crossref

Open Research Online (The Open University)

Recommended from our members

Summarising the points made in online political debates

Author: Egan Charlie
Siddharthan Advaith
Wyner Adam
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 12/08/2016
Field of study

Online communities host growing numbers of discussions amongst large groups of participants on all manner of topics. This user-generated content contains millions of statements of opinions and ideas. We propose an abstractive approach to summarize such argumentative discussions, making key content accessible through ‘point’ extraction, where a point is a verb and its syntactic arguments. Our approach uses both dependency parse information and verb case frames to identify and extract valid points, and generates an abstractive summary that discusses the key points being made in the debate. We performed a human evaluation of our approach using a corpus of online political debates and report significant improvements over a high-performing extractive summarizer

Open Research Online (The Open University)