Verbose, Laconic or Just Right: A Simple Computational Model of Content Appropriateness under Length Constraints

Louis, Annie P; Nenkova, Ani

research

Verbose, Laconic or Just Right: A Simple Computational Model of Content Appropriateness under Length Constraints

Authors: Annie P Louis
Ani Nenkova
Publication date: 1 January 2014
Publisher: 'Association for Computational Linguistics (ACL)'
Doi

Abstract

Length constraints impose implicit requirements on the type of content that can be included in a text. Here we pro- pose the first model to computationally assess if a text deviates from these requirements. Specifically, our model predicts the appropriate length for texts based on content types present in a snippet of constant length. We consider a range of features to approximate content type, including syntactic phrasing, constituent compression probability, presence of named entities, sentence specificity and intersentence continuity. Weights for these features are learned using a corpus of summaries written by experts and on high quality journalistic writing. During test time, the difference between actual and predicted length allows us to quantify text verbosity. We use data from manual evaluation of summarization systems to assess the verbosity scores produced by our model. We show that the automatic verbosity scores are significantly negatively correlated with manual content quality scores given to the summaries

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

University of Essex Research Repository

oai:repository.essex.ac.uk:185...

Last time updated on 09/02/2017

Edinburgh Research Explorer

oai:pure.ed.ac.uk:publications...

Last time updated on 09/08/2016