Machine Learning and Text Segmentation in Novelty Detection

McKeown, Kathleen; Schiffman, Barry

research

Machine Learning and Text Segmentation in Novelty Detection

Authors: Kathleen McKeown
Barry Schiffman
Publication date: 1 January 2004
Publisher: 'Columbia University Libraries/Information Services'
Doi

Abstract

This paper explores a combination of machine learning, approximate text segmentation and a vector-space model to distinguish novel information from repeated information. In experiments with the data from the Novelty Track at the Text Retrieval Conference, we show improvements over a variety of approaches, in particular in raising precision scores on this data, while maintaining a reasonable amount of recall

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.94.77...

Last time updated on 23/10/2014

Sustaining member

Columbia University Academic Commons

oai:academiccommons.columbia.e...

Last time updated on 02/10/2018