Incremental clustering of news reports
When an event occurs in the real world, numerous news reports describing this
event start to appear on different news sites within a few minutes of the event occurrence.
This may result in a huge amount of information for users, and automated processes may be
required to help manage this information. In this paper, we describe a clustering system that
can cluster news reports from disparate sources into event-centric clusters—i.e., clusters of
news reports describing the same event. A user can identify any RSS feed as a source of news
they would like to receive, and our clustering system clusters reports received from the
separate RSS feeds as they arrive, without knowing the number of clusters in advance. Our
clustering system was designed to function well in an online incremental environment. In
evaluating our system, we found that it is very good at fine-grained
clustering but performs rather poorly at coarser-grained clustering.
peer-reviewed
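
The abstract does not say which algorithm the system uses. As an illustration only, the following is a minimal sketch of one common way to cluster reports as they arrive without fixing the number of clusters in advance: single-pass threshold clustering over bag-of-words vectors. The class name `IncrementalClusterer`, the helper functions, the cosine-similarity measure, and the `threshold` value are all assumptions made for this example and are not taken from the paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return a bag-of-words term count."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class IncrementalClusterer:
    """Hypothetical single-pass clusterer (not the paper's method)."""

    def __init__(self, threshold=0.3):
        self.threshold = threshold  # assumed similarity cut-off
        self.centroids = []         # summed term counts, one per cluster
        self.members = []           # report ids, one list per cluster

    def add(self, report_id, text):
        """Assign one incoming report to a cluster and return its index."""
        vec = tokenize(text)
        best_idx, best_sim = None, 0.0
        for idx, centroid in enumerate(self.centroids):
            sim = cosine(vec, centroid)
            if sim > best_sim:
                best_idx, best_sim = idx, sim
        if best_idx is not None and best_sim >= self.threshold:
            # Similar enough: merge the report into the existing cluster.
            self.centroids[best_idx].update(vec)
            self.members[best_idx].append(report_id)
            return best_idx
        # Otherwise the report starts a new cluster.
        self.centroids.append(vec)
        self.members.append([report_id])
        return len(self.centroids) - 1


clusterer = IncrementalClusterer(threshold=0.3)
clusterer.add("r1", "Earthquake strikes city centre, buildings damaged")
clusterer.add("r2", "Buildings damaged as earthquake strikes the city")
clusterer.add("r3", "Football team wins championship final")
print(clusterer.members)  # [['r1', 'r2'], ['r3']]
```

In this sketch the number of clusters grows only when a report is not similar enough to any existing cluster, so no cluster count has to be supplied up front. Lowering `threshold` merges more reports into each cluster, which is one simple way such a scheme could trade fine-grained for coarser-grained grouping.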