6 research outputs found
A Topic Recommender for Journalists
The way in which people acquire information on events and form their own
opinion on them has changed dramatically with the advent of social media. For many
readers, the news gathered from online sources become an opportunity to share points
of view and information within micro-blogging platforms such as Twitter, mainly
aimed at satisfying their communication needs. Furthermore, the need to deepen the
aspects related to news stimulates a demand for additional information which is often
met through online encyclopedias, such as Wikipedia. This behaviour has also
influenced the way in which journalists write their articles, requiring a careful assessment
of what actually interests the readers. The goal of this paper is to present
a recommender system, What to Write and Why, capable of suggesting to a journalist,
for a given event, the aspects still uncovered in news articles on which the
readers focus their interest. The basic idea is to characterize an event according to
the echo it receives in online news sources and associate it with the corresponding
readers’ communicative and informative patterns, detected through the analysis of
Twitter and Wikipedia, respectively. Our methodology temporally aligns the results
of this analysis and recommends the concepts that emerge as topics of interest from
Twitter and Wikipedia, either not covered or poorly covered in the published news
articles
Mining Social Media for Newsgathering: A Review
Social media is becoming an increasingly important data source for learning
about breaking news and for following the latest developments of ongoing news.
This is in part possible thanks to the existence of mobile devices, which
allows anyone with access to the Internet to post updates from anywhere,
leading in turn to a growing presence of citizen journalism. Consequently,
social media has become a go-to resource for journalists during the process of
newsgathering. Use of social media for newsgathering is however challenging,
and suitable tools are needed in order to facilitate access to useful
information for reporting. In this paper, we provide an overview of research in
data mining and natural language processing for mining social media for
newsgathering. We discuss five different areas that researchers have worked on
to mitigate the challenges inherent to social media newsgathering: news
discovery, curation of news, validation and verification of content,
newsgathering dashboards, and other tasks. We outline the progress made so far
in the field, summarise the current challenges as well as discuss future
directions in the use of computational journalism to assist with social media
newsgathering. This review is relevant to computer scientists researching news
in social media as well as for interdisciplinary researchers interested in the
intersection of computer science and journalism.Comment: Accepted for publication in Online Social Networks and Medi
Recommendations for item set completion: On the semantics of item co-occurrence with data sparsity, input size, and input modalities
We address the problem of recommending relevant items to a user in order to "complete" a partial set of items already known. We consider the two scenarios of citation and subject label recommendation, which resemble different semantics of item co-occurrence: relatedness for co-citations and diversity for subject labels. We assess the influence of the completeness of an already known partial item set on the recommender performance. We also investigate data sparsity through a pruning parameter and the influence of using additional metadata. As recommender models, we focus on different autoencoders, which are particularly suited for reconstructing missing items in a set. We extend autoencoders to exploit a multi-modal input of text and structured data. Our experiments on six real-world datasets show that supplying the partial item set as input is helpful when item co-occurrence resembles relatedness, while metadata are effective when co-occurrence implies diversity. This outcome means that the semantics of item co-occurrence is an important factor. The simple item co-occurrence model is a strong baseline for citation recommendation. However, autoencoders have the advantage to enable exploiting additional metadata besides the partial item set as input and achieve comparable performance. For the subject label recommendation task, the title is the most important attribute. Adding more input modalities sometimes even harms the result. In conclusion, it is crucial to consider the semantics of the item co-occurrence for the choice of an appropriate recommendation model and carefully decide which metadata to exploit