Search CORE

13 research outputs found

Shedding light on a living lab: the CLEF NEWSREEL open recommendation platform

Author: Agosti M.
Azzopardi L.
Bennett J.
Cleverdon C.
Dror G.
Seiler J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

In the CLEF NEWSREEL lab, participants are invited to evaluate news recommendation techniques in real-time by providing news recommendations to actual users that visit commercial news portals to satisfy their information needs. A central role within this lab is the communication between participants and the users. This is enabled by The Open Recommendation Platform (ORP), a web-based platform which distributes users' impressions of news articles to the participants and returns their recommendations to the readers. In this demo, we illustrate the platform and show how requests are handled to provide relevant news articles in real-time

DepositOnce

Crossref

Enlighten

Benchmarking News Recommendations in a Living Lab

Author: Brodt Torben
Heintz Tobias
Hopfgartner Frank
Kille Benjamin
Lommatzsch Andreas
Plumbaum Till
Publication venue: Springer Verlag
Publication date: 01/01/2014
Field of study

Most user-centric studies of information access systems in literature suffer from unrealistic settings or limited numbers of users who participate in the study. In order to address this issue, the idea of a living lab has been promoted. Living labs allow us to evaluate research hypotheses using a large number of users who satisfy their information need in a real context. In this paper, we introduce a living lab on news recommendation in real time. The living lab has first been organized as News Recommendation Challenge at ACM RecSys’13 and then as campaign-style evaluation lab NEWSREEL at CLEF’14. Within this lab, researchers were asked to provide news article recommendations to millions of users in real time. Different from user studies which have been performed in a laboratory, these users are following their own agenda. Consequently, laboratory bias on their behavior can be neglected. We outline the living lab scenario and the experimental setup of the two benchmarking events. We argue that the living lab can serve as reference point for the implementation of living labs for the evaluation of information access systems

Crossref

Enlighten

Users' reading habits in online news portals

Author: Albayrak Sahin
Esiyok Cagdas
Hopfgartner Frank
Jain Brijnesh Johannes
Kille Benjamin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

The aim of this study is to survey reading habits of users of an online news portal. The assumption motivating this study is that insight into the reading habits of users can be helpful to design better news recommendation systems. We estimated the transition probabilities that users who read an article of one news category will move to read an article of another (not necessarily distinct) news category. For this, we analyzed the users' click behavior within plista data set. Key findings are the popularity of category local, loyalty of readers to the same category, observing similar results when addressing enforced click streams, and the case that click behavior is highly influenced by the news category

DepositOnce

Crossref

Enlighten

Benchmarking news recommendations: the CLEF NewsREEL use case

Author: Brodt Torben
Hopfgartner Frank
Kille Benjamin
Larson Martha
Lommatzsch Andreas
Seiler Jonas
Serény Andrá
Turrin Roberto
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/12/2015
Field of study

The CLEF NewsREEL challenge is a campaign-style evaluation lab allowing participants to evaluate and optimize news recommender algorithms. The goal is to create an algorithm that is able to generate news items that users would click, respecting a strict time constraint. The lab challenges participants to compete in either a "living lab" (Task 1) or perform an evaluation that replays recorded streams (Task 2). In this report, we discuss the objectives and challenges of the NewsREEL lab, summarize last year's campaign and outline the main research challenges that can be addressed by participating in NewsREEL 2016

Crossref

Enlighten

CLEF 2017 NewsREEL Overview: Offline and Online Evaluation of Stream-based News Recommender Systems

Author: Brodt T.
Cappellato L.
Hopfgartner F.
Kille B.
Larson M.
Lommatzsch A.
Publication venue
Publication date: 01/01/2017
Field of study

The CLEF NewsREEL challenge allows researchers to evaluate news recommendation algorithms both online (NewsREEL Live) and offline (News- REEL Replay). Compared with the previous year NewsREEL challenged participants with a higher volume of messages and new news portals. In the 2017 edition of the CLEF NewsREEL challenge a wide variety of new approaches have been implemented ranging from the use of existing machine learning frameworks, to ensemble methods to the use of deep neural networks. This paper gives an overview over the implemented approaches and discusses the evaluation results. In addition, the main results of Living Lab and the Replay task are explained

Enlighten

Overview of CLEF NEWSREEL 2014: News Recommendations Evaluation Labs

Author: Brodt Torben
Heintz Tobias
Hopfgartner Frank
Kille Benjamin
Lommatzsch Andreas
Seiler Jonas
Publication venue: CEUR
Publication date: 01/01/2014
Field of study

This paper summarises objectives, organisation, and results of the first news recommendation evaluation lab (NEWSREEL 2014). NEWSREEL targeted the evaluation of news recommendation algorithms in the form of a campaignstyle evaluation lab. Participants had the chance to apply two types of evaluation schemes. On the one hand, participants could apply their algorithms onto a data set. We refer to this setting as off-line evaluation. On the other hand, participants could deploy their algorithms on a server to interactively receive recommendation requests. We refer to this setting as on-line evaluation. This setting ought to reveal the actual performance of recommendation methods. The competition strived to illustrate differences between evaluation with historical data and actual users. The on-line evaluation does reflect all requirements which active recommender systems face in practise. These requirements include real-time responses and large-scale data volumes. We present the competition’s results and discuss commonalities regarding participants’ approaches

DepositOnce

Enlighten

Report on the Evaluation-as-a-Service (EaaS) Expert Workshop

Author: Allan Hanbury
Anastasia Krithara
Balikas Georgios
Frank Hopfgartner
Henning Müller
Ivan Eggel
Jayashree Kalpathy-Cramer
Jimmy
Jimmy Lin
Krisztian Balog
Martin Potthast
Noriko Kando
Ounis Iadh
Potthast Martin
Simon Mercer
Tim Gollub
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/06/2015
Field of study

In this report, we summarize the outcome of the "Evaluation-as-a-Service" workshop that was held on the 5th and 6th March 2015 in Sierre, Switzerland. The objective of the meeting was to bring together initiatives that use cloud infrastructures, virtual machines, APIs (Application Programming Interface) and related projects that provide evaluation of information retrieval or machine learning tools as a service

Crossref

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

Enlighten

Offline and online evaluation of news recommender systems at swissinfo.ch

Author: Alazzawi Ayar
Bruttin Christophe
Donatsch Olivier
Faltings Boi
Garcin Florent
Huber Amr
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

We report on the live evaluation of various news recom- mender systems conducted on the website swissinfo.ch. We demonstrate that there is a major diffierence between offine and online accuracy evaluations. In an offine setting, rec- ommending most popular stories is the best strategy, while in a live environment this strategy is the poorest. For online setting, context-tree recommender systems which profile the users in real-time improve the click-through rate by up to 35%. The visit length also increases by a factor of 2.5. Our experience holds important lessons for the evaluation of rec- ommender systems with offine data as well as for the use of the click-through rate as a performance indicator. Copyright © 2014 ACM

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Continuous evaluation of large-scale information access systems : a case for living labs

Author: Balog K.
Ferro N.
Hopfgartner F.
Kelly L.
Kille B.
Larson M.
Lommatzsch A.
Peters C.
Schuth A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

A/B testing is currently being increasingly adopted for the evaluation of commercial information access systems with a large user base since it provides the advantage of observing the efficiency and effectiveness of information access systems under real conditions. Unfortunately, unless university-based researchers closely collaborate with industry or develop their own infrastructure or user base, they cannot validate their ideas in live settings with real users. Without online testing opportunities open to the research communities, academic researchers are unable to employ online evaluation on a larger scale. This means that they do not get feedback for their ideas and cannot advance their research further. Businesses, on the other hand, miss the opportunity to have higher customer satisfaction due to improved systems. In addition, users miss the chance to benefit from an improved information access system. In this chapter, we introduce two evaluation initiatives at CLEF, NewsREEL and Living Labs for IR (LL4IR), that aim to address this growing “evaluation gap” between academia and industry. We explain the challenges and discuss the experiences organizing these living labs

Radboud Repository

White Rose Research Online