75 research outputs found
Topic-Level Bayesian Surprise and Serendipity for Recommender Systems
A recommender system that optimizes its recommendations solely to fit a
user's history of ratings for consumed items can create a filter bubble,
wherein the user does not get to experience items from novel, unseen
categories. One approach to mitigate this undesired behavior is to recommend
items with high potential for serendipity, namely surprising items that are
likely to be highly rated. In this paper, we propose a content-based
formulation of serendipity that is rooted in Bayesian surprise and use it to
measure the serendipity of items after they are consumed and rated by the user.
When coupled with a collaborative-filtering component that identifies similar
users, this enables recommending items with high potential for serendipity. To
facilitate the evaluation of topic-level models for surprise and serendipity,
we introduce a dataset of book reading histories extracted from Goodreads,
containing over 26 thousand users and close to 1.3 million books, where we
manually annotate 449 books read by 4 users in terms of their time-dependent,
topic-level surprise. Experimental evaluations show that models that use
Bayesian surprise correlate much better with the manual annotations of
topic-level surprise than distance-based heuristics, and also obtain better
serendipitous item recommendation performance
Chord Recognition in Symbolic Music: A Segmental CRF Model, Segment-Level Features, and Comparative Evaluations on Classical and Popular Music
We present a new approach to harmonic analysis that is trained to segment music into a sequence of chord spans tagged with chord labels. Formulated as a semi-Markov Conditional Random Field (semi-CRF), this joint segmentation and labeling approach enables the use of a rich set of segment-level features, such as segment purity and chord coverage, that capture the extent to which the events in an entire segment of music are compatible with a candidate chord label. The new chord recognition model is evaluated extensively on three corpora of Western classical music and a newly created corpus of rock music. Experimental results show that the semi-CRF model performs substantially better than previous approaches when trained on a sufficient number of labeled examples and remains competitive when the amount of training data is limited
- …