46 research outputs found
Current Challenges and Visions in Music Recommender Systems Research
Music recommender systems (MRS) have experienced a boom in recent years,
thanks to the emergence and success of online streaming services, which
nowadays make available almost all music in the world at the user's fingertip.
While today's MRS considerably help users to find interesting music in these
huge catalogs, MRS research is still facing substantial challenges. In
particular when it comes to build, incorporate, and evaluate recommendation
strategies that integrate information beyond simple user--item interactions or
content-based descriptors, but dig deep into the very essence of listener
needs, preferences, and intentions, MRS research becomes a big endeavor and
related publications quite sparse.
The purpose of this trends and survey article is twofold. We first identify
and shed light on what we believe are the most pressing challenges MRS research
is facing, from both academic and industry perspectives. We review the state of
the art towards solving these challenges and discuss its limitations. Second,
we detail possible future directions and visions we contemplate for the further
evolution of the field. The article should therefore serve two purposes: giving
the interested reader an overview of current challenges in MRS research and
providing guidance for young researchers by identifying interesting, yet
under-researched, directions in the field
A Scalable Framework for Automatic Playlist Continuation on Music Streaming Services
Music streaming services often aim to recommend songs for users to extend the
playlists they have created on these services. However, extending playlists
while preserving their musical characteristics and matching user preferences
remains a challenging task, commonly referred to as Automatic Playlist
Continuation (APC). Besides, while these services often need to select the best
songs to recommend in real-time and among large catalogs with millions of
candidates, recent research on APC mainly focused on models with few
scalability guarantees and evaluated on relatively small datasets. In this
paper, we introduce a general framework to build scalable yet effective APC
models for large-scale applications. Based on a represent-then-aggregate
strategy, it ensures scalability by design while remaining flexible enough to
incorporate a wide range of representation learning and sequence modeling
techniques, e.g., based on Transformers. We demonstrate the relevance of this
framework through in-depth experimental validation on Spotify's Million
Playlist Dataset (MPD), the largest public dataset for APC. We also describe
how, in 2022, we successfully leveraged this framework to improve APC in
production on Deezer. We report results from a large-scale online A/B test on
this service, emphasizing the practical impact of our approach in such a
real-world application.Comment: Accepted as a Full Paper at the SIGIR 2023 conferenc
Track Mix Generation on Music Streaming Services using Transformers
This paper introduces Track Mix, a personalized playlist generation system
released in 2022 on the music streaming service Deezer. Track Mix automatically
generates "mix" playlists inspired by initial music tracks, allowing users to
discover music similar to their favorite content. To generate these mixes, we
consider a Transformer model trained on millions of track sequences from user
playlists. In light of the growing popularity of Transformers in recent years,
we analyze the advantages, drawbacks, and technical challenges of using such a
model for mix generation on the service, compared to a more traditional
collaborative filtering approach. Since its release, Track Mix has been
generating playlists for millions of users daily, enhancing their music
discovery experience on Deezer.Comment: RecSys 2023 - Industry track with oral presentatio
Automated Playlist Continuation with Apache PredictionIO
The Minrva project team, a software development research group based at the University of Illinois Library, developed a data-focused recommender system to participate in the creative track of the 2018 ACM RecSys Challenge, which focused on music recommendation. We describe here the large-scale data processing the Minrva team researched and developed for foundational reconciliation of the Million Playlist Dataset using external authority data on the web (e.g. VIAF, WikiData). The secondary focus of the research was evaluating and adapting the processing tools that support data reconciliation. This paper reports on the playlist enrichment process, indexing, and subsequent recommendation model developed for the music recommendation challenge.Ope
A hybrid approach for item collection recommendations : an application to automatic playlist continuation
Current recommender systems aim mainly to generate accurate item recommendations, without properly evaluating the multiple dimensions of the recommendation problem. However, in many domains, like in music, where items are rarely consumed in isolation, users would rather need a set of items, designed to work well together, while having some cognitive properties as a whole, related to their perception of quality and satisfaction.
In this thesis, a hybrid case-based recommendation approach for item collections is proposed. In particular, an application to automatic playlist continuation, addressing similar cognitive concepts, rather than similar users, is presented. Playlists, that are sets of music items designed to be consumed as a sequence, with a specific purpose and within a specific context, are treated as cases. The proposed recommender system is based on a meta-level hybridization. First, Latent Dirichlet Allocation is applied to the set of past playlists, described as distributions over music styles, to identify their underlying concepts. Then, for a started playlist, its semantic characteristics, like its latent concept and the styles of the included items, are inferred, and Case-Based Reasoning is applied to the set of past playlists addressing the same concept, to construct and recommend a relevant playlist continuation. A graph-based item model is used to overcome the semantic gap between songs’ signal-based descriptions and users’ high-level preferences, efficiently capture the playlists’ structures and the similarity of the music items in those. As the proposed method bases its reasoning on previous playlists, it does not require the construction of complex user profiles to generate accurate recommendations. Furthermore, apart from relevance, support to parameters beyond accuracy, like increased coherence or support to diverse items is provided to deliver a more complete user experience.
Experiments on real music datasets have revealed improved results, compared to other state of the art techniques, while achieving a “good trade-off” between recommendations’ relevance, diversity and coherence. Finally, although actually focusing on playlist continuations, the designed approach could be easily adapted to serve other recommendation domains with similar characteristics.Los sistemas de recomendación actuales tienen como objetivo principal generar recomendaciones precisas de artículos, sin evaluar propiamente las múltiples dimensiones del problema de recomendación. Sin embargo, en dominios como la música, donde los artículos rara vez se consumen en forma aislada, los usuarios más bien necesitarían recibir recomendaciones de conjuntos de elementos, diseñados para que se complementaran bien juntos, mientras se cubran algunas propiedades cognitivas, relacionadas con su percepción de calidad y satisfacción. En esta tesis, se propone un sistema híbrido de recomendación meta-nivel, que genera recomendaciones de colecciones de artículos. En particular, el sistema se centra en la generación automática de continuaciones de listas de música, tratando conceptos cognitivos similares, en lugar de usuarios similares. Las listas de reproducción son conjuntos de elementos musicales diseñados para ser consumidos en secuencia, con un propósito específico y dentro de un contexto específico. El sistema propuesto primero aplica el método de Latent Dirichlet Allocation a las listas de reproducción, que se describen como distribuciones sobre estilos musicales, para identificar sus conceptos. Cuando se ha iniciado una nueva lista, se deducen sus características semánticas, como su concepto y los estilos de los elementos incluidos en ella. A continuación, el sistema aplica razonamiento basado en casos, utilizando las listas del mismo concepto, para construir y recomendar una continuación relevante. Se utiliza un grafo que modeliza las relaciones de los elementos, para superar el ?salto semántico? existente entre las descripciones de las canciones, normalmente basadas en características sonoras, y las preferencias de los usuarios, expresadas en características de alto nivel. También se utiliza para calcular la similitud de los elementos musicales y para capturar la estructura de las listas de dichos elementos. Como el método propuesto basa su razonamiento en las listas de reproducción y no en usuarios que las construyeron, no se requiere la construcción de perfiles de usuarios complejos para poder generar recomendaciones precisas. Aparte de la relevancia de las recomendaciones, el sistema tiene en cuenta parámetros más allá de la precisión, como mayor coherencia o soporte a la diversidad de los elementos para enriquecer la experiencia del usuario. Los experimentos realizados en bases de datos reales, han revelado mejores resultados, en comparación con las técnicas utilizadas normalmente. Al mismo tiempo, el algoritmo propuesto logra un "buen equilibrio" entre la relevancia, la diversidad y la coherencia de las recomendaciones generadas. Finalmente, aunque la metodología presentada se centra en la recomendación de continuaciones de listas de reproducción musical, el sistema se puede adaptar fácilmente a otros dominios con características similares.Postprint (published version
Towards a better understanding of music playlist titles and descriptions
Music playlists, either user-generated or curated by music streaming services, often come with titles and descriptions. Although informative, these titles and descriptions make up a sparse and noisy semantic space that is challenging to be leveraged for tasks such as making music recommendations. This dissertation is dedicated to developing a better understanding of playlist titles and descriptions by leveraging track sequences in playlists. Specifically, work has been done to capture latent patterns in tracks by an embedding approach, and the latent patterns are found to be well aligned with the organizing principles of mixtapes identified more than a decade ago. The effectiveness of the latent patterns is evaluated by the task of generating descriptive keywords/tags for playlists given tracks, indicating that the latent patterns learned from tracks in playlists are able to provide a good understanding of playlist titles and descriptions. The identified latent patterns are further leveraged to improve model performance on the task of predicting missing tracks given playlist titles and descriptions. Experimental results show that the proposed models yield improvements to the task, especially when playlist descriptions are provided as model input in addition to titles. The main contributions of this work include (1) providing a better solution to dealing with ``cold-start'' playlists in music recommender systems, and (2) proposing an effective approach to automatically generating descriptive keywords/tags for playlists using track sequences
Word2vec embeddings for playlist recommendation
Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2018, Director: Santi Seguí Mesquida i Guillem Pascual i Guinovart[en] We present an ML approach to musical playlist recommendation. Using the algorithm Word2Vec, a shallow two-layer neural network trained to reconstruct linguistic context of words, we have created several embeddings using tracks and playlist titles as words of an artificial vocabulary. Some experiments with different trade-offs between the diversity and the popularity of songs in playlists are analyzed and discussed. By means of combining a tracks embedding and a titles embedding our recommender has reached 19 percent of accuracy. Our model has been created and trained using the MPD (million playlists dataset) given by Spotify as part of the RecSys Challenge 2018
Music Recommendations
International audienceToday's online music services like Spotify provide their listeners with different types of music recommendations, e.g., in the form of weekly recommendations or personalized radio stations. Such recommendations are often based, at least in parts, on collaborative filtering techniques. In this chapter, we first review the different types of music recommendations that can be found in practice and discuss the specific challenges of the domain. Next, we discuss technical approaches for the problems of music discovery and next-track recommendation in more depth, with a specific focus on their practical application at Spotify. Finally, we further elaborate on open challenges in the field and revisit the specific problems of evaluating music recommendation systems in academic environments