16,882 research outputs found
Modelling Sequential Music Track Skips using a Multi-RNN Approach
Modelling sequential music skips provides streaming companies the ability to
better understand the needs of the user base, resulting in a better user
experience by reducing the need to manually skip certain music tracks. This
paper describes the solution of the University of Copenhagen DIKU-IR team in
the 'Spotify Sequential Skip Prediction Challenge', where the task was to
predict the skip behaviour of the second half in a music listening session
conditioned on the first half. We model this task using a Multi-RNN approach
consisting of two distinct stacked recurrent neural networks, where one network
focuses on encoding the first half of the session and the other network focuses
on utilizing the encoding to make sequential skip predictions. The encoder
network is initialized by a learned session-wide music encoding, and both of
them utilize a learned track embedding. Our final model consists of a majority
voted ensemble of individually trained models, and ranked 2nd out of 45
participating teams in the competition with a mean average accuracy of 0.641
and an accuracy on the first skip prediction of 0.807. Our code is released at
https://github.com/Varyn/WSDM-challenge-2019-spotify.Comment: 4 page
Predicting Session Length in Media Streaming
Session length is a very important aspect in determining a user's
satisfaction with a media streaming service. Being able to predict how long a
session will last can be of great use for various downstream tasks, such as
recommendations and ad scheduling. Most of the related literature on user
interaction duration has focused on dwell time for websites, usually in the
context of approximating post-click satisfaction either in search results, or
display ads. In this work we present the first analysis of session length in a
mobile-focused online service, using a real world data-set from a major music
streaming service. We use survival analysis techniques to show that the
characteristics of the length distributions can differ significantly between
users, and use gradient boosted trees with appropriate objectives to predict
the length of a session using only information available at its beginning. Our
evaluation on real world data illustrates that our proposed technique
outperforms the considered baseline.Comment: 4 pages, 3 figure
ํธ๋์คํฌ๋จธ ๊ธฐ๋ฐ ์์ ์คํธ๋ฆฌ๋ฐ ์ธ์ ์ถ์ฒ ์์คํ
ํ์๋
ผ๋ฌธ(์์ฌ) -- ์์ธ๋ํ๊ต๋ํ์ : ๋ฐ์ดํฐ์ฌ์ด์ธ์ค๋ํ์ ๋ฐ์ดํฐ์ฌ์ด์ธ์คํ๊ณผ, 2022. 8. ์ ํจํ.Recommendation systems have grown in popularity over the last few years, with the rise of big data and development of computing resources. Compared to simple rule based methods or content based filtering methods used for recommendation during the early development stage of recommendation systems, recent methodologies try to implement much more complex models. Latent factor models and collaborative filtering methods were developed to find similarities between users and items without actually knowing their characteristics, and gained popularity. Various item domains, mainly movie and retail, have extensively used these recommendation algorithms.
With the development of deep learning architectures, various deep learning based recommendation systems emerged in recent years. While a lot of them were focused on generating the predicted item ratings when given a big data comprised of user ids, item ids, and ratings, there were some efforts to generate next-item recommendations as well. Next-item recommendations receive a session or sequence of actions by some user, and try to predict the next action of a user. NVIDIA recently used Transformers, a deep learning architecture in the field of Natural Language Processing (NLP), to build a session based recommendation system called Transformers4Rec. The system showed state of the art performances for the usual movie and retail domains.
In the music domain, unfortunately, advanced models for session-based recommendations have been explored to a small extent. Therefore, this thesis will attempt to apply Transformer based architectures to session-based recommendation for music streaming, by utilizing a dataset from Spotify and framework from NVIDIA. In this thesis, unique characteristics of music data that validates this researchโs purpose are explored. The effectiveness of Transformer architectures on music data are shown with next-item prediction performances on actual user streaming session data, and methods for feature engineering and data preprocessing to ensure the best prediction results are investigated. An empirical analysis that compares various Transformer architectures is also provided, with models further analyzed with additional feature information.์ต๊ทผ ํธ๋์คํฌ๋จธ ๊ธฐ๋ฐ ์ถ์ฒ์์คํ
๋ค์ด ๋ค์ํ ๋ถ์ผ์์ ๋์ ์ฑ๋ฅ์ ๋ณด์ฌ์๋ค. ํ์ง๋ง ์์
์คํธ๋ฆฌ๋ฐ ๋ถ์ผ์๋ ์ ์ฉ๋์ง ์์์๊ณ , ์ด ๋
ผ๋ฌธ์ ํตํด ์์
์คํธ๋ฆฌ๋ฐ ๋ถ์ผ์ ํธ๋์คํฌ๋จธ ๊ธฐ๋ฐ ์ธ์
์ถ์ฒ์์คํ
์ด ์ด๋ค ์ฑ๋ฅ์ ๋ณด์ฌ์ฃผ๋์ง ํ์ํด ๋ณด์๋ค. ๋ฐ์ดํฐ ์ ์ฒ๋ฆฌ๋ฅผ ํตํด ์ ์ ๋ค์ด ์์
์ ์ค์ ๋ก ์ข์ํด์ ๋ค์์ ๋ฒํ ์ธ์
๋ค๋ง ๋จ๊ธฐ๋ ค ๋
ธ๋ ฅํ๊ณ , ์ธ์
๊ธฐ๋ฐ ์ถ์ฒ์์คํ
์ ๋ง๊ฒ ๋ฐ์ดํฐ๋ฅผ ์ ์ ํ๋ค. ์์
๊ณผ ๊ด๋ จ๋ ๋ค์ํ ์ ๋ณด๋ค๋ ๋ชจ๋ธ ํ๋ จ์ ๋ฐ์ํ๊ธฐ ์ํด ์นดํ
๊ณ ๋ฆฌ ํํ๋ก ๋ฐ๊ฟ์ฃผ์๊ณ , ํ๋ จ ์์ฒด๋ ์ธ์
๊ธฐ๋ฐ ์ถ์ฒ์์คํ
์์ ์์ฃผ ์ฐ์ด๋ ์ ์ง์ ํ๋ จ๋ฒ์ ํ์ฉํ๋ค. ์ต์ข
์คํ ๊ฒฐ๊ณผ์์๋ ๋ฐ์ดํฐ์ ๋น์ ์ ์ฑ๊ณผ ๋น๋ฐ์ง์ฑ์ ๊ทน๋ณตํ๊ณ ๋น์ทํ ๋ฐ์ดํฐ์
๊ณผ ๊ฒฝ์๋ ฅ์ ๊ฐ์ถ๋ ์ฑ๊ณผ๋ฅผ ๋ณด์ฌ์ฃผ์๋ค. ์ด ์ฐ๊ตฌ๋ฅผ ํตํด ์์
์คํธ๋ฆฌ๋ฐ ์ธ์
์ถ์ฒ์์คํ
์ ํธ๋์คํฌ๋จธ ๊ธฐ๋ฐ ๋ชจ๋ธ์ด๋ผ๋ ์๋ก์ด ๊ฐ๋ฅ์ฑ์ ๋ณด์ฌ ์ฃผ์๊ณ , ์ถํ ์ฐ๊ตฌ์๋ค์ด ์ฐธ๊ณ ํ ์ ์๋ ์์์ ์ ์ ๊ณตํ์๋ค.1 Introduction 1
1.1 Research Topic 1
1.2 Purpose of Research 2
1.3 Need for Research 3
1.3.1 Recent Trends 3
1.3.2 Dataset Characteristics 4
2 Related Works 9
2.1 Overview of NLP and RecSys 9
2.2 Past Works on Incorporating Features 12
3 Methodology 13
3.1 Music Streaming Sessions Dataset 13
3.2 Music Recommendation Model 14
3.2.1 NVTabular 15
3.2.2 Transformers4Rec 15
3.3 Feature Embeddings 16
3.4 Session Information 17
3.5 Transformer Architectures 18
3.6 Metrics 19
4 Experiments 21
4.1 Data Preprocessing 21
4.2 Embedding 23
4.2.1 No features 23
4.2.2 Session features 23
4.2.3 Song features 25
4.3 Hyperparameters 27
4.4 Training 28
4.4.1 Problem Statement 28
4.4.2 Pipeline 28
4.4.3 Incremental Training, Evaluation 29
4.5 Results 30
4.5.1 Simple item IDs 30
4.5.2 Item IDs + Session Information 31
4.5.3 Item IDs + Session Information + Track Metadata 32
5 Conclusion and Future Works 34
Bibliography 36
์ด ๋ก 41์
- โฆ