Search CORE

4 research outputs found

Neurala klickmodeller med latenta variabler för webbsöksystem

Author: Svebrant Henrik
Publication venue: KTH, Skolan för elektroteknik och datavetenskap (EECS)
Publication date: 01/01/2018
Field of study

User click modeling in web search is most commonly done through probabilistic graphical models. Due to the successful use of machine learning techniques in other fields of research, it is interesting to evaluate how machine learning can be applied to click modeling. In this thesis, modeling is done using recurrent neural networks trained on a distributed representation of the state of the art user browsing model (UBM). It is further evaluated how extending this representation with a set of latent variables that are easily derivable from click logs, can affect the model's prediction performance. Results show that a model using the original representation does not perform very well. However, the inclusion of simple variables can drastically increase the performance regarding the click prediction task. For which it manages to outperform the two chosen baseline models, which themselves are well performing already. It also leads to increased performance for the relevance prediction task, although the results are not as significant. It can be argued that the relevance prediction task is not a fair comparison to the baseline functions, due to them needing more significant amounts of data to learn the respective probabilities. However, it is favorable that the neural models manage to perform quite well using smaller amounts of data. It would be interesting to see how well such models would perform when trained on far greater data quantities than what was used in this project. Also tailoring the model for the use of LSTM, which supposedly could increase performance even more. Evaluating other representations than the one used would also be of interest, as this representation did not perform remarkably on its own.Klickmodellering av användare i söksystem görs vanligtvis med hjälp av probabilistiska modeller. På grund av maskininlärningens framgångar inom andra områden är det intressant att undersöka hur dessa tekniker kan appliceras för klickmodellering. Detta examensarbete undersöker klickmodellering med hjälp av recurrent neural networks tränade på en distribuerad representation av en populär och välpresterande klickmodell benämnd user browsing model (UBM). Det undersöks vidare hur utökandet av denna representation med statistiska variabler som enkelt kan utvinnas från klickloggar, kan påverka denna modells prestanda. Resultaten visar att grundrepresentationen inte presterar särskilt bra. Däremot har användningen av simpla variabler visats medföra drastiska prestandaökningar när det kommer till att förutspå en användares klick. I detta syfte lyckas modellerna prestera bättre än de två baselinemodeller som valts, vilka redan är välpresterande för syftet. De har även lyckats förbättra modellernas förmåga att förutspå relevans, fastän skillnaderna inte är lika drastiska. Relevans utgör inte en lika jämn jämförelse gentemot baselinemodellerna, då dessa kräver mycket större datamängder för att nå verklig prestanda. Det är däremot fördelaktigt att de neurala modellerna når relativt god prestanda för datamängden som använts. Det vore intressant att undersöka hur dessa modeller skulle prestera när de tränas på mycket större datamängder än vad som använts i detta projekt. Även att skräddarsy modellerna för LSTM, vilket borde kunna öka prestandan ytterligare. Att evaluera andra representationer än den som användes i detta projekt är också av intresse, då den använda representationen inte presterade märkvärdigt i sin grundform

Neurala klickmodeller med latenta variabler för webbsöksystem

Author: Svebrant Henrik
Publication venue: KTH, Skolan för elektroteknik och datavetenskap (EECS)
Publication date: 01/01/2018
Field of study

Publikationer från KTH

Neurala klickmodeller med latenta variabler för webbsöksystem

Author: Svebrant Henrik
Publication venue: KTH, Skolan för elektroteknik och datavetenskap (EECS)
Publication date: 01/01/2018
Field of study

Publikationer från KTH

Digitala Vetenskapliga Arkivet - Academic Archive On-line

A comparative study of the conventional item-based collaborative filtering and the Slope One algorithms for recommender systems

Author: Svanberg John
Svebrant Henrik
Publication venue: KTH, Skolan för datavetenskap och kommunikation (CSC)
Publication date: 01/01/2016
Field of study

Recommender systems are an important research topic in todays society as the amount of data increases across the globe. In order for commercial systems to give their users good and personalized recommendations on what data may be of interest to them in an effective manner, such a system must be able to give recommendations quickly and scale well as data increases. The purpose of this study is to evaluate two such algorithms with this in mind. The two different algorithm families tested are classified as item-based collaborative filtering but work very differently. It is therefore of interest to see how their complexities affect their performance, accuracy as well as scalability. The Slope One family is much simpler to implement and proves to be equally as efficient, if not even more efficient than the conventional item-based ones. Both families do require a precomputation stage before recommendations are possible to give, this is the stage where Slope One suffers in comparison to the conventional item-based one. The algorithms are tested using Lenskit, on data provided by GroupLens and their MovieLens project

Publikationer från KTH

Digitala Vetenskapliga Arkivet - Academic Archive On-line