83 research outputs found
Hierarchical Transformer with Spatio-Temporal Context Aggregation for Next Point-of-Interest Recommendation
Next point-of-interest (POI) recommendation is a critical task in
location-based social networks, yet remains challenging due to a high degree of
variation and personalization exhibited in user movements. In this work, we
explore the latent hierarchical structure composed of multi-granularity
short-term structural patterns in user check-in sequences. We propose a
Spatio-Temporal context AggRegated Hierarchical Transformer (STAR-HiT) for next
POI recommendation, which employs stacked hierarchical encoders to recursively
encode the spatio-temporal context and explicitly locate subsequences of
different granularities. More specifically, in each encoder, the global
attention layer captures the spatio-temporal context of the sequence, while the
local attention layer performed within each subsequence enhances subsequence
modeling using the local context. The sequence partition layer infers positions
and lengths of subsequences from the global context adaptively, such that
semantics in subsequences can be well preserved. Finally, the subsequence
aggregation layer fuses representations within each subsequence to form the
corresponding subsequence representation, thereby generating a new sequence of
higher-level granularity. The stacking of encoders captures the latent
hierarchical structure of the check-in sequence, which is used to predict the
next visiting POI. Extensive experiments on three public datasets demonstrate
that the proposed model achieves superior performance whilst providing
explanations for recommendations. Codes are available at
https://github.com/JennyXieJiayi/STAR-HiT
Exploring attributes, sequences, and time in Recommender Systems: From classical to Point-of-Interest recommendation
Tesis Doctoral inédita leída en la Universidad Autónoma de Madrid, Escuela Politécnica Superior, Departamento de Ingenieria Informática. Fecha de lectura: 08-07-2021Since the emergence of the Internet and the spread of digital communications
throughout the world, the amount of data stored on the Web has been
growing exponentially. In this new digital era, a large number of companies
have emerged with the purpose of ltering the information available on the
web and provide users with interesting items. The algorithms and models
used to recommend these items are called Recommender Systems. These
systems are applied to a large number of domains, from music, books, or
movies to dating or Point-of-Interest (POI), which is an increasingly popular
domain where users receive recommendations of di erent places when
they arrive to a city.
In this thesis, we focus on exploiting the use of contextual information, especially
temporal and sequential data, and apply it in novel ways in both
traditional and Point-of-Interest recommendation. We believe that this type
of information can be used not only for creating new recommendation models
but also for developing new metrics for analyzing the quality of these
recommendations. In one of our rst contributions we propose di erent
metrics, some of them derived from previously existing frameworks, using
this contextual information. Besides, we also propose an intuitive algorithm
that is able to provide recommendations to a target user by exploiting the
last common interactions with other similar users of the system.
At the same time, we conduct a comprehensive review of the algorithms
that have been proposed in the area of POI recommendation between 2011
and 2019, identifying the common characteristics and methodologies used.
Once this classi cation of the algorithms proposed to date is completed, we
design a mechanism to recommend complete routes (not only independent
POIs) to users, making use of reranking techniques. In addition, due to the
great di culty of making recommendations in the POI domain, we propose
the use of data aggregation techniques to use information from di erent
cities to generate POI recommendations in a given target city.
In the experimental work we present our approaches on di erent datasets
belonging to both classical and POI recommendation. The results obtained
in these experiments con rm the usefulness of our recommendation proposals,
in terms of ranking accuracy and other dimensions like novelty, diversity,
and coverage, and the appropriateness of our metrics for analyzing temporal
information and biases in the recommendations producedDesde la aparici on de Internet y la difusi on de las redes de comunicaciones
en todo el mundo, la cantidad de datos almacenados en la red ha crecido
exponencialmente. En esta nueva era digital, han surgido un gran n umero
de empresas con el objetivo de ltrar la informaci on disponible en la red
y ofrecer a los usuarios art culos interesantes. Los algoritmos y modelos
utilizados para recomendar estos art culos reciben el nombre de Sistemas de
Recomendaci on. Estos sistemas se aplican a un gran n umero de dominios,
desde m usica, libros o pel culas hasta las citas o los Puntos de Inter es (POIs,
en ingl es), un dominio cada vez m as popular en el que los usuarios reciben
recomendaciones de diferentes lugares cuando llegan a una ciudad.
En esta tesis, nos centramos en explotar el uso de la informaci on contextual,
especialmente los datos temporales y secuenciales, y aplicarla de forma novedosa
tanto en la recomendaci on cl asica como en la recomendaci on de POIs.
Creemos que este tipo de informaci on puede utilizarse no s olo para crear
nuevos modelos de recomendaci on, sino tambi en para desarrollar nuevas
m etricas para analizar la calidad de estas recomendaciones. En una de
nuestras primeras contribuciones proponemos diferentes m etricas, algunas
derivadas de formulaciones previamente existentes, utilizando esta informaci
on contextual. Adem as, proponemos un algoritmo intuitivo que es
capaz de proporcionar recomendaciones a un usuario objetivo explotando
las ultimas interacciones comunes con otros usuarios similares del sistema.
Al mismo tiempo, realizamos una revisi on exhaustiva de los algoritmos que
se han propuesto en el a mbito de la recomendaci o n de POIs entre 2011 y
2019, identi cando las caracter sticas comunes y las metodolog as utilizadas.
Una vez realizada esta clasi caci on de los algoritmos propuestos hasta la
fecha, dise~namos un mecanismo para recomendar rutas completas (no s olo
POIs independientes) a los usuarios, haciendo uso de t ecnicas de reranking.
Adem as, debido a la gran di cultad de realizar recomendaciones en el
ambito de los POIs, proponemos el uso de t ecnicas de agregaci on de datos
para utilizar la informaci on de diferentes ciudades y generar recomendaciones
de POIs en una determinada ciudad objetivo.
En el trabajo experimental presentamos nuestros m etodos en diferentes
conjuntos de datos tanto de recomendaci on cl asica como de POIs. Los
resultados obtenidos en estos experimentos con rman la utilidad de nuestras
propuestas de recomendaci on en t erminos de precisi on de ranking y de
otras dimensiones como la novedad, la diversidad y la cobertura, y c omo de
apropiadas son nuestras m etricas para analizar la informaci on temporal y
los sesgos en las recomendaciones producida
Trajectory Reconstruction and Mobility Pattern Analysis Based on Call Detail Record Data
Tehnoloogiad, mis kasutavad geograafilisi andmeid, on muutunud meie igapäevaelu tähtsaks osaks. Tänu sellele on kasvanud asukoha andmetemassiliine salvestamine ja kaevandamine. Seni on GPS tehnoloogiad olnud põhiliseks geograafiliste andmete kogumismeetodiks. Sellega paralleelselt on populaarsust kogunud mobiiliandmete kasutamine positsiooni tuvastamiseks ja liikumismustrite analüüsimiseks. Mobiiliandmete (CDR) põhjal trajektooride taastamiseks on vajalik meetodite kohendamine selleks, et tulemused oleksid korrektsed. Tänu sellele, et telekommunikatsiooni ettevõtted on alustanud suuremat koostööd ja hakanud CDR-andmeid järjest rohkem avalikustama, on mobiiliandmete kasutamine mitmetel aladel suurenenud. Töödeldud mobiiliandmed aitavad anda ülevaadet rahvastiku liikumisest erinevates ulatustes. Samal ajal on trajektooride taastamine CDR-andmetest kohati raskendatud võrreldes GPS-andmetega. Suurimaks probleemiks on algus- ja lõpp-positsioonide asukoha määramine, mis on veelgi enam raskendatud juhul kui objekt liigub.Selle lõputöö eesmärgiks on trajektooride taastamine anonüümsete kasutajatepoolt genereeritud CDR-andmete põhjal. Tulemuste valideerimine GPS-andmetega, mis on loodud paralleelselt mobiiliandmetega ning on vajalik selleks, et määrata saadud trajektooride täpsust. Loodud trajektoore saab kasutada objektide, sealhulgas ka inimeste, liikumismustrite analüüsimiseks ja rahvastiku paiknemise tuvastamiseks, mis aitab linnade planeerimisel ja infrastruktuuride optimeerimisel. Lõputöö väljunditeks on trajektooride taastamine ja täpsuse analüüsimine, lisaks sellele inimese liikumismudelite tuvastamine ja tihedamini külastatavate asukohtade identifitseerimine nagu näiteks kodu, töökoht ja poed.Up until now, GPS data has been greatly used for collecting highlyprecise locational data from moving objects including humans. In contrast, mobile phone data is becoming more and more popular in the last few years. The usage of mobile phone data, that is also known as CDR data, has many benefits over the widely used GPS. This means that the methods used for example in GPS trajectory reconstruction, need to have modifications made be compatible with CDR data.The fact that telecommunication companies have started to cooperate moreand share the CDR data with the public is also a boost to the usage of CDRdata. The processed and analyzed CDR data can be used to get an overview ofcrowd movement in different scales, for example traveling inside a city as opposed to between countries. Extracting trajectories from CDR data has numerous complications.This is due to the fact that the data might not be continuous anddiscovering of the starting point of the object in motion is complicated.The goal of this thesis is to use CDR data in the reconstruction of trajectoriesmade by an anonymous user and to validate the results with GPS data generated in parallel to the CDR data. Reconstructed trajectories can be used for movement analysis and population displacement and would help city planning by optimizing the infrastructures.Outcomes of this thesis are the reconstructed trajectories based on CDR dataand the precisions of final paths. Also, the frequency of CDR events is analyzedin addition to distance distribution. After that the areas that the user visits most frequently are extracted, such as home and work locations
A Taxonomy of Sequential Patterns Based Recommendation Systems
With remarkable expansion of information through the internet, users prefer to receive the exact information they need through some suggestions to save their time and money. Thus, recommendation systems have become the heart of business strategies of E-commerce as they can increase sales and revenue as well as customer loyalty. Recommendation systems techniques provide suggestions for items/products to be purchased, rented or used by a user. The most common type of recommendation system technique is Collaborative Filtering (CF), which takes user’s interest in an item (explicit rating) as input in a matrix known as the user-item rating matrix, and produces an output for unknown ratings of users for items from which top N recommended items for target users are defined. E-commerce recommendation systems usually deal with massive customer sequential databases such as historical purchase or click sequences. The time stamp of a click or purchase event is an important attribute of each dataset as the time interval between item purchases may be useful to learn the next items for purchase by users. Sequential Pattern Mining mines frequent or high utility sequential patterns from a sequential database. Recommendation systems accuracy will be improved if complex sequential patterns of user purchase behavior are learned by integrating sequential patterns of customer clicks and/or purchases into the user-item rating matrix input. Thus, integrating collaborative filtering (CF) and sequential pattern mining (SPM) of historical clicks and purchase data can improve recommendation accuracy, diversity and quality and this survey focuses on review of existing recommendation systems that are sequential pattern based exposing their methodologies, achievements, limitations, and potentials for solving more problems in this domain. This thesis provides a comprehensive and comparative study of the existing Sequential Pattern-based E-commerce recommendation systems (SP-based E-commerce RS) such as ChoRec05, ChenRec09, HuangRec09, LiuRec09, ChoiRec12, Hybrid Model RecSys16, Product RecSys16, SainiRec17, HPCRec18 and HSPCRec19. Thesis shows that integrating sequential patterns mining (SPM) of historical purchase and/or click sequences into user-item matrix for collaborative filtering (CF) (i) Improved recommendation accuracy (ii) Reduced limiting user-item rating data Sparsity (iii) Increased Novelty Rate of the recommendations and (iv) Improved Scalability of the recommendation system. Thus, the importance of sequential patterns of customer behavior in improving the quality of recommendation systems for the application domain of E-commerce is accentuated through this survey by having a comparative performance analysis of the surveyed systems
- …