52 research outputs found
Latitude, longitude, and beyond:mining mobile objects' behavior
Rapid advancements in Micro-Electro-Mechanical Systems (MEMS), and wireless communications, have resulted in a surge in data generation. Mobility data is one of the various forms of data, which are ubiquitously collected by different location sensing devices. Extensive knowledge about the behavior of humans and wildlife is buried in raw mobility data. This knowledge can be used for realizing numerous viable applications ranging from wildlife movement analysis, to various location-based recommendation systems, urban planning, and disaster relief. With respect to what mentioned above, in this thesis, we mainly focus on providing data analytics for understanding the behavior and interaction of mobile entities (humans and animals). To this end, the main research question to be addressed is: How can behaviors and interactions of mobile entities be determined from mobility data acquired by (mobile) wireless sensor nodes in an accurate and efficient manner? To answer the above-mentioned question, both application requirements and technological constraints are considered in this thesis. On the one hand, applications requirements call for accurate data analytics to uncover hidden information about individual behavior and social interaction of mobile entities, and to deal with the uncertainties in mobility data. Technological constraints, on the other hand, require these data analytics to be efficient in terms of their energy consumption and to have low memory footprint, and processing complexity
A Distributed Routing Algorithm for Internet-wide Geocast
Geocast is the concept of sending data packets to nodes in a specified
geographical area instead of nodes with a specific address. To route geocast
messages to their destination we need a geographic routing algorithm that can
route packets efficiently to the devices inside the destination area. Our goal
is to design an algorithm that can deliver shortest path tree like forwarding
while relying purely on distributed data without central knowledge. In this
paper, we present two algorithms for geographic routing. One based purely on
distance vector data, and one more complicated algorithm based on path data. In
our evaluation, we show that our purely distance vector based algorithm can
come close to shortest path tree performance when a small number of routers are
present in the destination area. We also show that our path based algorithm can
come close to the performance of a shortest path tree in almost all geocast
situations
Spaceprint: a Mobility-based Fingerprinting Scheme for Public Spaces
In this paper, we address the problem of how automated situation-awareness
can be achieved by learning real-world situations from ubiquitously generated
mobility data. Without semantic input about the time and space where situations
take place, this turns out to be a fundamental challenging problem.
Uncertainties also introduce technical challenges when data is generated in
irregular time intervals, being mixed with noise, and errors. Purely relying on
temporal patterns observable in mobility data, in this paper, we propose
Spaceprint, a fully automated algorithm for finding the repetitive pattern of
similar situations in spaces. We evaluate this technique by showing how the
latent variables describing the category, and the actual identity of a space
can be discovered from the extracted situation patterns. Doing so, we use
different real-world mobility datasets with data about the presence of mobile
entities in a variety of spaces. We also evaluate the performance of this
technique by showing its robustness against uncertainties
Unsupervised Discretization by Two-dimensional MDL-based Histogram
Unsupervised discretization is a crucial step in many knowledge discovery
tasks. The state-of-the-art method for one-dimensional data infers locally
adaptive histograms using the minimum description length (MDL) principle, but
the multi-dimensional case is far less studied: current methods consider the
dimensions one at a time (if not independently), which result in
discretizations based on rectangular cells of adaptive size. Unfortunately,
this approach is unable to adequately characterize dependencies among
dimensions and/or results in discretizations consisting of more cells (or bins)
than is desirable. To address this problem, we propose an expressive model
class that allows for far more flexible partitions of two-dimensional data. We
extend the state of the art for the one-dimensional case to obtain a model
selection problem based on the normalised maximum likelihood, a form of refined
MDL. As the flexibility of our model class comes at the cost of a vast search
space, we introduce a heuristic algorithm, named PALM, which partitions each
dimension alternately and then merges neighbouring regions, all using the MDL
principle. Experiments on synthetic data show that PALM 1) accurately reveals
ground truth partitions that are within the model class (i.e., the search
space), given a large enough sample size; 2) approximates well a wide range of
partitions outside the model class; 3) converges, in contrast to its closest
competitor IPD; and 4) is self-adaptive with regard to both sample size and
local density structure of the data despite being parameter-free. Finally, we
apply our algorithm to two geographic datasets to demonstrate its real-world
potential.Comment: 30 pages, 9 figure
Spaceprint: a Mobility-based Fingerprinting Scheme for Spaces
In this paper, we address the problem of how automated situational awareness in a specifi c location can be achieved by characterizing the fingerprint of recurrent situations from ubiquitously generated mobility data. Without semantic input about the time and space (location) where situations take place, this turns out to be a fundamental challenging problem. Uncertainties in data also introduce technical challenges when data is generated in irregular time intervals, being mixed with noise, and errors. Purely relying on temporal patterns observable in mobility data, in this paper, we propose Spaceprint, a fully automated algorithm for fi nding the repetitive pattern of similar situations in spaces. We evaluate this technique by showing how the latent variables describing the actual identity of a space can be discovered from the extracted situation patterns
Joint Geographical and Temporal Modeling based on Matrix Factorization for Point-of-Interest Recommendation
With the popularity of Location-based Social Networks, Point-of-Interest
(POI) recommendation has become an important task, which learns the users'
preferences and mobility patterns to recommend POIs. Previous studies show that
incorporating contextual information such as geographical and temporal
influences is necessary to improve POI recommendation by addressing the data
sparsity problem. However, existing methods model the geographical influence
based on the physical distance between POIs and users, while ignoring the
temporal characteristics of such geographical influences. In this paper, we
perform a study on the user mobility patterns where we find out that users'
check-ins happen around several centers depending on their current temporal
state. Next, we propose a spatio-temporal activity-centers algorithm to model
users' behavior more accurately. Finally, we demonstrate the effectiveness of
our proposed contextual model by incorporating it into the matrix factorization
model under two different settings: i) static and ii) temporal. To show the
effectiveness of our proposed method, which we refer to as STACP, we conduct
experiments on two well-known real-world datasets acquired from Gowalla and
Foursquare LBSNs. Experimental results show that the STACP model achieves a
statistically significant performance improvement, compared to the
state-of-the-art techniques. Also, we demonstrate the effectiveness of
capturing geographical and temporal information for modeling users' activity
centers and the importance of modeling them jointly.Comment: To be appear in ECIR 202
Automated classification of pre-defined movement patterns:A comparison between GNSS and UWB technology
Advanced real-time location systems (RTLS) allow for collecting spatio-temporal data from human movement behaviours. Tracking individuals in small areas such as schoolyards or nursing homes might impose difficulties for RTLS in terms of positioning accuracy. However, to date, few studies have investigated the performance of different localisation systems regarding the classification of human movement patterns in small areas. The current study aims to design and evaluate an automated framework to classify human movement trajectories obtained from two different RTLS: Global Navigation Satellite System (GNSS) and Ultra-wideband (UWB), in areas of approximately 100 square meters. Specifically, we designed a versatile framework which takes GNSS or UWB data as input, extracts features from these data and classifies them according to the annotated spatial patterns. The automated framework contains three choices for applying noise removal: (i) no noise removal, (ii) Savitzky Golay filter on the raw location data or (iii) Savitzky Golay filter on the extracted features, as well as three choices regarding the classification algorithm: Decision Tree (DT), Random Forest (RF) or Support Vector Machine (SVM). We integrated different stages within the framework with the Sequential Model-Based Algorithm Configuration (SMAC) to perform automated hyperparameter optimisation. The best performance is achieved with a pipeline consisting of noise removal applied to the raw location data with an RF model for the GNSS and no noise removal with an SVM model for the UWB. We further demonstrate through statistical analysis that the UWB achieves significantly higher results than the GNSS in classifying movement patterns
Category-Aware Location Embedding for Point-of-Interest Recommendation
Recently, Point of interest (POI) recommendation has gained ever-increasing
importance in various Location-Based Social Networks (LBSNs). With the recent
advances of neural models, much work has sought to leverage neural networks to
learn neural embeddings in a pre-training phase that achieve an improved
representation of POIs and consequently a better recommendation. However,
previous studies fail to capture crucial information about POIs such as
categorical information.
In this paper, we propose a novel neural model that generates a POI embedding
incorporating sequential and categorical information from POIs. Our model
consists of a check-in module and a category module. The check-in module
captures the geographical influence of POIs derived from the sequence of users'
check-ins, while the category module captures the characteristics of POIs
derived from the category information. To validate the efficacy of the model,
we experimented with two large-scale LBSN datasets. Our experimental results
demonstrate that our approach significantly outperforms state-of-the-art POI
recommendation methods.Comment: 4 pages, 1 figure
- …