1,703 research outputs found
Time-delayed collective flow diffusion models for inferring latent people flow from aggregated data at limited locations
The rapid adoption of wireless sensor devices has made it easier to record location information of people in a variety of spaces (e.g., exhibition halls). Location information is often aggregated due to privacy and/or cost concerns. The aggregated data we use as input consist of the numbers of incoming and outgoing people at each location and at each time step. Since the aggregated data lack tracking information of individuals, determining the flow of people between locations is not straightforward. In this article, we address the problem of inferring latent people flows, that is, transition populations between locations, from just aggregated population data gathered from observed locations. Existing models assume that everyone is always in one of the observed locations at every time step; this, however, is an unrealistic assumption, because we do not always have a large enough number of sensor devices to cover the large-scale spaces targeted. To overcome this drawback, we propose a probabilistic model with flow conservation constraints that incorporate travel duration distributions between observed locations. To handle noisy settings, we adopt noisy observation models for the numbers of incoming and outgoing people, where the noise is regarded as a factor that may disturb flow conservation, e.g., people may appear in or disappear from the predefined space of interest. We develop an approximate expectation-maximization (EM) algorithm that simultaneously estimates transition populations and model parameters. Our experiments demonstrate the effectiveness of the proposed model on real-world datasets of pedestrian data in exhibition halls, bike trip data and taxi trip data in New York City
Deep Gravity: enhancing mobility flows generation with deep neural networks and geographic information
The movements of individuals within and among cities influence key aspects of
our society, such as the objective and subjective well-being, the diffusion of
innovations, the spreading of epidemics, and the quality of the environment.
For this reason, there is increasing interest around the challenging problem of
flow generation, which consists in generating the flows between a set of
geographic locations, given the characteristics of the locations and without
any information about the real flows. Existing solutions to flow generation are
mainly based on mechanistic approaches, such as the gravity model and the
radiation model, which suffer from underfitting and overdispersion, neglect
important variables such as land use and the transportation network, and cannot
describe non-linear relationships between these variables. In this paper, we
propose the Multi-Feature Deep Gravity (MFDG) model as an effective solution to
flow generation. On the one hand, the MFDG model exploits a large number of
variables (e.g., characteristics of land use and the road network; transport,
food, and health facilities) extracted from voluntary geographic information
data (OpenStreetMap). On the other hand, our model exploits deep neural
networks to describe complex non-linear relationships between those variables.
Our experiments, conducted on commuting flows in England, show that the MFDG
model achieves a significant increase in the performance (up to 250\% for
highly populated areas) than mechanistic models that do not use deep neural
networks, or that do not exploit geographic voluntary data. Our work presents a
precise definition of the flow generation problem, which is a novel task for
the deep learning community working with spatio-temporal data, and proposes a
deep neural network model that significantly outperforms current
state-of-the-art statistical models
A Survey of Location Prediction on Twitter
Locations, e.g., countries, states, cities, and point-of-interests, are
central to news, emergency events, and people's daily lives. Automatic
identification of locations associated with or mentioned in documents has been
explored for decades. As one of the most popular online social network
platforms, Twitter has attracted a large number of users who send millions of
tweets on daily basis. Due to the world-wide coverage of its users and
real-time freshness of tweets, location prediction on Twitter has gained
significant attention in recent years. Research efforts are spent on dealing
with new challenges and opportunities brought by the noisy, short, and
context-rich nature of tweets. In this survey, we aim at offering an overall
picture of location prediction on Twitter. Specifically, we concentrate on the
prediction of user home locations, tweet locations, and mentioned locations. We
first define the three tasks and review the evaluation metrics. By summarizing
Twitter network, tweet content, and tweet context as potential inputs, we then
structurally highlight how the problems depend on these inputs. Each dependency
is illustrated by a comprehensive review of the corresponding strategies
adopted in state-of-the-art approaches. In addition, we also briefly review two
related problems, i.e., semantic location prediction and point-of-interest
recommendation. Finally, we list future research directions.Comment: Accepted to TKDE. 30 pages, 1 figur
Recommended from our members
Efficient Algorithms for Robust Spatiotemporal Data Analysis
Many large-scale data analysis applications involve data that can vary over both time and space. Often the primary goal of analyzing spatiotemporal data is identifying trends, movements, and sudden changes with respect to time, location, or both. This can include a variety of applications in economics (housing prices, unemployment, job movement, etc), city planning (traffic, power consumption, resource allocation, etc), and ecology (migration patterns, species variety, habitat change, etc). Like many domains, one of the major challenges of spatiotemporal data is dealing with noise and missing or untrustworthy observations. These uncertainties make it difficult to ascertain the distinct roles that changes in time and location have on the data. To this end, I have developed two different approaches for dealing with data uncertainty in different spatiotemporal applications. The first approach, dubbed the Quantile Scan algorithm, makes use of quantile regression to more accurately identify anomalous regions in the data. The flexibility of this framework allows ‘anomalies’ to be defined with respect to any quantile of interest. I develop a version of the Quantile Scan algorithm for analyzing spatial, and spatiotemporal data. The second approach is a unique variation of Collective Graphical Models (CGMs) to incorporate multiple views of the data. This multiview model learns and leverages shared information between the views to better compensate for missing observations. Both the Quantile Scan and Multiview CGM algorithms improve accuracy and robustness on noisy data without sacrificing runtime. The speed and accuracy of these models is demonstrated on a variety of synthetic and real-world datasets, compared against existing algorithms
Brain connectivity analysis: a short survey
This short survey the reviews recent literature on brain connectivity studies. It encompasses all forms of static and dynamic
connectivity whether anatomical, functional, or effective. The last decade has seen an ever increasing number of studies devoted
to deduce functional or effective connectivity, mostly from functional neuroimaging experiments. Resting state conditions have
become a dominant experimental paradigm, and a number of resting state networks, among them the prominent default mode
network, have been identified. Graphical models represent a convenient vehicle to formalize experimental findings and to closely
and quantitatively characterize the various networks identified. Underlying these abstract concepts are anatomical networks, the
so-called connectome, which can be investigated by functional imaging techniques as well. Future studies have to bridge the gap between anatomical neuronal connections and related functional or effective connectivities
Understanding Mobility and Transport Modal Disparities Using Emerging Data Sources: Modelling Potentials and Limitations
Transportation presents a major challenge to curb climate change due in part to its ever-increasing travel demand. Better informed policy-making requires up-to-date empirical mobility data to model viable mitigation options for reducing emissions from the transport sector. On the one hand, the prevalence of digital technologies enables a large-scale collection of human mobility traces, providing big potentials for improving the understanding of mobility patterns and transport modal disparities. On the other hand, the advancement in data science has allowed us to continue pushing the boundary of the potentials and limitations, for new uses of big data in transport.This thesis uses emerging data sources, including Twitter data, traffic data, OpenStreetMap (OSM), and trip data from new transport modes, to enhance the understanding of mobility and transport modal disparities, e.g., how car and public transit support mobility differently. Specifically, this thesis aims to answer two research questions: (1) What are the potentials and limitations of using these emerging data sources for modelling mobility? (2) How can these new data sources be properly modelled for characterising transport modal disparities? Papers I-III model mobility mainly using geotagged social media data, and reveal the potentials and limitations of this data source by validating against established sources (Q1). Papers IV-V combine multiple data sources to characterise transport modal disparities (Q2) which further demonstrate the modelling potentials of the emerging data sources (Q1).Despite a biased population representation and low and irregular sampling of the actual mobility, the geolocations of Twitter data can be used in models to produce good agreements with the other data sources on the fundamental characteristics of individual and population mobility. However, its feasibility for estimating travel demand depends on spatial scale, sparsity, sampling method, and sample size. To extend the use of social media data, this thesis develops two novel approaches to address the sparsity issue: (1) An individual-based mobility model that fills the gaps in the sparse mobility traces for synthetic travel demand; (2) A population-based model that uses Twitter geolocations as attractions instead of trips for estimating the flows of people between regions. This thesis also presents two reproducible data fusion frameworks for characterising transport modal disparities. They demonstrate the power of combining different data sources to gain new insights into the spatiotemporal patterns of travel time disparities between car and public transit, and the competition between ride-sourcing and public transport
Advances in Object and Activity Detection in Remote Sensing Imagery
The recent revolution in deep learning has enabled considerable development in the fields of object and activity detection. Visual object detection tries to find objects of target classes with precise localisation in an image and assign each object instance a corresponding class label. At the same time, activity recognition aims to determine the actions or activities of an agent or group of agents based on sensor or video observation data. It is a very important and challenging problem to detect, identify, track, and understand the behaviour of objects through images and videos taken by various cameras. Together, objects and their activity recognition in imaging data captured by remote sensing platforms is a highly dynamic and challenging research topic. During the last decade, there has been significant growth in the number of publications in the field of object and activity recognition. In particular, many researchers have proposed application domains to identify objects and their specific behaviours from air and spaceborne imagery. This Special Issue includes papers that explore novel and challenging topics for object and activity detection in remote sensing images and videos acquired by diverse platforms
- …