98 research outputs found
Interplay between telecommunications and face-to-face interactions - a study using mobile phone data
In this study we analyze one year of anonymized telecommunications data for
over one million customers from a large European cellphone operator, and we
investigate the relationship between people's calls and their physical
location. We discover that more than 90% of users who have called each other
have also shared the same space (cell tower), even if they live far apart.
Moreover, we find that close to 70% of users who call each other frequently (at
least once per month on average) have shared the same space at the same time -
an instance that we call co-location. Co-locations appear indicative of
coordination calls, which occur just before face-to-face meetings. Their number
is highly predictable based on the amount of calls between two users and the
distance between their home locations - suggesting a new way to quantify the
interplay between telecommunications and face-to-face interactions
GeoIntelligence: Data Mining Locational Social Media Content for Profiling and Information Gathering
The current social media landscape has resulted in a situation where people are encouraged to share a greater amount of information about their day-to-day lives than ever before. In this environment a large amount of personal data is disclosed in a public forum with little to no regard for the potential privacy impacts. This paper focuses on the presence of geographic data within images, metadata and individual postings. The GeoIntelligence project aims to aggregate this information to educate users on the possible implications of the utilisation of these services as well as providing service to law enforcement and business. This paper demonstrates the ability to profile users on an individual and group basis from data posted openly to social networking services
Digital neighborhoods
With the advent of ‘big data’ there is an increased interest in using social media to describe city dynamics. This paper employs geo-located social media data to identify ‘digital neighborhoods’ – those areas in the city where social media is used more often. Starting with geo-located Twitter and Foursquare data for the New York City region in 2014, we applied spatial clustering techniques to detect significant groupings or ‘neighborhoods’ where social media use is high or low. The results show that beyond the business districts, digital neighborhoods occur in communities undergoing shifting socio-demographics. Neighborhoods that are not digitally oriented tend to have higher proportion of minorities and lower incomes, highlighting a social–economic divide in how social media is used in the city. Understanding the differences in these neighborhoods can help city planners interested in generating economic development proposals, civic engagement strategies, and urban design ideas that target these areas
Fundamental structures of dynamic social networks
Social systems are in a constant state of flux with dynamics spanning from
minute-by-minute changes to patterns present on the timescale of years.
Accurate models of social dynamics are important for understanding spreading of
influence or diseases, formation of friendships, and the productivity of teams.
While there has been much progress on understanding complex networks over the
past decade, little is known about the regularities governing the
micro-dynamics of social networks. Here we explore the dynamic social network
of a densely-connected population of approximately 1000 individuals and their
interactions in the network of real-world person-to-person proximity measured
via Bluetooth, as well as their telecommunication networks, online social media
contacts, geo-location, and demographic data. These high-resolution data allow
us to observe social groups directly, rendering community detection
unnecessary. Starting from 5-minute time slices we uncover dynamic social
structures expressed on multiple timescales. On the hourly timescale, we find
that gatherings are fluid, with members coming and going, but organized via a
stable core of individuals. Each core represents a social context. Cores
exhibit a pattern of recurring meetings across weeks and months, each with
varying degrees of regularity. Taken together, these findings provide a
powerful simplification of the social network, where cores represent
fundamental structures expressed with strong temporal and spatial regularity.
Using this framework, we explore the complex interplay between social and
geospatial behavior, documenting how the formation of cores are preceded by
coordination behavior in the communication networks, and demonstrating that
social behavior can be predicted with high precision.Comment: Main Manuscript: 16 pages, 4 figures. Supplementary Information: 39
pages, 34 figure
GeoIntelligence: Data Mining Locational Social Media Content for Profiling and Information Gathering
The current social media landscape has resulted in a situation where people are encouraged to share a greater amount of information about their day-to-day lives than ever before. In this environment a large amount of personal data is disclosed in a public forum with little to no regard for the potential privacy impacts. This paper focuses on the presence of geographic data within images, metadata and individual postings. The GeoIntelligence project aims to aggregate this information to educate users on the possible implications of the utilisation of these services as well as providing service to law enforcement and business. This paper demonstrates the ability to profile users on an individual and group basis from data posted openly to social networking services
DeepCity: A Feature Learning Framework for Mining Location Check-ins
Online social networks being extended to geographical space has resulted in
large amount of user check-in data. Understanding check-ins can help to build
appealing applications, such as location recommendation. In this paper, we
propose DeepCity, a feature learning framework based on deep learning, to
profile users and locations, with respect to user demographic and location
category prediction. Both of the predictions are essential for social network
companies to increase user engagement. The key contribution of DeepCity is the
proposal of task-specific random walk which uses the location and user
properties to guide the feature learning to be specific to each prediction
task. Experiments conducted on 42M check-ins in three cities collected from
Instagram have shown that DeepCity achieves a superior performance and
outperforms other baseline models significantly
Can co-location be used as a proxy for face-to-face contacts?
Technological advances have led to a strong increase in the number of data
collection efforts aimed at measuring co-presence of individuals at different
spatial resolutions. It is however unclear how much co-presence data can inform
us on actual face-to-face contacts, of particular interest to study the
structure of a population in social groups or for use in data-driven models of
information or epidemic spreading processes. Here, we address this issue by
leveraging data sets containing high resolution face-to-face contacts as well
as a coarser spatial localisation of individuals, both temporally resolved, in
various contexts. The co-presence and the face-to-face contact temporal
networks share a number of structural and statistical features, but the former
is (by definition) much denser than the latter. We thus consider several
down-sampling methods that generate surrogate contact networks from the
co-presence signal and compare them with the real face-to-face data. We show
that these surrogate networks reproduce some features of the real data but are
only partially able to identify the most central nodes of the face-to-face
network. We then address the issue of using such down-sampled co-presence data
in data-driven simulations of epidemic processes, and in identifying efficient
containment strategies. We show that the performance of the various sampling
methods strongly varies depending on context. We discuss the consequences of
our results with respect to data collection strategies and methodologies
- …