54,241 research outputs found
Mining large-scale human mobility data for long-term crime prediction
Traditional crime prediction models based on census data are limited, as they
fail to capture the complexity and dynamics of human activity. With the rise of
ubiquitous computing, there is the opportunity to improve such models with data
that make for better proxies of human presence in cities. In this paper, we
leverage large human mobility data to craft an extensive set of features for
crime prediction, as informed by theories in criminology and urban studies. We
employ averaging and boosting ensemble techniques from machine learning, to
investigate their power in predicting yearly counts for different types of
crimes occurring in New York City at census tract level. Our study shows that
spatial and spatio-temporal features derived from Foursquare venues and
checkins, subway rides, and taxi rides, improve the baseline models relying on
census and POI data. The proposed models achieve absolute R^2 metrics of up to
65% (on a geographical out-of-sample test set) and up to 89% (on a temporal
out-of-sample test set). This proves that, next to the residential population
of an area, the ambient population there is strongly predictive of the area's
crime levels. We deep-dive into the main crime categories, and find that the
predictive gain of the human dynamics features varies across crime types: such
features bring the biggest boost in case of grand larcenies, whereas assaults
are already well predicted by the census features. Furthermore, we identify and
discuss top predictive features for the main crime categories. These results
offer valuable insights for those responsible for urban policy or law
enforcement
Couples’ places of meeting in late 20th century Britain: class, continuity and change
This article examines couples’ places or contexts of meeting in the second half of the 20th century in Great Britain, utilizing a typology developed by Bozon and Héran. The continuities are as striking as the changes, with social networks maintaining a consistent level of importance, but with trends towards meeting at places of education and work, and away from meeting in public places for drinking, eating or socializing. Rather than reflecting the impact of the rise of individualism and self-identity, these trends arguably reflect the changing importance of settings within people's daily lives, as may the recent growth in internet dating. Social class appears to have become more strongly related to the likelihood of meeting in ‘public’ settings, apparently more common in Britain than elsewhere. Achieved characteristics, especially occupational class, have a greater impact than parental class. Variations between place of meeting categories in the extent of occupational class homogamy appear to reflect levels of class homogeneity within settings more than the impact of either individualism or a homogamy norm. Regional variations in places of meeting highlight the ongoing importance of structural factors such as patterns of sociability or cultural norms
Great cities look small
Great cities connect people; failed cities isolate people. Despite the
fundamental importance of physical, face-to-face social-ties in the functioning
of cities, these connectivity networks are not explicitly observed in their
entirety. Attempts at estimating them often rely on unrealistic
over-simplifications such as the assumption of spatial homogeneity. Here we
propose a mathematical model of human interactions in terms of a local strategy
of maximising the number of beneficial connections attainable under the
constraint of limited individual travelling-time budgets. By incorporating
census and openly-available online multi-modal transport data, we are able to
characterise the connectivity of geometrically and topologically complex
cities. Beyond providing a candidate measure of greatness, this model allows
one to quantify and assess the impact of transport developments, population
growth, and other infrastructure and demographic changes on a city. Supported
by validations of GDP and HIV infection rates across United States metropolitan
areas, we illustrate the effect of changes in local and city-wide
connectivities by considering the economic impact of two contemporary inter-
and intra-city transport developments in the United Kingdom: High Speed Rail 2
and London Crossrail. This derivation of the model suggests that the scaling of
different urban indicators with population size has an explicitly mechanistic
origin.Comment: 19 pages, 8 figure
Examining the relationship between different urbanization settings, smartphone use to access the Internet and trip frequencies
No abstract available
Recommended from our members
A conceptual framework for studying collective reactions to events in location-based social media
Events are a core concept of spatial information, but location-based social media (LBSM) provide information on reactions to events. Individuals have varied degrees of agency in initiating, reacting to or modifying the course of events, and reactions include observations of occurrence, expressions containing sentiment or emotions, or a call to action. Key characteristics of reactions include referent events and information about who reacted, when, where and how. Collective reactions are composed of multiple individual reactions sharing common referents. They can be characterized according to the following dimensions: spatial, temporal, social, thematic and interlinkage. We present a conceptual framework, which allows characterization and comparison of collective reactions. For a thematically well-defined class of event such as storms, we can explore differences and similarities in collective attribution of meaning across space and time. Other events may have very complex spatio-temporal signatures (e.g. political processes such as Brexit or elections), which can be decomposed into series of individual events (e.g. a temporal window around the result of a vote). The purpose of our framework is to explore ways in which collective reactions to events in LBSM can be described and underpin the development of methods for analysing and understanding collective reactions to events
Recommended from our members
Re-thinking households - using administrative data to count and classify households with some application
Households rather than individuals are being increasingly used for research and to target and evaluate public policy. As a result accurate and timely household level statistics have become an increasing necessity especially at local level. However, present sources of information on households are fragmented with significant gaps and inaccuracies that limit their usefulness. This paper reviews present statistical arrangements and then describes a new approach to data collection and household classification based on local administrative sources. The result is a more integrated and flexible system. The utility and advantages are demonstrated using recent examples from the six Olympic London Boroughs
A Multi-Gene Genetic Programming Application for Predicting Students Failure at School
Several efforts to predict student failure rate (SFR) at school accurately
still remains a core problem area faced by many in the educational sector. The
procedure for forecasting SFR are rigid and most often times require data
scaling or conversion into binary form such as is the case of the logistic
model which may lead to lose of information and effect size attenuation. Also,
the high number of factors, incomplete and unbalanced dataset, and black boxing
issues as in Artificial Neural Networks and Fuzzy logic systems exposes the
need for more efficient tools. Currently the application of Genetic Programming
(GP) holds great promises and has produced tremendous positive results in
different sectors. In this regard, this study developed GPSFARPS, a software
application to provide a robust solution to the prediction of SFR using an
evolutionary algorithm known as multi-gene genetic programming. The approach is
validated by feeding a testing data set to the evolved GP models. Result
obtained from GPSFARPS simulations show its unique ability to evolve a suitable
failure rate expression with a fast convergence at 30 generations from a
maximum specified generation of 500. The multi-gene system was also able to
minimize the evolved model expression and accurately predict student failure
rate using a subset of the original expressionComment: 14 pages, 9 figures, Journal paper. arXiv admin note: text overlap
with arXiv:1403.0623 by other author
- …