749 research outputs found
Recommended from our members
The Parable of Google Flu: Traps in Big Data Analysis
Large errors in flu prediction were largely avoidable, which offers lessons for the use of big data. In February 2013, Google Flu Trends (GFT) made headlines but not for a reason that Google executives or the creators of the flu tracking system would have hoped. Nature reported that GFT was predicting more than double the proportion of doctor visits for influenza-like illness (ILI) than the Centers for Disease Control and Prevention (CDC), which bases its estimates on surveillance reports from laboratories across the United States ( 1, 2). This happened despite the fact that GFT was built to predict CDC reports. Given that GFT is often held up as an exemplary use of big data ( 3, 4), what lessons can we draw from this error?Other Research Uni
Cultural transmission and optimization dynamics
We study the one-dimensional version of Axelrod's model of cultural
transmission from the point of view of optimization dynamics. We show the
existence of a Lyapunov potential for the dynamics. The global minimum of the
potential, or optimum state, is the monocultural uniform state, which is
reached for an initial diversity of the population below a critical value.
Above this value, the dynamics settles in a multicultural or polarized state.
These multicultural attractors are not local minima of the potential, so that
any small perturbation initiates the search for the optimum state. Cultural
drift is modelled by such perturbations acting at a finite rate. If the noise
rate is small, the system reaches the optimum monocultural state. However, if
the noise rate is above a critical value, that depends on the system size,
noise sustains a polarized dynamical state.Comment: 11 pages, 10 figures include
Sustaining the Internet with Hyperbolic Mapping
The Internet infrastructure is severely stressed. Rapidly growing overheads
associated with the primary function of the Internet---routing information
packets between any two computers in the world---cause concerns among Internet
experts that the existing Internet routing architecture may not sustain even
another decade. Here we present a method to map the Internet to a hyperbolic
space. Guided with the constructed map, which we release with this paper,
Internet routing exhibits scaling properties close to theoretically best
possible, thus resolving serious scaling limitations that the Internet faces
today. Besides this immediate practical viability, our network mapping method
can provide a different perspective on the community structure in complex
networks
On Uniqueness of Boundary Blow-up Solutions of a Class of Nonlinear Elliptic Equations
We study boundary blow-up solutions of semilinear elliptic equations
with , or with , where is a second order
elliptic operator with measurable coefficients. Several uniqueness theorems and
an existence theorem are obtained.Comment: To appear in Comm. Partial Differential Equations; 10 page
Does \u2018bigger\u2019mean \u2018better\u2019? Pitfalls and shortcuts associated with big data for social research
\u2018Big data is here to stay.\u2019 This key statement has a double value: is an assumption as well as the reason why a theoretical reflection is needed. Furthermore, Big data is something that is gaining visibility and success in social sciences even, overcoming the division between humanities and computer sciences. In this contribution some considerations on the presence and the certain persistence of Big data as a socio-technical assemblage will be outlined. Therefore, the intriguing opportunities for social research linked to such interaction between practices and technological development will be developed. However, despite a promissory rhetoric, fostered by several scholars since the birth of Big data as a labelled concept, some risks are just around the corner. The claims for the methodological power of bigger and bigger datasets, as well as increasing speed in analysis and data collection, are creating a real hype in social research. Peculiar attention is needed in order to avoid some pitfalls. These risks will be analysed for what concerns the validity of the research results \u2018obtained through Big data. After a pars distruens, this contribution will conclude with a pars construens; assuming the previous critiques, a mixed methods research design approach will be described as a general proposal with the objective of stimulating a debate on the integration of Big data in complex research projecting
Mobile Communication Signatures of Unemployment
The mapping of populations socio-economic well-being is highly constrained by
the logistics of censuses and surveys. Consequently, spatially detailed changes
across scales of days, weeks, or months, or even year to year, are difficult to
assess; thus the speed of which policies can be designed and evaluated is
limited. However, recent studies have shown the value of mobile phone data as
an enabling methodology for demographic modeling and measurement. In this work,
we investigate whether indicators extracted from mobile phone usage can reveal
information about the socio-economical status of microregions such as districts
(i.e., average spatial resolution < 2.7km). For this we examine anonymized
mobile phone metadata combined with beneficiaries records from unemployment
benefit program. We find that aggregated activity, social, and mobility
patterns strongly correlate with unemployment. Furthermore, we construct a
simple model to produce accurate reconstruction of district level unemployment
from their mobile communication patterns alone. Our results suggest that
reliable and cost-effective economical indicators could be built based on
passively collected and anonymized mobile phone data. With similar data being
collected every day by telecommunication services across the world,
survey-based methods of measuring community socioeconomic status could
potentially be augmented or replaced by such passive sensing methods in the
future
Partisan Asymmetries in Online Political Activity
We examine partisan differences in the behavior, communication patterns and
social interactions of more than 18,000 politically-active Twitter users to
produce evidence that points to changing levels of partisan engagement with the
American online political landscape. Analysis of a network defined by the
communication activity of these users in proximity to the 2010 midterm
congressional elections reveals a highly segregated, well clustered partisan
community structure. Using cluster membership as a high-fidelity (87% accuracy)
proxy for political affiliation, we characterize a wide range of differences in
the behavior, communication and social connectivity of left- and right-leaning
Twitter users. We find that in contrast to the online political dynamics of the
2008 campaign, right-leaning Twitter users exhibit greater levels of political
activity, a more tightly interconnected social structure, and a communication
network topology that facilitates the rapid and broad dissemination of
political information.Comment: 17 pages, 10 figures, 6 table
Forecasting in the light of Big Data
Predicting the future state of a system has always been a natural motivation
for science and practical applications. Such a topic, beyond its obvious
technical and societal relevance, is also interesting from a conceptual point
of view. This owes to the fact that forecasting lends itself to two equally
radical, yet opposite methodologies. A reductionist one, based on the first
principles, and the naive inductivist one, based only on data. This latter view
has recently gained some attention in response to the availability of
unprecedented amounts of data and increasingly sophisticated algorithmic
analytic techniques. The purpose of this note is to assess critically the role
of big data in reshaping the key aspects of forecasting and in particular the
claim that bigger data leads to better predictions. Drawing on the
representative example of weather forecasts we argue that this is not generally
the case. We conclude by suggesting that a clever and context-dependent
compromise between modelling and quantitative analysis stands out as the best
forecasting strategy, as anticipated nearly a century ago by Richardson and von
Neumann
Validation of Dunbar's number in Twitter conversations
Modern society's increasing dependency on online tools for both work and
recreation opens up unique opportunities for the study of social interactions.
A large survey of online exchanges or conversations on Twitter, collected
across six months involving 1.7 million individuals is presented here. We test
the theoretical cognitive limit on the number of stable social relationships
known as Dunbar's number. We find that users can entertain a maximum of 100-200
stable relationships in support for Dunbar's prediction. The "economy of
attention" is limited in the online world by cognitive and biological
constraints as predicted by Dunbar's theory. Inspired by this empirical
evidence we propose a simple dynamical mechanism, based on finite priority
queuing and time resources, that reproduces the observed social behavior.Comment: 8 pages, 6 figure
Fibers and global geometry of functions
Since the seminal work of Ambrosetti and Prodi, the study of global folds was
enriched by geometric concepts and extensions accomodating new examples. We
present the advantages of considering fibers, a construction dating to Berger
and Podolak's view of the original theorem. A description of folds in terms of
properties of fibers gives new perspective to the usual hypotheses in the
subject. The text is intended as a guide, outlining arguments and stating
results which will be detailed elsewhere
- …