15,748 research outputs found
Cross correlations of the American baby names
The quantitative description of cultural evolution is a challenging task. The
most difficult part of the problem is probably to find the appropriate
measurable quantities that can make more quantitative such evasive concepts as,
for example, dynamics of cultural movements, behavior patterns and traditions
of the people. A strategy to tackle this issue is to observe particular
features of human activities, i.e. cultural traits, such as names given to
newborns. We study the names of babies born in the United States of America
from 1910 to 2012. Our analysis shows that groups of different correlated
states naturally emerge in different epochs, and we are able to follow and
decrypt their evolution. While these groups of states are stable across many
decades, a sudden reorganization occurs in the last part of the twentieth
century. We think that this kind of quantitative analysis can be possibly
extended to other cultural traits: although databases covering more than one
century (as the one we used) are rare, the cultural evolution on shorter time
scales can be studied thanks to the fact that many human activities are usually
recorded in the present digital era.Comment: submitted for consideration to PNA
Fertility and its Meaning: Evidence from Search Behavior
Fertility choices are linked to the different preferences and constraints of
individuals and couples, and vary importantly by socio-economic status, as well
by cultural and institutional context. The meaning of childbearing and
child-rearing, therefore, differs between individuals and across groups. In
this paper, we combine data from Google Correlate and Google Trends for the
U.S. with ground truth data from the American Community Survey to derive new
insights into fertility and its meaning. First, we show that Google Correlate
can be used to illustrate socio-economic differences on the circumstances
around pregnancy and birth: e.g., searches for "flying while pregnant" are
linked to high income fertility, and "paternity test" are linked to non-marital
fertility. Second, we combine several search queries to build predictive models
of regional variation in fertility, explaining about 75% of the variance.
Third, we explore if aggregated web search data can also be used to model
fertility trends.Comment: This is a preprint of a short paper accepted at ICWSM'17. Please cite
that version instea
Inferring processes of cultural transmission: the critical role of rare variants in distinguishing neutrality from novelty biases
Neutral evolution assumes that there are no selective forces distinguishing
different variants in a population. Despite this striking assumption, many
recent studies have sought to assess whether neutrality can provide a good
description of different episodes of cultural change. One approach has been to
test whether neutral predictions are consistent with observed progeny
distributions, recording the number of variants that have produced a given
number of new instances within a specified time interval: a classic example is
the distribution of baby names. Using an overlapping generations model we show
that these distributions consist of two phases: a power law phase with a
constant exponent of -3/2, followed by an exponential cut-off for variants with
very large numbers of progeny. Maximum likelihood estimations of the model
parameters provide a direct way to establish whether observed empirical
patterns are consistent with neutral evolution. We apply our approach to a
complete data set of baby names from Australia. Crucially we show that analyses
based on only the most popular variants, as is often the case in studies of
cultural evolution, can provide misleading evidence for underlying transmission
hypotheses. While neutrality provides a plausible description of progeny
distributions of abundant variants, rare variants deviate from neutrality.
Further, we develop a simulation framework that allows for the detection of
alternative cultural transmission processes. We show that anti-novelty bias is
able to replicate the complete progeny distribution of the Australian data set
Gender Differences in Equity Crowdfunding
Online peer-to-peer investment platforms are increasingly popular venues for entrepreneurs and investors to engage in financial transactions without the involvement of banks and loan managers. Despite their purported transparency and lack of bias, it is unclear whether social inequalities present in traditional capital markets transfer to these platforms as well, impeding their hoped revolutionary potential. In this paper we analyze nearly four years' worth of data from one of the leading UK-based equity crowdfunding platforms. Specifically, we investigate gender-related differences in patterns of entrepreneurship, investment, and success. In agreement with offline trends, men have more activity on the platform. Yet, women entrepreneurs benefit of higher success rates in fund-raising, a finding that mimics trends seen on some rewards-based crowdfunding platforms. Surprisingly, we also find that female investors tend to choose campaigns that have lower success rates. Our findings contribute to a better understanding of gender-related discrepancies in success on the online capital market and point to differences in activity that are key factors in the apparent patterns of gender inequality
Neutral evolution and turnover over centuries of English word popularity
Here we test Neutral models against the evolution of English word frequency
and vocabulary at the population scale, as recorded in annual word frequencies
from three centuries of English language books. Against these data, we test
both static and dynamic predictions of two neutral models, including the
relation between corpus size and vocabulary size, frequency distributions, and
turnover within those frequency distributions. Although a commonly used Neutral
model fails to replicate all these emergent properties at once, we find that
modified two-stage Neutral model does replicate the static and dynamic
properties of the corpus data. This two-stage model is meant to represent a
relatively small corpus (population) of English books, analogous to a `canon',
sampled by an exponentially increasing corpus of books in the wider population
of authors. More broadly, this mode -- a smaller neutral model within a larger
neutral model -- could represent more broadly those situations where mass
attention is focused on a small subset of the cultural variants.Comment: 12 pages, 5 figures, 1 tabl
- …