3,099 research outputs found
#Santiago is not #Chile, or is it? A Model to Normalize Social Media Impact
Online social networks are known to be demographically biased. Currently
there are questions about what degree of representativity of the physical
population they have, and how population biases impact user-generated content.
In this paper we focus on centralism, a problem affecting Chile. Assuming that
local differences exist in a country, in terms of vocabulary, we built a
methodology based on the vector space model to find distinctive content from
different locations, and use it to create classifiers to predict whether the
content of a micro-post is related to a particular location, having in mind a
geographically diverse selection of micro-posts. We evaluate them in a case
study where we analyze the virtual population of Chile that participated in the
Twitter social network during an event of national relevance: the municipal
(local governments) elections held in 2012. We observe that the participating
virtual population is spatially representative of the physical population,
implying that there is centralism in Twitter. Our classifiers out-perform a non
geographically-diverse baseline at the regional level, and have the same
accuracy at a provincial level. However, our approach makes assumptions that
need to be tested in multi-thematic and more general datasets. We leave this
for future work.Comment: Accepted in ChileCHI 2013, I Chilean Conference on Human-Computer
Interactio
Dealing with Integer-valued Variables in Bayesian Optimization with Gaussian Processes
Bayesian optimization (BO) methods are useful for optimizing functions that
are expensive to evaluate, lack an analytical expression and whose evaluations
can be contaminated by noise. These methods rely on a probabilistic model of
the objective function, typically a Gaussian process (GP), upon which an
acquisition function is built. This function guides the optimization process
and measures the expected utility of performing an evaluation of the objective
at a new point. GPs assume continous input variables. When this is not the
case, such as when some of the input variables take integer values, one has to
introduce extra approximations. A common approach is to round the suggested
variable value to the closest integer before doing the evaluation of the
objective. We show that this can lead to problems in the optimization process
and describe a more principled approach to account for input variables that are
integer-valued. We illustrate in both synthetic and a real experiments the
utility of our approach, which significantly improves the results of standard
BO methods on problems involving integer-valued variables.Comment: 7 page
First Women, Second Sex: Gender Bias in Wikipedia
Contributing to history has never been as easy as it is today. Anyone with
access to the Web is able to play a part on Wikipedia, an open and free
encyclopedia. Wikipedia, available in many languages, is one of the most
visited websites in the world and arguably one of the primary sources of
knowledge on the Web. However, not everyone is contributing to Wikipedia from a
diversity point of view; several groups are severely underrepresented. One of
those groups is women, who make up approximately 16% of the current contributor
community, meaning that most of the content is written by men. In addition,
although there are specific guidelines of verifiability, notability, and
neutral point of view that must be adhered by Wikipedia content, these
guidelines are supervised and enforced by men.
In this paper, we propose that gender bias is not about participation and
representation only, but also about characterization of women. We approach the
analysis of gender bias by defining a methodology for comparing the
characterizations of men and women in biographies in three aspects: meta-data,
language, and network structure. Our results show that, indeed, there are
differences in characterization and structure. Some of these differences are
reflected from the off-line world documented by Wikipedia, but other
differences can be attributed to gender bias in Wikipedia content. We
contextualize these differences in feminist theory and discuss their
implications for Wikipedia policy.Comment: 10 pages, ACM style. Author's version of a paper to be presented at
ACM Hypertext 201
Three-Body Continuum Wave Functions with a Box Boundary Condition
8 págs.; 4 figs.In this work we investigate the connection between discretized three-body continuum wave functions,
in particular via a box boundary condition, and thewave functions computed with the correct asymptotics.
The three-body wave functions are in both cases obtained by means of the adiabatic expansion method. The
information concerning all the possible incoming and outgoing channels, which appears naturally when the
continuum is not discretized, seems to be lost when the discretization is implemented. In this work we show
that both methods are fully equivalent, and the full information contained in the three-body wave function is
actually preserved in the discrete spectrum. Therefore, in those cases when the asymptotic behaviour is not
known analytically, i.e., when the Coulomb interaction is involved, the discretization technique can be safely
used. © Springer-Verlag Wien 2015This work was supported by funds provided by DGI of MINECO (Spain) under Contract No. FIS2011-23565Peer Reviewe
- …