372 research outputs found
New substitution bases for complexity classes
The set 0(), the 0 closure of F , is the closure with respect to substitution and concatenation recursion on notation of a set of basic functions comprehending the set F . By improving earlier work, we show that 0() is the substitution closure of a simple function set and characterize well\u2010known function complexity classes as the substitution closure of finite sets of simple functions
Searching for superspreaders of information in real-world social media
A number of predictors have been suggested to detect the most influential
spreaders of information in online social media across various domains such as
Twitter or Facebook. In particular, degree, PageRank, k-core and other
centralities have been adopted to rank the spreading capability of users in
information dissemination media. So far, validation of the proposed predictors
has been done by simulating the spreading dynamics rather than following real
information flow in social networks. Consequently, only model-dependent
contradictory results have been achieved so far for the best predictor. Here,
we address this issue directly. We search for influential spreaders by
following the real spreading dynamics in a wide range of networks. We find that
the widely-used degree and PageRank fail in ranking users' influence. We find
that the best spreaders are consistently located in the k-core across
dissimilar social platforms such as Twitter, Facebook, Livejournal and
scientific publishing in the American Physical Society. Furthermore, when the
complete global network structure is unavailable, we find that the sum of the
nearest neighbors' degree is a reliable local proxy for user's influence. Our
analysis provides practical instructions for optimal design of strategies for
"viral" information dissemination in relevant applications.Comment: 12 pages, 7 figure
Ultimate periodicity of b-recognisable sets : a quasilinear procedure
It is decidable if a set of numbers, whose representation in a base b is a
regular language, is ultimately periodic. This was established by Honkala in
1986.
We give here a structural description of minimal automata that accept an
ultimately periodic set of numbers. We then show that it can verified in linear
time if a given minimal automaton meets this description.
This thus yields a O(n log(n)) procedure for deciding whether a general
deterministic automaton accepts an ultimately periodic set of numbers.Comment: presented at DLT 201
Algorithmic statistics revisited
The mission of statistics is to provide adequate statistical hypotheses
(models) for observed data. But what is an "adequate" model? To answer this
question, one needs to use the notions of algorithmic information theory. It
turns out that for every data string one can naturally define
"stochasticity profile", a curve that represents a trade-off between complexity
of a model and its adequacy. This curve has four different equivalent
definitions in terms of (1)~randomness deficiency, (2)~minimal description
length, (3)~position in the lists of simple strings and (4)~Kolmogorov
complexity with decompression time bounded by busy beaver function. We present
a survey of the corresponding definitions and results relating them to each
other
Discussion quality diffuses in the digital public square
Studies of online social influence have demonstrated that friends have
important effects on many types of behavior in a wide variety of settings.
However, we know much less about how influence works among relative strangers
in digital public squares, despite important conversations happening in such
spaces. We present the results of a study on large public Facebook pages where
we randomly used two different methods--most recent and social feedback--to
order comments on posts. We find that the social feedback condition results in
higher quality viewed comments and response comments. After measuring the
average quality of comments written by users before the study, we find that
social feedback has a positive effect on response quality for both low and high
quality commenters. We draw on a theoretical framework of social norms to
explain this empirical result. In order to examine the influence mechanism
further, we measure the similarity between comments viewed and written during
the study, finding that similarity increases for the highest quality
contributors under the social feedback condition. This suggests that, in
addition to norms, some individuals may respond with increased relevance to
high-quality comments.Comment: 10 pages, 6 figures, 2 table
Game interpretation of Kolmogorov complexity
The Kolmogorov complexity function K can be relativized using any oracle A,
and most properties of K remain true for relativized versions. In section 1 we
provide an explanation for this observation by giving a game-theoretic
interpretation and showing that all "natural" properties are either true for
all sufficiently powerful oracles or false for all sufficiently powerful
oracles. This result is a simple consequence of Martin's determinacy theorem,
but its proof is instructive: it shows how one can prove statements about
Kolmogorov complexity by constructing a special game and a winning strategy in
this game. This technique is illustrated by several examples (total conditional
complexity, bijection complexity, randomness extraction, contrasting plain and
prefix complexities).Comment: 11 pages. Presented in 2009 at the conference on randomness in
Madison
Exploring the Complex Pattern of Information Spreading in Online Blog Communities
Information spreading in online social communities has attracted tremendous attention due to its utmost practical values in applications. Despite that several individual-level diffusion data have been investigated, we still lack the detailed understanding of the spreading pattern of information. Here, by comparing information flows and social links in a blog community, we find that the diffusion processes are induced by three different spreading mechanisms: social spreading, self-promotion and broadcast. Although numerous previous studies have employed epidemic spreading models to simulate information diffusion, we observe that such models fail to reproduce the realistic diffusion pattern. In respect to users behaviors, strikingly, we find that most users would stick to one specific diffusion mechanism. Moreover, our observations indicate that the social spreading is not only crucial for the structure of diffusion trees, but also capable of inducing more subsequent individuals to acquire the information. Our findings suggest new directions for modeling of information diffusion in social systems, and could inform design of efficient propagation strategies based on users behaviors
An output-sensitive algorithm for the minimization of 2-dimensional String Covers
String covers are a powerful tool for analyzing the quasi-periodicity of
1-dimensional data and find applications in automata theory, computational
biology, coding and the analysis of transactional data. A \emph{cover} of a
string is a string for which every letter of lies within some
occurrence of . String covers have been generalized in many ways, leading to
\emph{k-covers}, \emph{-covers}, \emph{approximate covers} and were
studied in different contexts such as \emph{indeterminate strings}.
In this paper we generalize string covers to the context of 2-dimensional
data, such as images. We show how they can be used for the extraction of
textures from images and identification of primitive cells in lattice data.
This has interesting applications in image compression, procedural terrain
generation and crystallography
Two-dimensional ranking of Wikipedia articles
The Library of Babel, described by Jorge Luis Borges, stores an enormous
amount of information. The Library exists {\it ab aeterno}. Wikipedia, a free
online encyclopaedia, becomes a modern analogue of such a Library. Information
retrieval and ranking of Wikipedia articles become the challenge of modern
society. While PageRank highlights very well known nodes with many ingoing
links, CheiRank highlights very communicative nodes with many outgoing links.
In this way the ranking becomes two-dimensional. Using CheiRank and PageRank we
analyze the properties of two-dimensional ranking of all Wikipedia English
articles and show that it gives their reliable classification with rich and
nontrivial features. Detailed studies are done for countries, universities,
personalities, physicists, chess players, Dow-Jones companies and other
categories.Comment: RevTex 9 pages, data, discussion added, more data at
http://www.quantware.ups-tlse.fr/QWLIB/2drankwikipedia
Worldwide spreading of economic crisis
We model the spreading of a crisis by constructing a global economic network
and applying the Susceptible-Infected-Recovered (SIR) epidemic model with a
variable probability of infection. The probability of infection depends on the
strength of economic relations between the pair of countries, and the strength
of the target country. It is expected that a crisis which originates in a large
country, such as the USA, has the potential to spread globally, like the recent
crisis. Surprisingly we show that also countries with much lower GDP, such as
Belgium, are able to initiate a global crisis. Using the {\it k}-shell
decomposition method to quantify the spreading power (of a node), we obtain a
measure of ``centrality'' as a spreader of each country in the economic
network. We thus rank the different countries according to the shell they
belong to, and find the 12 most central countries. These countries are the most
likely to spread a crisis globally. Of these 12 only six are large economies,
while the other six are medium/small ones, a result that could not have been
otherwise anticipated. Furthermore, we use our model to predict the crisis
spreading potential of countries belonging to different shells according to the
crisis magnitude.Comment: 13 pages, 4 figures and Supplementary Materia
- …