369 research outputs found
Measuring Generalized Preferential Attachment in Dynamic Social Networks
The mechanism of preferential attachment underpins most recent social network
formation models. Yet few authors attempt to check or quantify assumptions on
this mechanism. We call generalized preferential attachment any kind of
preference to interact with other agents with respect to any node property. We
then introduce tools for measuring empirically and characterizing
comprehensively such phenomena, and apply these tools to a socio-semantic
network of scientific collaborations, investigating in particular homophilic
behavior. This opens the way to a whole class of realistic and credible social
network morphogenesis models.Comment: 9 pages, 6 figures (v2: added property correlation measures, and
various remarks
Binding Social and Cultural Networks: A Model
Until now, most studies carried onto social or semantic networks have
considered each of these networks independently. Our goal here is to bring a
formal frame for studying both networks empirically as well as to point out
stylized facts that would explain their reciprocal influence and the emergence
of clusters of agents, which may also be regarded as ''cultural cliques''. We
show how to apply the Galois lattice theory to the modeling of the coevolution
of social and conceptual networks, and the characterization of cultural
communities. Basing our approach on Barabasi-Albert's models, we however extend
the usual preferential attachment probability in order to take into account the
reciprocal influence of both networks, therefore introducing the notion of dual
distance. In addition to providing a theoretic frame we draw here a program of
empirical tests which should give root to a more analytical model and the
consequent simulation and validation. In a broader view, adopting and actually
implementing the paradigm of cultural epidemiology, we could proceed further
with the study of knowledge diffusion and explain how the social network
structure affects concept propagation and in return how concept propagation
affects the social network.Comment: 8 pages, 3 figures (v2: typos, minor corrections in section 3.2) (v3:
examples, figures added
Symbolic regression of generative network models
Networks are a powerful abstraction with applicability to a variety of
scientific fields. Models explaining their morphology and growth processes
permit a wide range of phenomena to be more systematically analysed and
understood. At the same time, creating such models is often challenging and
requires insights that may be counter-intuitive. Yet there currently exists no
general method to arrive at better models. We have developed an approach to
automatically detect realistic decentralised network growth models from
empirical data, employing a machine learning technique inspired by natural
selection and defining a unified formalism to describe such models as computer
programs. As the proposed method is completely general and does not assume any
pre-existing models, it can be applied "out of the box" to any given network.
To validate our approach empirically, we systematically rediscover pre-defined
growth laws underlying several canonical network generation models and credible
laws for diverse real-world networks. We were able to find programs that are
simple enough to lead to an actual understanding of the mechanisms proposed,
namely for a simple brain and a social network
Scaling in transportation networks
Subway systems span most large cities, and railway networks most countries in
the world. These networks are fundamental in the development of countries and
their cities, and it is therefore crucial to understand their formation and
evolution. However, if the topological properties of these networks are fairly
well understood, how they relate to population and socio-economical properties
remains an open question. We propose here a general coarse-grained approach,
based on a cost-benefit analysis that accounts for the scaling properties of
the main quantities characterizing these systems (the number of stations, the
total length, and the ridership) with the substrate's population, area and
wealth. More precisely, we show that the length, number of stations and
ridership of subways and rail networks can be estimated knowing the area,
population and wealth of the underlying region. These predictions are in good
agreement with data gathered for about subway systems and more than
railway networks in the world. We also show that train networks and subway
systems can be described within the same framework, but with a fundamental
difference: while the interstation distance seems to be constant and determined
by the typical walking distance for subways, the interstation distance for
railways scales with the number of stations.Comment: 8 pages, 6 figures, 1 table. To appear in PLoS On
Intrinsically Dynamic Network Communities
Community finding algorithms for networks have recently been extended to
dynamic data. Most of these recent methods aim at exhibiting community
partitions from successive graph snapshots and thereafter connecting or
smoothing these partitions using clever time-dependent features and sampling
techniques. These approaches are nonetheless achieving longitudinal rather than
dynamic community detection. We assume that communities are fundamentally
defined by the repetition of interactions among a set of nodes over time.
According to this definition, analyzing the data by considering successive
snapshots induces a significant loss of information: we suggest that it blurs
essentially dynamic phenomena - such as communities based on repeated
inter-temporal interactions, nodes switching from a community to another across
time, or the possibility that a community survives while its members are being
integrally replaced over a longer time period. We propose a formalism which
aims at tackling this issue in the context of time-directed datasets (such as
citation networks), and present several illustrations on both empirical and
synthetic dynamic networks. We eventually introduce intrinsically dynamic
metrics to qualify temporal community structure and emphasize their possible
role as an estimator of the quality of the community detection - taking into
account the fact that various empirical contexts may call for distinct
`community' definitions and detection criteria.Comment: 27 pages, 11 figure
How Realistic Should Knowledge Diffusion Models Be?
Knowledge diffusion models typically involve two main features: an underlying social network topology on one side, and a particular design of interaction rules driving knowledge transmission on the other side. Acknowledging the need for realistic topologies and adoption behaviors backed by empirical measurements, it becomes unclear how accurately existing models render real-world phenomena: if indeed both topology and transmission mechanisms have a key impact on these phenomena, to which extent does the use of more or less stylized assumptions affect modeling results? In order to evaluate various classical topologies and mechanisms, we push the comparison to more empirical benchmarks: real-world network structures and empirically measured mechanisms. Special attention is paid to appraising the discrepancy between diffusion phenomena (i) on some real network topologies vs. various kinds of scale-free networks, and (ii) using an empirically-measured transmission mechanism, compared with canonical appropriate models such as threshold models. We find very sensible differences between the more realistic settings and their traditional stylized counterparts. On the whole, our point is thus also epistemological by insisting that models should be tested against simulation-based empirical benchmarks.Agent-Based Simulation, Complex Systems, Empirical Calibration and Validation, Knowledge Diffusion, Model Comparison, Social Networks
Precursors and Laggards: An Analysis of Semantic Temporal Relationships on a Blog Network
We explore the hypothesis that it is possible to obtain information about the
dynamics of a blog network by analysing the temporal relationships between
blogs at a semantic level, and that this type of analysis adds to the knowledge
that can be extracted by studying the network only at the structural level of
URL links. We present an algorithm to automatically detect fine-grained
discussion topics, characterized by n-grams and time intervals. We then propose
a probabilistic model to estimate the temporal relationships that blogs have
with one another. We define the precursor score of blog A in relation to blog B
as the probability that A enters a new topic before B, discounting the effect
created by asymmetric posting rates. Network-level metrics of precursor and
laggard behavior are derived from these dyadic precursor score estimations.
This model is used to analyze a network of French political blogs. The scores
are compared to traditional link degree metrics. We obtain insights into the
dynamics of topic participation on this network, as well as the relationship
between precursor/laggard and linking behaviors. We validate and analyze
results with the help of an expert on the French blogosphere. Finally, we
propose possible applications to the improvement of search engine ranking
algorithms
Generating constrained random graphs using multiple edge switches
The generation of random graphs using edge swaps provides a reliable method
to draw uniformly random samples of sets of graphs respecting some simple
constraints, e.g. degree distributions. However, in general, it is not
necessarily possible to access all graphs obeying some given con- straints
through a classical switching procedure calling on pairs of edges. We therefore
propose to get round this issue by generalizing this classical approach through
the use of higher-order edge switches. This method, which we denote by "k-edge
switching", makes it possible to progres- sively improve the covered portion of
a set of constrained graphs, thereby providing an increasing, asymptotically
certain confidence on the statistical representativeness of the obtained
sample.Comment: 15 page
Algorithmic Distortion of Informational Landscapes
The possible impact of algorithmic recommendation on the autonomy and free
choice of Internet users is being increasingly discussed, especially in terms
of the rendering of information and the structuring of interactions. This paper
aims at reviewing and framing this issue along a double dichotomy. The first
one addresses the discrepancy between users' intentions and actions (1) under
some algorithmic influence and (2) without it. The second one distinguishes
algorithmic biases on (1) prior information rearrangement and (2) posterior
information arrangement. In all cases, we focus on and differentiate situations
where algorithms empirically appear to expand the cognitive and social horizon
of users, from those where they seem to limit that horizon. We additionally
suggest that these biases may not be properly appraised without taking into
account the underlying social processes which algorithms are building upon
Préliminaires à l'étude micro-économique de la taxation des carburants
La notion d'élasticité de la consommation au prix des carburants permet d'appréhender quantitativement l'effet de la taxation sur la consommation des agents. Jusqu'ici, ce paramètre a principalement été estimé dans une perspective macro-économique, sans différenciation suivant les diverses catégories d'agents. Dans cet article, en s'appuyant aussi bien sur des résultats concrets que sur des arguments théoriques, nous exhibons une typologie des ménages pour laquelle l'élasticité varie sensiblement en fonction de critères comme par exemple le revenu, ou bien l'habitat. Ceci nous amène à proposer une réflexion méthodologique sur la notion d'élasticité et, plus largement, à souligner certains effets adverses de la taxation des carburants.élasticité de la demande au prix; TIPP; taxation; comportement des agents; typologie des ménages; réduction des émissions de carbone; micro-économie
- …