12,294 research outputs found
Structure and Dynamics of Information Pathways in Online Media
Diffusion of information, spread of rumors and infectious diseases are all
instances of stochastic processes that occur over the edges of an underlying
network. Many times networks over which contagions spread are unobserved, and
such networks are often dynamic and change over time. In this paper, we
investigate the problem of inferring dynamic networks based on information
diffusion data. We assume there is an unobserved dynamic network that changes
over time, while we observe the results of a dynamic process spreading over the
edges of the network. The task then is to infer the edges and the dynamics of
the underlying network.
We develop an on-line algorithm that relies on stochastic convex optimization
to efficiently solve the dynamic network inference problem. We apply our
algorithm to information diffusion among 3.3 million mainstream media and blog
sites and experiment with more than 179 million different pieces of information
spreading over the network in a one year period. We study the evolution of
information pathways in the online media space and find interesting insights.
Information pathways for general recurrent topics are more stable across time
than for on-going news events. Clusters of news media sites and blogs often
emerge and vanish in matter of days for on-going news events. Major social
movements and events involving civil population, such as the Libyan's civil war
or Syria's uprise, lead to an increased amount of information pathways among
blogs as well as in the overall increase in the network centrality of blogs and
social media sites.Comment: To Appear at the 6th International Conference on Web Search and Data
Mining (WSDM '13
Postmortem Analysis of Decayed Online Social Communities: Cascade Pattern Analysis and Prediction
Recently, many online social networks, such as MySpace, Orkut, and
Friendster, have faced inactivity decay of their members, which contributed to
the collapse of these networks. The reasons, mechanics, and prevention
mechanisms of such inactivity decay are not fully understood. In this work, we
analyze decayed and alive sub-websites from the StackExchange platform. The
analysis mainly focuses on the inactivity cascades that occur among the members
of these communities. We provide measures to understand the decay process and
statistical analysis to extract the patterns that accompany the inactivity
decay. Additionally, we predict cascade size and cascade virality using machine
learning. The results of this work include a statistically significant
difference of the decay patterns between the decayed and the alive
sub-websites. These patterns are mainly: cascade size, cascade virality,
cascade duration, and cascade similarity. Additionally, the contributed
prediction framework showed satisfactory prediction results compared to a
baseline predictor. Supported by empirical evidence, the main findings of this
work are: (1) the decay process is not governed by only one network measure; it
is better described using multiple measures; (2) the expert members of the
StackExchange sub-websites were mainly responsible for the activity or
inactivity of the StackExchange sub-websites; (3) the Statistics sub-website is
going through decay dynamics that may lead to it becoming fully-decayed; and
(4) decayed sub-websites were originally less resilient to inactivity decay,
unlike the alive sub-websites
From coincidence to purposeful flow? properties of transcendental information cascades
In this paper, we investigate a method for constructing cascades of information co-occurrence, which is suitable to trace emergent structures in information in scenarios where rich contextual features are unavailable. Our method relies only on the temporal order of content-sharing activities, and intrinsic properties of the shared content itself. We apply this method to analyse information dissemination patterns across the active online citizen science project Planet Hunters, a part of the Zooniverse platform. Our results lend insight into both structural and informational properties of different types of identifiers that can be used and combined to construct cascades. In particular, significant differences are found in the structural properties of information cascades when hashtags as used as cascade identifiers, compared with other content features. We also explain apparent local information losses in cascades in terms of information obsolescence and cascade divergence; e.g., when a cascade branches into multiple, divergent cascades with combined capacity equal to the original
- …