196 research outputs found
Inheritance patterns in citation networks reveal scientific memes
Memes are the cultural equivalent of genes that spread across human culture
by means of imitation. What makes a meme and what distinguishes it from other
forms of information, however, is still poorly understood. Our analysis of
memes in the scientific literature reveals that they are governed by a
surprisingly simple relationship between frequency of occurrence and the degree
to which they propagate along the citation graph. We propose a simple
formalization of this pattern and we validate it with data from close to 50
million publication records from the Web of Science, PubMed Central, and the
American Physical Society. Evaluations relying on human annotators, citation
network randomizations, and comparisons with several alternative approaches
confirm that our formula is accurate and effective, without a dependence on
linguistic or ontological knowledge and without the application of arbitrary
thresholds or filters.Comment: 8 two-column pages, 5 figures; accepted for publication in Physical
Review
Validating Network Value of Influencers by means of Explanations
Recently, there has been significant interest in social influence analysis.
One of the central problems in this area is the problem of identifying
influencers, such that by convincing these users to perform a certain action
(like buying a new product), a large number of other users get influenced to
follow the action. The client of such an application is a marketer who would
target these influencers for marketing a given new product, say by providing
free samples or discounts. It is natural that before committing resources for
targeting an influencer the marketer would be interested in validating the
influence (or network value) of influencers returned. This requires digging
deeper into such analytical questions as: who are their followers, on what
actions (or products) they are influential, etc. However, the current
approaches to identifying influencers largely work as a black box in this
respect. The goal of this paper is to open up the black box, address these
questions and provide informative and crisp explanations for validating the
network value of influencers.
We formulate the problem of providing explanations (called PROXI) as a
discrete optimization problem of feature selection. We show that PROXI is not
only NP-hard to solve exactly, it is NP-hard to approximate within any
reasonable factor. Nevertheless, we show interesting properties of the
objective function and develop an intuitive greedy heuristic. We perform
detailed experimental analysis on two real world datasets - Twitter and
Flixster, and show that our approach is useful in generating concise and
insightful explanations of the influence distribution of users and that our
greedy algorithm is effective and efficient with respect to several baselines
Identifying spatial invasion of pandemics on metapopulation networks via anatomizing arrival history
Spatial spread of infectious diseases among populations via the mobility of
humans is highly stochastic and heterogeneous. Accurate forecast/mining of the
spread process is often hard to be achieved by using statistical or mechanical
models. Here we propose a new reverse problem, which aims to identify the
stochastically spatial spread process itself from observable information
regarding the arrival history of infectious cases in each subpopulation. We
solved the problem by developing an efficient optimization algorithm based on
dynamical programming, which comprises three procedures: i, anatomizing the
whole spread process among all subpopulations into disjoint componential
patches; ii, inferring the most probable invasion pathways underlying each
patch via maximum likelihood estimation; iii, recovering the whole process by
assembling the invasion pathways in each patch iteratively, without burdens in
parameter calibrations and computer simulations. Based on the entropy theory,
we introduced an identifiability measure to assess the difficulty level that an
invasion pathway can be identified. Results on both artificial and empirical
metapopulation networks show the robust performance in identifying actual
invasion pathways driving pandemic spread.Comment: 14pages, 8 figures; Accepted by IEEE Transactions on Cybernetic
- …