32 research outputs found
Probabilistic Approach to Structural Change Prediction in Evolving Social Networks
We propose a predictive model of structural
changes in elementary subgraphs of social network based on
Mixture of Markov Chains. The model is trained and verified
on a dataset from a large corporate social network analyzed
in short, one day-long time windows, and reveals distinctive
patterns of evolution of connections on the level of local
network topology. We argue that the network investigated in
such short timescales is highly dynamic and therefore immune
to classic methods of link prediction and structural analysis,
and show that in the case of complex networks, the dynamic
subgraph mining may lead to better prediction accuracy. The
experiments were carried out on the logs from the Wroclaw
University of Technology mail server
Link Prediction Based on Subgraph Evolution in Dynamic Social Networks
We propose a new method for characterizing the dynamics of complex networks with its application to the link prediction problem. Our approach is based on the discovery of network subgraphs (in this study: triads of nodes) and measuring their transitions during network evolution. We define the Triad Transition Matrix (TTM) containing the probabilities of transitions between triads found in the network, then we show how it can help to discover and quantify the dynamic patterns of network evolution. We also propose the application of TTM to link prediction with an algorithm (called TTM-predictor) which shows good performance, especially for sparse networks analyzed in short time scales. The future applications and research directions of our approach are also proposed and discussed
Understanding and Predicting Delay in Reciprocal Relations
Reciprocity in directed networks points to user's willingness to return
favors in building mutual interactions. High reciprocity has been widely
observed in many directed social media networks such as following relations in
Twitter and Tumblr. Therefore, reciprocal relations between users are often
regarded as a basic mechanism to create stable social ties and play a crucial
role in the formation and evolution of networks. Each reciprocity relation is
formed by two parasocial links in a back-and-forth manner with a time delay.
Hence, understanding the delay can help us gain better insights into the
underlying mechanisms of network dynamics. Meanwhile, the accurate prediction
of delay has practical implications in advancing a variety of real-world
applications such as friend recommendation and marketing campaign. For example,
by knowing when will users follow back, service providers can focus on the
users with a potential long reciprocal delay for effective targeted marketing.
This paper presents the initial investigation of the time delay in reciprocal
relations. Our study is based on a large-scale directed network from Tumblr
that consists of 62.8 million users and 3.1 billion user following relations
with a timespan of multiple years (from 31 Oct 2007 to 24 Jul 2013). We reveal
a number of interesting patterns about the delay that motivate the development
of a principled learning model to predict the delay in reciprocal relations.
Experimental results on the above mentioned dynamic networks corroborate the
effectiveness of the proposed delay prediction model.Comment: 10 page
Link Prediction Based on Common-Neighbors for Dynamic Social Network
AbstractLink prediction is an important issue in social networks. Most of the existing methods aim to predict interactions between individuals for static networks, ignoring the dynamic feature of social networks. This paper proposes a link prediction method which considers the dynamic topology of social networks. Given a snapshot of a social network at time t (or network evolution between t1 and t2), we seek to accurately predict the edges that will be added during the interval from time t (or t2) to a given future time t′. Our approach utilizes three metrics, the time-varied weight, the change degree of common neighbor and the intimacy between common neighbors. Moreover, we redefine the common neighbors by finding them within two hops. Experiments on DBLP show that our method can reach better results
Temporal similarity metrics for latent network reconstruction: The role of time-lag decay
When investigating the spreading of a piece of information or the diffusion
of an innovation, we often lack information on the underlying propagation
network. Reconstructing the hidden propagation paths based on the observed
diffusion process is a challenging problem which has recently attracted
attention from diverse research fields. To address this reconstruction problem,
based on static similarity metrics commonly used in the link prediction
literature, we introduce new node-node temporal similarity metrics. The new
metrics take as input the time-series of multiple independent spreading
processes, based on the hypothesis that two nodes are more likely to be
connected if they were often infected at similar points in time. This
hypothesis is implemented by introducing a time-lag function which penalizes
distant infection times. We find that the choice of this time-lag strongly
affects the metrics' reconstruction accuracy, depending on the network's
clustering coefficient and we provide an extensive comparative analysis of
static and temporal similarity metrics for network reconstruction. Our findings
shed new light on the notion of similarity between pairs of nodes in complex
networks
Interaction Prediction Problems in Link Streams
International audienceThe problems of link prediction and recovery have been the focus of much work during the last 10 years. This is due to the fact that these questions have a large number of practical implications ranging from detecting spam emails, to predicting which item is selected by which user in a recommendation system. However, considering the highly dynamical aspect of complex networks, there is a rising interest not only for knowing who will interact with whom, but also when. For example, when trying to control the spreading of a virus in a population, it is important to know whether an individual is bound to have a lot of new contacts before or after being infected. In that sense, this question is located at the crossroad of link prediction and another family of problems which has been widely dealt with in the literature, that is, time-series prediction. We name it the interaction prediction problem in link streams. It calls for the definition of specific features, strategies, and evaluation methods to capture both the structural and temporal aspects of the interactions. In this chapter, we propose a general formulation of the problem, consistent with the link stream formalism, which formally represents the streaming sequence of interactions between the elements of the system. Using this framework, we discuss the formulation of the interaction prediction problem and propose possible strategies to address it
Link prediction in evolving networks based on popularity of nodes
Link prediction aims to uncover the underlying relationship behind networks, which could be utilized to predict missing edges or identify the spurious edges. The key issue of link prediction is to estimate the likelihood of potential links in networks. Most classical static-structure based methods ignore the temporal aspects of networks, limited by the time-varying features, such approaches perform poorly in evolving networks. In this paper, we propose a hypothesis that the ability of each node to attract links depends not only on its structural importance, but also on its current popularity (activeness), since active nodes have much more probability to attract future links. Then a novel approach named popularity based structural perturbation method (PBSPM) and its fast algorithm are proposed to characterize the likelihood of an edge from both existing connectivity structure and current popularity of its two endpoints. Experiments on six evolving networks show that the proposed methods outperform state-of-the-art methods in accuracy and robustness. Besides, visual results and statistical analysis reveal that the proposed methods are inclined to predict future edges between active nodes, rather than edges between inactive nodes
Combining structural and dynamic information to predict activity in link streams
International audienceA link stream is a sequence of triplets (t, u, v) meaning that nodes u and v have interacted at time t. Capturing both the structural and temporal aspects of interactions is crucial for many real world datasets like contact between individuals. We tackle the issue of activity prediction in link streams, that is to say predicting the number of links occurring during a given period of time and we present a protocol that takes advantage of the temporal and structural information contained in the link stream. We introduce a way to represent the information captured using different features and combine them in a prediction function which is used to evaluate the future activity of links
HPRA: Hyperedge Prediction using Resource Allocation
Many real-world systems involve higher-order interactions and thus demand
complex models such as hypergraphs. For instance, a research article could have
multiple collaborating authors, and therefore the co-authorship network is best
represented as a hypergraph. In this work, we focus on the problem of hyperedge
prediction. This problem has immense applications in multiple domains, such as
predicting new collaborations in social networks, discovering new chemical
reactions in metabolic networks, etc. Despite having significant importance,
the problem of hyperedge prediction hasn't received adequate attention, mainly
because of its inherent complexity. In a graph with nodes the number of
potential edges is , whereas in a hypergraph, the number of
potential hyperedges is . To avoid searching through such a
huge space, current methods restrain the original problem in the following two
ways. One class of algorithms assume the hypergraphs to be -uniform.
However, many real-world systems are not confined only to have interactions
involving components. Thus, these algorithms are not suitable for many
real-world applications. The second class of algorithms requires a candidate
set of hyperedges from which the potential hyperedges are chosen. In the
absence of domain knowledge, the candidate set can have
possible hyperedges, which makes this problem intractable. We propose HPRA -
Hyperedge Prediction using Resource Allocation, the first of its kind
algorithm, which overcomes these issues and predicts hyperedges of any
cardinality without using any candidate hyperedge set. HPRA is a
similarity-based method working on the principles of the resource allocation
process. In addition to recovering missing hyperedges, we demonstrate that HPRA
can predict future hyperedges in a wide range of hypergraphs.Comment: Accepted at WebSci'2