11 research outputs found

    Social Network De-anonymization Under Scale-free User Relations

    Get PDF
    We tackle the problem of user de-anonymization in social networks characterized by scale-free relationships between users. The network is modeled as a graph capturing the impact of power-law node degree distribution, which is a fundamental and quite common feature of social networks. Using this model, we present a de-anonymization algorithm that exploits an initial set of users, called seeds, that are known a priori. By employing bootstrap percolation theory and a novel graph slicing technique, we develop a rigorous analysis of the proposed algorithm under asymptotic conditions. Our analysis shows that large inhomogeneities in the node degree lead to a dramatic reduction of the size of the seed set that is necessary to successfully identify all other users. We characterize this set size when seeds are properly selected based on the node degree as well as when seeds are uniformly distributed. We prove that, given n nodes, the number of seeds required for network de-anonymization can be as small as n^epsilon, for any small epsilon>0. Additionally, we discuss the complexity of our de-anonymization algorithm and validate our results through numerical experiments on a real social network graph

    Seeded Graph Matching: Efficient Algorithms and Theoretical Guarantees

    Full text link
    In this paper, a new information theoretic framework for graph matching is introduced. Using this framework, the graph isomorphism and seeded graph matching problems are studied. The maximum degree algorithm for graph isomorphism is analyzed and sufficient conditions for successful matching are rederived using type analysis. Furthermore, a new seeded matching algorithm with polynomial time complexity is introduced. The algorithm uses `typicality matching' and techniques from point-to-point communications for reliable matching. Assuming an Erdos-Renyi model on the correlated graph pair, it is shown that successful matching is guaranteed when the number of seeds grows logarithmically with the number of vertices in the graphs. The logarithmic coefficient is shown to be inversely proportional to the mutual information between the edge variables in the two graphs

    On the Randi\'{c} index and its variants of network data

    Full text link
    Summary statistics play an important role in network data analysis. They can provide us with meaningful insight into the structure of a network. The Randi\'{c} index is one of the most popular network statistics that has been widely used for quantifying information of biological networks, chemical networks, pharmacologic networks, etc. A topic of current interest is to find bounds or limits of the Randi\'{c} index and its variants. A number of bounds of the indices are available in literature. Recently, there are several attempts to study the limits of the indices in the Erd\H{o}s-R\'{e}nyi random graph by simulation. In this paper, we shall derive the limits of the Randi\'{c} index and its variants of an inhomogeneous Erd\H{o}s-R\'{e}nyi random graph. Our results charaterize how network heterogeneity affects the indices and provide new insights about the Randi\'{c} index and its variants. Finally we apply the indices to several real-world networks.Comment: to appea

    Asymptotic distributions of the average clustering coefficient and its variant

    Full text link
    In network data analysis, summary statistics of a network can provide us with meaningful insight into the structure of the network. The average clustering coefficient is one of the most popular and widely used network statistics. In this paper, we investigate the asymptotic distributions of the average clustering coefficient and its variant of a heterogeneous Erd\"{o}s-R\'{e}nyi random graph. We show that the standardized average clustering coefficient converges in distribution to the standard normal distribution. Interestingly, the variance of the average clustering coefficient exhibits a phase transition phenomenon. The sum of weighted triangles is a variant of the average clustering coefficient. It is recently introduced to detect geometry in a network. We also derive the asymptotic distribution of the sum weighted triangles, which does not exhibit a phase transition phenomenon as the average clustering coefficient. This result signifies the difference between the two summary statistics

    Social Network De-Anonymization Under Scale-Free User Relations

    No full text
    We tackle the problem of user de-anonymization in social networks characterized by scale-free relationships between users. The network is modeled as a graph capturing the impact of power-law node degree distribution, which is a fundamental and quite common feature of social networks. Using this model, we present a de-anonymization algorithm that exploits an initial set of users, called seeds, that are known a priori. By employing bootstrap percolation theory and a novel graph slicing technique, we develop a rigorous analysis of the proposed algorithm under asymptotic conditions. Our analysis shows that large inhomogeneities in the node degree lead to a dramatic reduction of the size of the seed set that is necessary to successfully identify all other users. We characterize this set size when seeds are properly selected based on the node degree as well as when seeds are uniformly distributed. We prove that, given n nodes, the number of seeds required for network de-anonymization can be as small as n^epsilon, for any small epsilon>0. Additionally, we discuss the complexity of our de-anonymization algorithm and validate our results through numerical experiments on a real social network graph
    corecore