3 research outputs found

    REGAL: Representation Learning-based Graph Alignment

    Full text link
    Problems involving multiple networks are prevalent in many scientific and other domains. In particular, network alignment, or the task of identifying corresponding nodes in different networks, has applications across the social and natural sciences. Motivated by recent advancements in node representation learning for single-graph tasks, we propose REGAL (REpresentation learning-based Graph ALignment), a framework that leverages the power of automatically-learned node representations to match nodes across different graphs. Within REGAL we devise xNetMF, an elegant and principled node embedding formulation that uniquely generalizes to multi-network problems. Our results demonstrate the utility and promise of unsupervised representation learning-based network alignment in terms of both speed and accuracy. REGAL runs up to 30x faster in the representation learning stage than comparable methods, outperforms existing network alignment methods by 20 to 30% accuracy on average, and scales to networks with millions of nodes each.Comment: In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM), 201

    Graph Inference and Graph Matching

    Get PDF
    Graphs are widely used in many fields of research, ranging from natural sciences to computer and mathematical sciences. Graph inference is an area of intense research. In this dissertation, we propose several methodologies in graph inference. We focus on statistical inference using graph invariants, vertex nomination, and a divide-and-conquer graph matching technique. We present a comparative power analysis of various graph invariants for testing the hypothesis that the graph has a subgraph with higher edge probability. Given a graph drawn from a kidney-egg random graph model, the null hypothesis is that all edge probabilities are equal. The alternative hypothesis is that there exists a subset of vertices which are more likely to be adjacenct to each other than the rest of the graph. Using Monte Carlo simulations, we estimate the power of several graph invariants acting as test statistics. We discovered that for many choices of parameters in the random graph model, the scan statistic and clustering coefficient often dominate other graph invariants. However, our results indicates that none of the graph invariants considered is uniformly most powerful. Given a graph drawn from a stochastic block model where one block is of particular interest, vertex nomination is the task of creating a list of vertices such that there are an abundance of vertices from the block of interest at the top of the list. Vertex nomination is useful in situations where only a limited number of vertices can be examined and have their block membership checked. We propose several vertex nomination schemes, derive theoretical results for performance, and compare the schemes on simulated and real data. Given two graphs, graph matching is to create a mapping from one set of vertices to the other, such that the edge structure of the graphs is preserved as best as possible. We develop a new method for scaling graph matching algorithms, and prove performance guarantees. Any graph matching algorithm can be scaled using our divide-and-conquer technique. The performance of this technique is demonstrated on large simulated graphs and human brain graphs

    Joint Optimization of Fidelity and Commensurability for Manifold Alignment and Graph Matching

    Get PDF
    In this thesis, we investigate how to perform inference in settings in which the data consist of different modalities or views. For effective learning utilizing the information available, data fusion that considers all views of these multiview data settings is needed. We also require dimensionality reduction to address the problems associated with high dimensionality, or “the curse of dimensionality.” We are interested in the type of information that is available in the multiview data that is essential for the inference task. We also seek to determine the principles to be used throughout the dimensionality reduction and data fusion steps to provide acceptable task performance. Our research focuses on exploring how these queries and their solutions are relevant to particular data problems of interest
    corecore