4 research outputs found

    ์Œ๋ณ„ ์ƒ‰ ๊ฐœ์„ ๊ณผ ํšจ์œจ์ ์ธ ๋ฐฑํŠธ๋ž˜ํ‚น์„ ์ด์šฉํ•œ ๋น ๋ฅธ ๊ทธ๋ž˜ํ”„ ๋™ํ˜• ์•Œ๊ณ ๋ฆฌ์ฆ˜

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2021.8. ๊ตฌ๊ฑด๋ชจ.Graph isomorphism is a core problem in graph analysis of various domains including social networks, bioinformatics, chemistry, and so on. As real-world graphs are getting bigger and bigger, applications demand practically fast algorithms that can run on large-scale graphs. Existing approaches, however, show limited performances on large-scale real-world graphs either in time or space. Also, graph isomorphism query processing is often required in many applications, which is a natural generalization of graph isomorphism for multiple graphs. In this thesis we present fast algorithms for graph isomorphism and graph isomorphism query processing. First, we present a new approach to graph isomorphism, which is the framework of pairwise color refinement and efficient backtracking. Within the framework, we introduce three efficient techniques, which together lead to a much faster and scalable algorithm for graph isomorphism. Experiments on real-world datasets show that our algorithm outperforms state-of-the-art solutions by up to several orders of magnitude in terms of running time. Second, We develop an efficient algorithm for graph isomorphism query processing. We use a two-level index using degree sequences and color-label distributions. Experimental results on real datasets show that our algorithm is orders of magnitude faster than the state-of-the-art algorithms in terms of index construction time, and it runs faster than existing algorithms in terms of query processing time as the graph sizes increase.๊ทธ๋ž˜ํ”„ ๋™ํ˜• ๋ฌธ์ œ๋Š” ์†Œ์…œ ๋„คํŠธ์›Œํฌ ์„œ๋น„์Šค, ์ƒ๋ฌผ์ •๋ณดํ•™, ํ™”ํ•™์ •๋ณดํ•™ ๋“ฑ๋“ฑ ๋‹ค์–‘ํ•œ ์‘์šฉ ๋ถ„์•ผ์—์„œ ๊ทธ๋ž˜ํ”„ ๋ถ„์„์„ ์œ„ํ•ด ๋‹ค๋ฃจ๊ณ  ์žˆ๋Š” ํ•ต์‹ฌ ๋ฌธ์ œ์ด๋‹ค. ์‹ค์ƒํ™œ์—์„œ ๋‹ค๋ฃจ๋Š” ๊ทธ๋ž˜ํ”„ ๋ฐ์ดํ„ฐ์˜ ํฌ๊ธฐ๊ฐ€ ์ปค์ ธ ๊ฐ์— ๋”ฐ๋ผ, ๋Œ€์šฉ๋Ÿ‰์˜ ๊ทธ๋ž˜ํ”„๋ฅผ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ๋Š” ๊ทธ๋ž˜ํ”„ ๋™ํ˜• ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ํ•„์š”์„ฑ์ด ๋†’์•„์ง€๊ณ  ์žˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ํ˜„์žฌ ์กด์žฌํ•˜๋Š” ๊ทธ๋ž˜ํ”„ ๋™ํ˜• ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค์€ ๋Œ€์šฉ๋Ÿ‰์˜ ๊ทธ๋ž˜ํ”„์— ๋Œ€ํ•ด์„œ ์‹œ๊ฐ„ ํ˜น์€ ๊ณต๊ฐ„ ์ธก๋ฉด์—์„œ ํ•œ๊ณ„๋ฅผ ๋ณด์—ฌ์ค€๋‹ค. ์‘์šฉ ๋ถ„์•ผ ์ค‘์—์„œ๋Š” ์—ฌ๋Ÿฌ ๊ฐœ์˜ ๊ทธ๋ž˜ํ”„๋“ค ์ค‘์—์„œ ํ•˜๋‚˜์˜ ์ฟผ๋ฆฌ ๊ทธ๋ž˜ํ”„์™€ ๋™ํ˜•์ธ ๊ทธ๋ž˜ํ”„๋ฅผ ๋ชจ๋‘ ์ฐพ๋Š” ๋ฌธ์ œ, ์ฆ‰ ๊ทธ๋ž˜ํ”„ ๋™ํ˜• ์ฟผ๋ฆฌ ํ”„๋กœ์„ธ์‹ฑ์„ ์ข…์ข… ์š”๊ตฌํ•˜๊ธฐ๋„ ํ•œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๋Œ€์šฉ๋Ÿ‰์˜ ์‹ค์ œ ๊ทธ๋ž˜ํ”„ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•ด์„œ ๊ทธ๋ž˜ํ”„ ๋™ํ˜• ๋ฌธ์ œ์™€ ๊ทธ๋ž˜ํ”„ ๋™ํ˜• ์ฟผ๋ฆฌ ํ”„๋กœ์„ธ์‹ฑ ๋ฌธ์ œ๋ฅผ ๋น ๋ฅด๊ฒŒ ํ‘ธ๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค์„ ์ œ์•ˆํ•œ๋‹ค. ์ฒซ ๋ฒˆ์งธ๋กœ, ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๊ทธ๋ž˜ํ”„ ๋™ํ˜• ๋ฌธ์ œ๋ฅผ ์œ„ํ•œ ๋น ๋ฅด๊ณ  ํ™•์žฅ์„ฑ ์žˆ๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ์•ˆํ•œ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ์Œ๋ณ„ ์ƒ‰ ๊ฐœ์„ (pairwise color refinement)๊ณผ ํšจ์œจ์ ์ธ ๋ฐฑํŠธ๋ž˜ํ‚น์œผ๋กœ ๊ตฌ์„ฑ๋œ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์†Œ๊ฐœํ•œ๋‹ค. ์ด ํ”„๋ ˆ์ž„์›Œํฌ ๋‚ด์—์„œ ์„ธ ๊ฐ€์ง€ ํšจ์œจ์ ์ธ ํ…Œํฌ๋‹‰์„ ์‚ฌ์šฉํ•œ๋‹ค. ์‹ค์ œ ๊ทธ๋ž˜ํ”„ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ์‹คํ—˜์„ ํ†ตํ•ด ๋ณธ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด ํ˜„์กดํ•˜๋Š” ๊ฐ€์žฅ ๋น ๋ฅธ ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค๋ณด๋‹ค ํ‰๊ท  ์ˆ˜์ฒœ ๋ฐฐ ๋น ๋ฆ„์„ ๋ณด์˜€๋‹ค. ๋‘ ๋ฒˆ์งธ๋กœ, ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๊ทธ๋ž˜ํ”„ ๋™ํ˜• ์ฟผ๋ฆฌ ํ”„๋กœ์„ธ์‹ฑ์„ ์œ„ํ•œ ํšจ์œจ์ ์ธ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ฐœ๋ฐœํ•œ๋‹ค. ๋ณธ ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ์ฐจ์ˆ˜์—ด๊ณผ ์ƒ‰-๋ ˆ์ด๋ธ” ๋ถ„ํฌ๋ฅผ ์ด์šฉํ•œ ์ธ๋ฑ์Šค๋ฅผ ์ด์šฉํ•œ๋‹ค. ์‹ค์ œ ๊ทธ๋ž˜ํ”„ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ์‹คํ—˜์„ ํ†ตํ•ด ๋ณธ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด ํ˜„์กดํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜๋“ค๋ณด๋‹ค ์ธ๋ฑ์‹ฑ ์‹œ๊ฐ„์—์„œ๋Š” ํ•ญ์ƒ ํ‰๊ท  ์ˆ˜์ฒœ ๋ฐฐ ๋น ๋ฅด๊ณ , ์ฟผ๋ฆฌ ์ฒ˜๋ฆฌ ์‹œ๊ฐ„์—์„œ๋Š” ์ค‘โ‹…\cdot๋Œ€์šฉ๋Ÿ‰์˜ ๊ทธ๋ž˜ํ”„๋“ค์— ๋Œ€ํ•ด์„œ ํ‰๊ท  ์ˆ˜์‹ญ ๋ฐฐ ๋น ๋ฅด๊ฒŒ ๋™์ž‘ํ•˜๋Š” ๊ฒƒ์„ ๋ณด์˜€๋‹ค.1. Introduction 1 1.1. Background 1 1.2. Organization 3 2. Preliminaries 4 2.1. Notation 4 2.2. Problem Definitions 6 2.3. Related Work 7 3. Graph Isomorphism 9 3.1. Algorithm Overview 12 3.2. Pairwise Color Refinement and Binary Cell Mapping 13 3.3. Compressed Candidate Space 16 3.4. Backtracking and Partial Failing Sets 21 3.5. Performance Evaluation 31 3.5.1. Comparing with Existing Solutions 35 3.5.2. Effectiveness of Individual Techniques 39 3.5.3. Analysis with Varying Degrees of Similarity 42 3.5.4. Sensitivity Analysis 46 4. Graph Isomorphism Query Processing 48 4.1. Canonical Coloring 51 4.2. Index Construction 56 4.3. Query Processing 59 4.4. Performance Evaluation 63 4.4.1. Varying Number of Hops 67 4.4.2. Varying Number of Data Graphs 74 5. Conclusion 78 5.1. Summary 78 5.2. Future Directions 79 ์š”์•ฝ 95๋ฐ•

    Scalable Graph Isomorphism: Combining Pairwise Color Refinement and Backtracking via Compressed Candidate Space

    No full text
    Graph isomorphism is a core problem in graph analysis of various application domains. Given two graphs, the graph isomorphism problem is to determine whether there exists an isomorphism between them. As real-world graphs are getting bigger and bigger, applications demand practically fast algorithms that can run on large-scale graphs. However, existing approaches such as graph canonization and subgraph isomorphism show limited performances on large-scale graphs either in time or space. In this paper, we propose a new approach to graph isomorphism, which is the framework of pairwise color refinement and efficient backtracking. The main features of our approach are: (1) pairwise color refinement and binary cell mapping (2) compressed CS (candidate space), and (3) partial failing set, which together lead to a much faster and scalable algorithm for graph isomorphism. Extensive experiments with real-world datasets show that our approach outperforms stateof-the-art algorithms by up to orders of magnitude in terms of running time. I1

    Scalable graph isomorphism: Combining pairwise color refinement and backtracking via compressed candidate space

    No full text
    ยฉ 2021 IEEE.Graph isomorphism is a core problem in graph analysis of various application domains. Given two graphs, the graph isomorphism problem is to determine whether there exists an isomorphism between them. As real-world graphs are getting bigger and bigger, applications demand practically fast algorithms that can run on large-scale graphs. However, existing approaches such as graph canonization and subgraph isomorphism show limited performances on large-scale graphs either in time or space. In this paper, we propose a new approach to graph isomorphism, which is the framework of pairwise color refinement and efficient backtracking. The main features of our approach are: (1) pairwise color refinement and binary cell mapping (2) compressed CS (candidate space), and (3) partial failing set, which together lead to a much faster and scalable algorithm for graph isomorphism. Extensive experiments with real-world datasets show that our approach outperforms state-of-the-art algorithms by up to orders of magnitude in terms of running time.N

    Scalable graph isomorphism: Combining pairwise color refinement and backtracking via compressed candidate space

    No full text
    Graph isomorphism is a core problem in graph analysis of various application domains. Given two graphs, the graph isomorphism problem is to determine whether there exists an isomorphism between them. As real-world graphs are getting bigger and bigger, applications demand practically fast algorithms that can run on large-scale graphs. However, existing approaches such as graph canonization and subgraph isomorphism show limited performances on large-scale graphs either in time or space. In this paper, we propose a new approach to graph isomorphism, which is the framework of pairwise color refinement and efficient backtracking. The main features of our approach are: (1) pairwise color refinement and binary cell mapping (2) compressed CS (candidate space), and (3) partial failing set, which together lead to a much faster and scalable algorithm for graph isomorphism. Extensive experiments with real-world datasets show that our approach outperforms state-of-the-art algorithms by up to orders of magnitude in terms of running time
    corecore