Search CORE

164 research outputs found

効率的で安全な集合間類似結合に関する研究

Author: Mateus SilqueiraHicksonCruz
Publication venue
Publication date: 01/01/2018
Field of study

筑波大学 (University of Tsukuba)201

Tsukuba Repository

Parallel Index-Based Structural Graph Clustering and Its Approximation

Author: Dhulipala Laxman
Shun Julian
Tseng Tom
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/03/2021
Field of study

SCAN (Structural Clustering Algorithm for Networks) is a well-studied, widely used graph clustering algorithm. For large graphs, however, sequential SCAN variants are prohibitively slow, and parallel SCAN variants do not effectively share work among queries with different SCAN parameter settings. Since users of SCAN often explore many parameter settings to find good clusterings, it is worthwhile to precompute an index that speeds up queries. This paper presents a practical and provably efficient parallel index-based SCAN algorithm based on GS*-Index, a recent sequential algorithm. Our parallel algorithm improves upon the asymptotic work of the sequential algorithm by using integer sorting. It is also highly parallel, achieving logarithmic span (parallel time) for both index construction and clustering queries. Furthermore, we apply locality-sensitive hashing (LSH) to design a novel approximate SCAN algorithm and prove guarantees for its clustering behavior. We present an experimental evaluation of our algorithms on large real-world graphs. On a 48-core machine with two-way hyper-threading, our parallel index construction achieves 50--151

\times

speedup over the construction of GS*-Index. In fact, even on a single thread, our index construction algorithm is faster than GS*-Index. Our parallel index query implementation achieves 5--32

\times

speedup over GS*-Index queries across a range of SCAN parameter values, and our implementation is always faster than ppSCAN, a state-of-the-art parallel SCAN algorithm. Moreover, our experiments show that applying LSH results in faster index construction while maintaining good clustering quality

arXiv.org e-Print Archive

DSpace@MIT

Learning Fine-grained Image Similarity with Deep Ranking

Author: Chen Bo
Leung Thomas
Philbin James
Rosenberg Chuck
song Yang
Wang Jiang
Wang Jinbin
Wu Ying
Publication venue
Publication date: 17/04/2014
Field of study

Learning fine-grained image similarity is a challenging task. It needs to capture between-class and within-class image differences. This paper proposes a deep ranking model that employs deep learning techniques to learn similarity metric directly from images.It has higher learning capability than models based on hand-crafted features. A novel multiscale network structure has been developed to describe the images effectively. An efficient triplet sampling algorithm is proposed to learn the model with distributed asynchronized stochastic gradient. Extensive experiments show that the proposed algorithm outperforms models based on hand-crafted visual features and deep classification models.Comment: CVPR 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Caltech Authors

Analysis of SparseHash: an efficient embedding of set-similarity via sparse projections

Author: Bianchi Tiziano
Fosson Sophie Marie
Magli Enrico
Ravazzi Chiara
Valsesia Diego
Publication venue
Publication date: 01/01/2019
Field of study

Embeddings provide compact representations of signals in order to perform efficient inference in a wide variety of tasks. In particular, random projections are common tools to construct Euclidean distance-preserving embeddings, while hashing techniques are extensively used to embed set-similarity metrics, such as the Jaccard coefficient. In this letter, we theoretically prove that a class of random projections based on sparse matrices, called SparseHash, can preserve the Jaccard coefficient between the supports of sparse signals, which can be used to estimate set similarities. Moreover, besides the analysis, we provide an efficient implementation and we test the performance in several numerical experiments, both on synthetic and real datasets.Comment: 25 pages, 6 figure

arXiv.org e-Print Archive

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Analysis of SparseHash: An efficient embedding of set-similarity via sparse projections

Author: Bianchi Tiziano
Fosson Sophie M.
Magli Enrico
Ravazzi Chiara
Valsesia Diego
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)