This paper introduces an extension of min-Hash, called Sim-min-Hash (SMH), to compare sets of real-valued vectors. It drastically improves the comparison metric between images while being very efficient for the retrieval and linking problems. This is achieved by adding to the original sketches extra information, in particular in the form of binary codes. The underlying motivation is to exploit the similarity measurements between vectors, in the spirit of Hamhal-00839921
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.