Cluster-based data oriented hashing

Sanaa Chafik, Imane Daoudi, Mounim A. El Yacoubi, Hamid El Ouardi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Many multidimensional hashing schemes have been actively studied in recent years, providing efficient nearest neighbor search. Generally, we can distinguish several hashing families, such as learning based hashing, which provides better hash function selectivity by learning the dataset distribution. The spacial hashing family proposes a suitable partition of the multidimensional space, more adapted to data points distribution. In spite of the efficiency of multidimensional hashing techniques to solve the nearest neighbor search problem, these techniques suffer from scalabity issues. In this paper, we propose a novel hashing algorithm, named Cluster Based Data Oriented Hashing, that combines space hashing and learning based hashing techniques. The proposed approach applies first a clustering algorithm for structuring the multidimensional space into clusters. Then, in each cluster, a learning based hashing algorithm is applied by selecting an appropriate hash function that fits the data distribution. Experimental comparaisons with standard Euclidean Locality Sensitive Hashing demonstrate the effectiveness of the proposed method for large datasets.

Original languageEnglish
Title of host publicationProceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015
EditorsEric Gaussier, Longbing Cao, Patrick Gallinari, James Kwok, Gabriella Pasi, Osmar Zaiane
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781467382731
DOIs
Publication statusPublished - 2 Dec 2015
Externally publishedYes
Event2nd IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015 - Paris, France
Duration: 19 Oct 201521 Oct 2015

Publication series

NameProceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015

Conference

Conference2nd IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015
Country/TerritoryFrance
CityParis
Period19/10/1521/10/15

Fingerprint

Dive into the research topics of 'Cluster-based data oriented hashing'. Together they form a unique fingerprint.

Cite this