International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 175 - Number 10
Year of Publication: 2020
Authors: Zhen Wang, Chenyu Wang, Anlei Jiao
10.5120/ijca2020920564
Zhen Wang, Chenyu Wang, Anlei Jiao. Minimal Similarity Loss Hashing based on Optimized Self-taught Clustering Method. International Journal of Computer Applications. 175, 10 (Aug 2020), 34-39. DOI=10.5120/ijca2020920564
With the rapid growth in the number of web images, how to respond quickly to approximate nearest neighbor (ANN) search queries has drawn increasing attention from researchers. Hashing algorithms map floating-point data into binary codes and perform ANN search via Hamming distances, giving them the advantage of low computational time complexity. To obtain excellent ANN search performance with compact binary codes, many algorithms adopt machine learning mechanisms to train hashing functions. For instance, k-means hashing (KMH) and stacked k-means hashing (SKMH) use the k-means clustering mechanism to learn encoding centers. Because KMH and SKMH generate their initial centers randomly, many centers can converge to the same solution. To fix this problem, a novel hashing method termed minimal similarity loss hashing (MSLH) is proposed, which generates initial centers with maximum average distance through a self-taught mechanism. Furthermore, MSLH defines the objective function as the combination of a similarity loss and a quantization loss. By minimizing the similarity loss, MSLH approximates the Euclidean distance between data pairs by their Hamming distance. The encoding results with minimal quantization loss map nearest neighbors to the same binary code, which adapts well to the data distribution. Comparative ANN search experiments on three publicly available datasets, including NUS-WIDE and 22K LabelMe, show that MSLH achieves superior performance.
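The two core ideas in the abstract — spreading initial centers far apart before clustering, and answering ANN queries by Hamming distance on learned codes — can be illustrated with a minimal sketch. This is not the paper's method: the function names, the farthest-point heuristic standing in for "maximum average distance" initialization, and the center-index codes are all illustrative assumptions.

```python
import numpy as np

def spread_init(data, k, seed=0):
    """Pick k initial centers that are mutually far apart (a common
    farthest-point heuristic; the paper's self-taught procedure may differ)."""
    rng = np.random.default_rng(seed)
    centers = [data[rng.integers(len(data))]]
    for _ in range(k - 1):
        # distance from every point to its nearest already-chosen center
        d = np.min([np.linalg.norm(data - c, axis=1) for c in centers], axis=0)
        centers.append(data[np.argmax(d)])  # farthest point becomes next center
    return np.stack(centers)

def encode(data, centers):
    """Quantize each point to its nearest center; the center index
    serves as the (integer-valued) binary code."""
    d = np.linalg.norm(data[:, None, :] - centers[None, :, :], axis=2)
    return np.argmin(d, axis=1)

def hamming(a, b):
    """Hamming distance between two integer codes, compared bitwise."""
    return bin(int(a) ^ int(b)).count("1")
```

With such codes in hand, an ANN query reduces to encoding the query point and ranking the database by `hamming`, which is the low-complexity lookup the abstract refers to.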