Efficient Hashing Methods for Finding Objects in Large Volumes of Data

L. N. Ivanova, S. E. Ivanov

Abstract


This article analyzes machine learning methods for efficient access and rapid retrieval of objects such as images, videos, and documents in large data sets. Various hashing methods are considered: deep lifelong cross-modal hashing, learned locality-sensitive hashing, graph-collaborated auto-encoder hashing for multi-view binary clustering, sparsity-induced generative adversarial hashing, contrastive language-image pre-training multimodal hashing, locality-sensitive hashing with query-based dynamic bucketing, deep supervised hashing, and cross-modal hashing methods. A computational experiment was conducted to comparatively analyze and evaluate the accuracy, loss function, and performance of the hashing algorithms: deep hashing, deep supervised hashing, and deep learning hashing. Python programs were developed for calculating the hashing algorithms and presenting graphical results. For multimodal tasks, where data from various sources must be integrated and supplemented with new data, deep lifelong cross-modal hashing is the most suitable solution. An analysis of deep hashing methods has demonstrated the superiority of deep supervised hashing when used with labeled data and distinct object classes.

Full Text:

PDF

References


L. Xu, H. Li, B. Zheng, W. Li. and J. Lv, "Deep Lifelong Cross-modal Hashing," IEEE Transactions on Circuits and Systems for Video Technology, 2024.

R. Liu, J. Zhao, X. Chu, Y. Liang, W. Zhou and J. He, "Can LSH (locality-sensitive hashing) be replaced by neural network?,” Soft Computing, vol. 28, no. 2, pp. 1041-1053, 2024.

H. Wang, M. Yao, G. Jiang, Z. Mi, and X. Fu, "Graph-collaborated auto-encoder hashing for multiview binary clustering," IEEE Transactions on Neural Networks and Learning Systems, vol.35, no.7, pp. 10121-10133, 2023.

H. Liu, "Sparse-Inductive Generative Adversarial Hashing for Nearest Neighbor Search," arXiv preprint, arXiv:2306.06928, 2023.

J. Zhu, M. Sheng, M. Ke, Z. Huang and J. Chang, "CLIP Multi-modal Hashing: A new baseline CLIPMH," arXiv preprint, arXiv:2308.11797, 2023.

A. Singh and S. Gupta, "Learning to hash: a comprehensive survey of deep learning-based hashing methods," Knowledge and Information Systems, vol. 64., no. 10, pp. 2565-2597, 2022.

L. He, Z. Huang, E. Chen, Q. Liu, S. Tong, H. Wang and S. Wang, "An efficient and robust semantic hashing framework for similar text search," ACM Transactions on Information Systems, vol. 41, no. 4, pp. 1-31, 2023.

Y. Tian, X. Zhao and X. Zhou , "DB-LSH 2.0: Locality-sensitive hashing with query-based dynamic bucketing," IEEE Transactions on Knowledge and Data Engineering, vol. 36, no. 3, pp. 1000-1015, 2023.

X. Luo, H. Wang, D. Wu, C. Chen, M. Deng, J. Huang and X. S. Hua, "A survey on deep hashing methods," ACM Transactions on Knowledge Discovery from Data, vol. 17, no. 1, pp. 1-50, 2023.

Y. Chen, Y. Fang, Y. Zhang and I. King, "Bipartite graph convolutional hashing for effective and efficient top-n search in hamming space," In Proceedings of the ACM Web Conference 2023, pp. 3164-3172, 2023.

D. Zhang and X. J. Wu, "Robust and discrete matrix factorization hashing for cross-modal retrieval," Pattern Recognition, vol. 122, pp.108343, 2022.

W. Zhao, Z. Guan, H. Luo, J. Peng and J. Fan, "Deep multiple instances hashing for fast multi-object image search," IEEE Transactions on Image Processing, vol. 30, pp. 7995-8007, 2021.

G. E. Pibiri and R. Trani, "PTHash: Revisiting FCH minimal perfect hashing," In Proceedings of the 44th international ACM SIGIR conference on research and development in information retrieval, pp. 1339-1348, 2021.

Y. Yu, D. Kruyff, J. Jiao, T. Becker and M. Behrisch, "PSEUDo: Interactive pattern search in multivariate time series with locality-sensitive hashing and relevance feedback," IEEE Transactions on Visualization and Computer Graphics, vol. 29, no. 1, pp. 33-42, 2022.

D. Zhang, X. J. Wu, T. Xu and J. Kittler, "Two-stage supervised discrete hashing for cross-modal retrieval," IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52, no.11, pp. 7014-7026, 2022.

Z. Zhang, H. Luo, L. Zhu, G. Lu and H. T. Shen, "Modality-invariant asymmetric networks for cross-modal hashing," IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 5, pp. 5091-5104, 2022.

Y. Han, C. Liu and P. Wang, "A comprehensive survey on vector database: Storage and retrieval technique, challenge," arXiv preprint, arXiv:2310.11703, 2023.

L. Wang, J. Yang, M. Zareapoor and Z. Zheng, "Cluster-wise unsupervised hashing for cross-modal similarity search," Pattern Recognition, vol. 111, 107732, 2021.

D. Zhang, X. J. Wu and J. Yu, "Discrete bidirectional matrix factorization hashing for zero-shot cross-media retrieval," In Chinese conference on pattern recognition and computer vision (PRCV), Cham, Springer International Publishing, pp. 524-536, 2021.

G. Arora, S. Kalra, A. Bhatia and K. Tiwari, "Palmhashnet: Palmprint hashing network for indexing large databases to boost identification," IEEE Access, no. 9, pp. 145912-145928, 2021.

Y. Wang, B. Xue, Q. Cheng, Y. Chen and L. Zhang, "Deep Unified Cross-Modality Hashing by Pairwise Data Alignment," IJCAI, pp. 1129-1135, 2021.

D. Zhang, X. J. Wu, T. Xu and J. Kittler, "WATCH: Two-stage discrete cross-media hashing," IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 6, pp. 6461-6474, 2022.

W. W. Ng, J. Li, X. Tian and H. Wang, "Bit-wise attention deep complementary supervised hashing for image retrieval," Multimedia Tools and Applications, vol. 81, no. 1, pp. 927-951, 2022.

D. Osorio-Roig, C. Rathgeb, P. Drozdowski and C. Busch, "Stable hash generation for efficient privacy-preserving face identification," IEEE Transactions on Biometrics, Behavior, and Identity Science, vol. 4, no. 3, pp. 333-348, 2021.

X. Fang, K. Jiang, N. Han, S. Teng, G. Zhou and S. Xie, "Average approximate hashing-based double projections learning for cross-modal retrieval," IEEE Transactions on Cybernetics, vol. 52, no. 11, pp. 11780-11793, 2021.

D. Wu, Q. Dai, B. Li and W. Wang, "Deep uncoupled discrete hashing via similarity matrix decomposition," ACM Transactions on Multimedia Computing, Communications and Applications, vol. 19, no. 1, pp. 1-22, 2023


Refbacks

  • There are currently no refbacks.


Abava  Кибербезопасность ИТ конгресс СНЭ

ISSN: 2307-8162