A Comparative Study of Deep Learning-Based CBIR Frameworks: From CNN Baselines to Explainable and Hybrid Models
DOI:
https://doi.org/10.62647/IJITCE2025V13I2PP1446-1452Keywords:
Content-Based Image Retrieval, Deep Learning, Explainable AI, CNN-Transformer Fusion, Image SimilarityAbstract
Deep learning has proven to be a
breakthrough technology transforming
Content-Based Image Retrieval (CBIR). In
this paper, we present a comparative study
of three novel frameworks of CBIR that
were developed in a series of studies,
namely a baseline of Modified CNN model,
the Explain CBIR-Net utilizing explainable
AI with Grad-CAM, and HybridCBIRNet,
which incorporates CNN and
Transformer-based architecture with
weighted feature fusion. We quantitatively
estimate all models' accuracy, precision,
recall, and explain ability over the Mini
ImageNet dataset. The findings of our
comparative study underscore the balance
between accuracy, interpretability, and
feature richness. The findings validate the
superior efficacy of HybridCBIRNet
while highlighting the value of contextual
and explainable modelling towards critical
retrieval applications.
Downloads
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Authors

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.