Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment

Y Zheng, Z Li, X Li, J Liu, Y Wang, X Meng… - … Conference on Artificial …, 2024 - Springer
Image classification models often demonstrate unstable performance in real-world
applications due to variations in image information, driven by differing visual perspectives of …