Transformers in computational visual media: A survey

Y Xu, H Wei, M Lin, Y Deng, K Sheng, M Zhang… - Computational Visual …, 2022 - Springer
Transformers, the dominant architecture for natural language processing, have also recently
attracted much attention from computational visual media researchers due to their capacity …

A comprehensive survey of convolutions in deep learning: Applications, challenges, and future trends

A Younesi, M Ansari, M Fazli, A Ejlali, M Shafique… - IEEE …, 2024 - ieeexplore.ieee.org
In today's digital age, Convolutional Neural Networks (CNNs), a subset of Deep Learning
(DL), are widely used for various computer vision tasks such as image classification, object …

Cream: Weakly supervised object localization via class re-activation mapping

J Xu, J Hou, Y Zhang, R Feng… - Proceedings of the …, 2022 - openaccess.thecvf.com
Weakly Supervised Object Localization (WSOL) aims to localize objects with image-level
supervision. Existing works mainly rely on Class Activation Mapping (CAM) derived …

Background activation suppression for weakly supervised object localization and semantic segmentation

W Zhai, P Wu, K Zhu, Y Cao, F Wu, ZJ Zha - International Journal of …, 2024 - Springer
Weakly supervised object localization and semantic segmentation aim to localize objects
using only image-level labels. Recently, a new paradigm has emerged by generating a …