Transformers in computational visual media: A survey
Transformers, the dominant architecture for natural language processing, have also recently
attracted much attention from computational visual media researchers due to their capacity …
attracted much attention from computational visual media researchers due to their capacity …
Cream: Weakly supervised object localization via class re-activation map**
Abstract Weakly Supervised Object Localization (WSOL) aims to localize objects with image-
level supervision. Existing works mainly rely on Class Activation Map** (CAM) derived …
level supervision. Existing works mainly rely on Class Activation Map** (CAM) derived …
Background activation suppression for weakly supervised object localization
Weakly supervised object localization (WSOL) aims to localize objects using only image-
level labels. Recently a new paradigm has emerged by generating a foreground prediction …
level labels. Recently a new paradigm has emerged by generating a foreground prediction …
Lctr: On awakening the local continuity of transformer for weakly supervised object localization
Weakly supervised object localization (WSOL) aims to learn object localizer solely by using
image-level labels. The convolution neural network (CNN) based techniques often result in …
image-level labels. The convolution neural network (CNN) based techniques often result in …