Dip: Dual incongruity perceiving network for sarcasm detection

C Wen, G Jia, J Yang - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Sarcasm indicates the literal meaning is contrary to the real attitude. Considering the
popularity and complementarity of image-text data, we investigate the task of multi-modal …

Mart: Masked affective representation learning via masked temporal distribution distillation

Z Zhang, P Zhao, E Park… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Limited training data is a long-standing problem for video emotion analysis (VEA). Existing
works leverage the power of large-scale image datasets for transferring while failing to …

Adapt or perish: Adaptive sparse transformer with attentive feature refinement for image restoration

S Zhou, D Chen, J Pan, J Shi… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Transformer-based approaches have achieved promising performance in image restoration
tasks given their ability to model long-range dependencies which is crucial for recovering …

Extdm: Distribution extrapolation diffusion model for video prediction

Z Zhang, J Hu, W Cheng, D Paudel… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video prediction is a challenging task due to its nature of uncertainty especially for
forecasting a long period. To model the temporal dynamics advanced methods benefit from …

Progressive neighbor consistency mining for correspondence pruning

X Liu, J Yang - Proceedings of the IEEE/CVF Conference …, 2023 - openaccess.thecvf.com
The goal of correspondence pruning is to recognize correct correspondences (inliers) from
initial ones, with applications to various feature matching based tasks. Seeking neighbors in …

Lake-red: Camouflaged images generation by latent background knowledge retrieval-augmented diffusion

P Zhao, P Xu, P Qin, DP Fan, Z Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Camouflaged vision perception is an important vision task with numerous practical
applications. Due to the expensive collection and labeling costs this community struggles …

Grid: Guided refinement for detector-free multimodal image matching

Y Liu, W He, H Zhang - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
Multimodal image matching is essential in image stitching, image fusion, change detection,
and land cover map**. However, the severe nonlinear radiometric distortion (NRD) and …

DHM-Net: Deep Hypergraph Modeling for Robust Feature Matching

S Chen, G **ao, J Guo, Q Wu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
We present a novel deep hypergraph modeling architecture (called DHM-Net) for feature
matching in this paper. Our network focuses on learning reliable correspondences between …

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

T Liao, X Zhang, L Zhao, T Wang, G **ao - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Correspondence pruning aims to find correct matches (inliers) from an initial set of putative
correspondences, which is a fundamental task for many applications. The process of finding …

Heterogeneous context interaction network for vehicle re-identification

K Sun, X Pang, M Zheng, X Nie, X Li, H Zhou, Y Yin - Neural Networks, 2024 - Elsevier
Capturing global and subtle discriminative information using attention mechanisms is
essential to address the challenge of inter-class high similarity for vehicle re-identification …