Dip: Dual incongruity perceiving network for sarcasm detection
C Wen, G Jia, J Yang - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Sarcasm indicates the literal meaning is contrary to the real attitude. Considering the
popularity and complementarity of image-text data, we investigate the task of multi-modal …
Mart: Masked affective representation learning via masked temporal distribution distillation
Limited training data is a long-standing problem for video emotion analysis (VEA). Existing
works leverage the power of large-scale image datasets for transferring while failing to …
Adapt or perish: Adaptive sparse transformer with attentive feature refinement for image restoration
Transformer-based approaches have achieved promising performance in image restoration
tasks given their ability to model long-range dependencies which is crucial for recovering …
Extdm: Distribution extrapolation diffusion model for video prediction
Video prediction is a challenging task due to its nature of uncertainty especially for
forecasting a long period. To model the temporal dynamics advanced methods benefit from …
Progressive neighbor consistency mining for correspondence pruning
The goal of correspondence pruning is to recognize correct correspondences (inliers) from
initial ones, with applications to various feature matching based tasks. Seeking neighbors in …
Lake-red: Camouflaged images generation by latent background knowledge retrieval-augmented diffusion
Camouflaged vision perception is an important vision task with numerous practical
applications. Due to the expensive collection and labeling costs this community struggles …
Grid: Guided refinement for detector-free multimodal image matching
Multimodal image matching is essential in image stitching, image fusion, change detection,
and land cover mapping. However, the severe nonlinear radiometric distortion (NRD) and …
DHM-Net: Deep Hypergraph Modeling for Robust Feature Matching
We present a novel deep hypergraph modeling architecture (called DHM-Net) for feature
matching in this paper. Our network focuses on learning reliable correspondences between …
VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning
Correspondence pruning aims to find correct matches (inliers) from an initial set of putative
correspondences, which is a fundamental task for many applications. The process of finding …
Heterogeneous context interaction network for vehicle re-identification
K Sun, X Pang, M Zheng, X Nie, X Li, H Zhou, Y Yin - Neural Networks, 2024 - Elsevier
Capturing global and subtle discriminative information using attention mechanisms is
essential to address the challenge of inter-class high similarity for vehicle re-identification …