Cddfuse: Correlation-driven dual-branch feature decomposition for multi-modality image fusion

Z Zhao, H Bai, J Zhang, Y Zhang, S Xu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multi-modality (MM) image fusion aims to render fused images that maintain the merits of
different modalities, eg, functional highlight and detailed textures. To tackle the challenge in …

Patchct: Aligning patch set and label set with conditional transport for multi-label image classification

M Li, D Wang, X Liu, Z Zeng, R Lu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multi-label image classification is a prediction task that aims to identify more than one label
from a given image. This paper considers the semantic consistency of the latent space …

Two-stream transformer for multi-label image classification

X Zhu, J Cao, J Ge, W Liu, B Liu - Proceedings of the 30th ACM …, 2022 - dl.acm.org
Multi-label image classification is a fundamental yet challenging task in computer vision that
aims to identify multiple objects from a given image. Recent studies on this task mainly focus …

Semantic and correlation disentangled graph convolutions for multilabel image recognition

S Cai, L Li, X Han, S Huang, Q Tian… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Multilabel image recognition (MLR) aims to annotate an image with comprehensive labels
and suffers from object occlusion or small object sizes within images. Although the existing …

Recent trends of multimodal affective computing: A survey from NLP perspective

G Hu, Y **n, W Lyu, H Huang, C Sun, Z Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Multimodal affective computing (MAC) has garnered increasing attention due to its broad
applications in analyzing human behaviors and intentions, especially in text-dominated …

Multi-modal extreme classification

A Mittal, K Dahiya, S Malani… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper develops the MUFIN technique for extreme classification (XC) tasks with millions
of labels where datapoints and labels are endowed with visual and textual descriptors …

Mfgan: multimodal fusion for industrial anomaly detection using attention-based autoencoder and generative adversarial network

X Qu, Z Liu, CQ Wu, A Hou, X Yin, Z Chen - Sensors, 2024 - mdpi.com
Anomaly detection plays a critical role in ensuring safe, smooth, and efficient operation of
machinery and equipment in industrial environments. With the wide deployment of …

Semantic representation and dependency learning for multi-label image recognition

T Pu, M Sun, H Wu, T Chen, L Tian, L Lin - Neurocomputing, 2023 - Elsevier
Recently many multi-label image recognition (MLR) works have made significant progress
by introducing pre-trained object detection models to generate lots of proposals or utilizing …

Towards private stylists via personalized compatibility learning

D Mo, X Zou, K Pang, WK Wong - Expert Systems with Applications, 2023 - Elsevier
Personalized outfit compatibility learning is an emerging yet challenging task. Most of the
existing methods focus on general outfit compatibility learning. Although a few works have …

Hyperspherical learning in multi-label classification

B Ke, Y Zhu, M Li, X Shu, R Qiao, B Ren - European Conference on …, 2022 - Springer
Learning from online data with noisy web labels is gaining more attention due to the
increasing cost of fully annotated datasets in large-scale multi-label classification tasks …