Cddfuse: Correlation-driven dual-branch feature decomposition for multi-modality image fusion
Multi-modality (MM) image fusion aims to render fused images that maintain the merits of
different modalities, eg, functional highlight and detailed textures. To tackle the challenge in …
different modalities, eg, functional highlight and detailed textures. To tackle the challenge in …
Patchct: Aligning patch set and label set with conditional transport for multi-label image classification
Multi-label image classification is a prediction task that aims to identify more than one label
from a given image. This paper considers the semantic consistency of the latent space …
from a given image. This paper considers the semantic consistency of the latent space …
Two-stream transformer for multi-label image classification
Multi-label image classification is a fundamental yet challenging task in computer vision that
aims to identify multiple objects from a given image. Recent studies on this task mainly focus …
aims to identify multiple objects from a given image. Recent studies on this task mainly focus …
Semantic and correlation disentangled graph convolutions for multilabel image recognition
Multilabel image recognition (MLR) aims to annotate an image with comprehensive labels
and suffers from object occlusion or small object sizes within images. Although the existing …
and suffers from object occlusion or small object sizes within images. Although the existing …
Recent trends of multimodal affective computing: A survey from NLP perspective
Multimodal affective computing (MAC) has garnered increasing attention due to its broad
applications in analyzing human behaviors and intentions, especially in text-dominated …
applications in analyzing human behaviors and intentions, especially in text-dominated …
Multi-modal extreme classification
This paper develops the MUFIN technique for extreme classification (XC) tasks with millions
of labels where datapoints and labels are endowed with visual and textual descriptors …
of labels where datapoints and labels are endowed with visual and textual descriptors …
Mfgan: multimodal fusion for industrial anomaly detection using attention-based autoencoder and generative adversarial network
Anomaly detection plays a critical role in ensuring safe, smooth, and efficient operation of
machinery and equipment in industrial environments. With the wide deployment of …
machinery and equipment in industrial environments. With the wide deployment of …
Semantic representation and dependency learning for multi-label image recognition
Recently many multi-label image recognition (MLR) works have made significant progress
by introducing pre-trained object detection models to generate lots of proposals or utilizing …
by introducing pre-trained object detection models to generate lots of proposals or utilizing …
Towards private stylists via personalized compatibility learning
Personalized outfit compatibility learning is an emerging yet challenging task. Most of the
existing methods focus on general outfit compatibility learning. Although a few works have …
existing methods focus on general outfit compatibility learning. Although a few works have …
Hyperspherical learning in multi-label classification
Learning from online data with noisy web labels is gaining more attention due to the
increasing cost of fully annotated datasets in large-scale multi-label classification tasks …
increasing cost of fully annotated datasets in large-scale multi-label classification tasks …