- Academic Search

Learning in audio-visual context: A review, analysis, and new perspective

Y Wei, D Hu, Y Tian, X Li - arXiv preprint arXiv:2208.09579, 2022 - arxiv.org

Sight and hearing are two senses that play a vital role in human communication and scene
understanding. To mimic human perception ability, audio-visual learning, aimed at …

Cited by: 62

Multimodal learning with transformers: A survey

P Xu, X Zhu, DA Clifton - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Transformer is a promising neural network learner, and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …

Cited by: 626

Decoupled multimodal distilling for emotion recognition

Y Li, Y Wang, Z Cui - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com

Human multimodal emotion recognition (MER) aims to perceive human emotions via
language, visual and acoustic modalities. Despite the impressive performance of previous …

Cited by: 114

Disentangled representation learning for multimodal emotion recognition

D Yang, S Huang, H Kuang, Y Du… - Proceedings of the 30th …, 2022 - dl.acm.org

Multimodal emotion recognition aims to identify human emotions from text, audio, and visual
modalities. Previous methods either explore correlations between different modalities or …

Cited by: 162

A survey of deep learning-based multimodal emotion recognition: Speech, text, and face

H Lian, C Lu, S Li, Y Zhao, C Tang, Y Zong - Entropy, 2023 - mdpi.com

Multimodal emotion recognition (MER) refers to the identification and understanding of
human emotional states by combining different signals, including—but not limited to—text …

Cited by: 41

Incomplete multimodality-diffused emotion recognition

Y Wang, Y Li, Z Cui - Advances in Neural Information …, 2023 - proceedings.neurips.cc

Human multimodal emotion recognition (MER) aims to perceive and understand human
emotions via various heterogeneous modalities, such as language, vision, and acoustic …

Cited by: 34

MART: Masked affective representation learning via masked temporal distribution distillation

Z Zhang, P Zhao, E Park… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Limited training data is a long-standing problem for video emotion analysis (VEA). Existing
works leverage the power of large-scale image datasets for transferring while failing to …

Cited by: 7

Target and source modality co-reinforcement for emotion understanding from asynchronous multimodal sequences

D Yang, Y Liu, C Huang, M Li, X Zhao, Y Wang… - Knowledge-Based …, 2023 - Elsevier

Perceiving human emotions from a multimodal perspective has received significant attention
in knowledge engineering communities. Due to the variable receiving frequency for …

Cited by: 46

Learning modality-specific and -agnostic representations for asynchronous multimodal language sequences

D Yang, H Kuang, S Huang, L Zhang - Proceedings of the 30th ACM …, 2022 - dl.acm.org

Understanding human behaviors and intents from videos is a challenging task. Video flows
usually involve time-series data from different modalities, such as natural language, facial …

Cited by: 56

Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis

L Sun, Z Lian, B Liu, J Tao - IEEE Transactions on Affective …, 2023 - ieeexplore.ieee.org

With the proliferation of user-generated online videos, Multimodal Sentiment Analysis (MSA)
has attracted increasing attention recently. Despite significant progress, there are still two …

Cited by: 89

Progressive modality reinforcement for human multimodal emotion recognition from unaligned...

Learning in audio-visual context: A review, analysis, and new perspective
