A survey of text classification with transformers: How wide? how large? how long? how accurate? how expensive? how safe?

J Fields, K Chovanec, P Madiraju - IEEE Access, 2024 - ieeexplore.ieee.org
Text classification in natural language processing (NLP) is evolving rapidly, particularly with
the surge in transformer-based models, including large language models (LLM). This paper …

A comprehensive survey on multi-modal conversational emotion recognition with deep learning

Y Shou, T Meng, W Ai, N Yin, K Li - arXiv preprint arXiv:2312.05735, 2023 - arxiv.org
Multi-modal conversation emotion recognition (MCER) aims to recognize and track the
speaker's emotional state using text, speech, and visual information in the conversation …

Revisiting disentanglement and fusion on modality and context in conversational multimodal emotion recognition

B Li, H Fei, L Liao, Y Zhao, C Teng, TS Chua… - Proceedings of the 31st …, 2023 - dl.acm.org
It has been a hot research topic to enable machines to understand human emotions in
multimodal contexts under dialogue scenarios, which is tasked with multimodal emotion …

A transformer-based model with self-distillation for multimodal emotion recognition in conversations

H Ma, J Wang, H Lin, B Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Emotion recognition in conversations (ERC), the task of recognizing the emotion of each
utterance in a conversation, is crucial for building empathetic machines. Existing studies …

MultiEMO: An attention-based correlation-aware multimodal fusion framework for emotion recognition in conversations

T Shi, SL Huang - Proceedings of the 61st Annual Meeting of the …, 2023 - aclanthology.org
Emotion Recognition in Conversations (ERC) is an increasingly popular task in the
Natural Language Processing community, which seeks to achieve accurate emotion …

A facial expression-aware multimodal multi-task learning framework for emotion recognition in multi-party conversations

W Zheng, J Yu, R Xia, S Wang - … of the 61st Annual Meeting of the …, 2023 - aclanthology.org
Multimodal Emotion Recognition in Multiparty Conversations (MERMC) has
recently attracted considerable attention. Due to the complexity of visual scenes in multi …

Contextual augmented global contrast for multimodal intent recognition

K Sun, Z Xie, M Ye, H Zhang - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Multimodal intent recognition (MIR) aims to perceive the human intent polarity via language,
visual, and acoustic modalities. The inherent intent ambiguity makes it challenging to …

Modeling multimodal social interactions: new challenges and baselines with densely aligned representations

S Lee, B Lai, F Ryan, B Boote… - Proceedings of the …, 2024 - openaccess.thecvf.com
Understanding social interactions involving both verbal and non-verbal cues is essential for
effectively interpreting social situations. However, most prior works on multimodal social cues …

Speech-text pre-training for spoken dialog understanding with explicit cross-modal alignment

T Yu, H Gao, TE Lin, M Yang, Y Wu, W Ma… - Proceedings of the …, 2023 - aclanthology.org
Recently, speech-text pre-training methods have shown remarkable success in many
speech and natural language processing tasks. However, most previous pre-trained models …

CFN-ESA: A cross-modal fusion network with emotion-shift awareness for dialogue emotion recognition

J Li, X Wang, Y Liu, Z Zeng - IEEE Transactions on Affective …, 2024 - ieeexplore.ieee.org
Multimodal emotion recognition in conversation (ERC) has garnered growing attention from
research communities in various fields. In this paper, we propose a Cross-modal Fusion …