Foundations & trends in multimodal machine learning: Principles, challenges, and open questions

PP Liang, A Zadeh, LP Morency - ACM Computing Surveys, 2024 - dl.acm.org
Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …

Multimodal research in vision and language: A review of current and emerging trends

S Uppal, S Bhagat, D Hazarika, N Majumder, S Poria… - Information …, 2022 - Elsevier
Deep Learning and its applications have cascaded impactful research and development
with a diverse range of modalities present in the real-world data. More recently, this has …

Misa: Modality-invariant and-specific representations for multimodal sentiment analysis

D Hazarika, R Zimmermann, S Poria - Proceedings of the 28th ACM …, 2020 - dl.acm.org
Multimodal Sentiment Analysis is an active area of research that leverages multimodal
signals for affective understanding of user-generated videos. The predominant approach …

Disentangled representation learning for multimodal emotion recognition

D Yang, S Huang, H Kuang, Y Du… - Proceedings of the 30th …, 2022 - dl.acm.org
Multimodal emotion recognition aims to identify human emotions from text, audio, and visual
modalities. Previous methods either explore correlations between different modalities or …

Bi-bimodal modality fusion for correlation-controlled multimodal sentiment analysis

W Han, H Chen, A Gelbukh, A Zadeh… - Proceedings of the …, 2021 - dl.acm.org
Multimodal sentiment analysis aims to extract and integrate semantic information collected
from multiple modalities to recognize the expressed emotions and sentiment in multimodal …

[HTML][HTML] Multibench: Multiscale benchmarks for multimodal representation learning

PP Liang, Y Lyu, X Fan, Z Wu, Y Cheng… - Advances in neural …, 2021 - ncbi.nlm.nih.gov
Learning multimodal representations involves integrating information from multiple
heterogeneous sources of data. It is a challenging yet crucial area with numerous real-world …

Beneath the tip of the iceberg: Current challenges and new directions in sentiment analysis research

S Poria, D Hazarika, N Majumder… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Sentiment analysis as a field has come a long way since it was first introduced as a task
nearly 20 years ago. It has widespread commercial applications in various domains like …

A low-rank matching attention based cross-modal feature fusion method for conversational emotion recognition

Y Shou, H Liu, X Cao, D Meng… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Conversational emotion recognition (CER) is an important research topic in human-
computer interactions. Although recent advancements in transformer-based cross-modal …

Xmecap: Meme caption generation with sub-image adaptability

Y Chen, S Yan, Z Zhu, Z Li, Y **ao - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
Humor, deeply rooted in societal meanings and cultural details, poses a unique challenge
for machines. While advances have been made in natural language processing, real-world …

Quantifying & modeling multimodal interactions: An information decomposition framework

PP Liang, Y Cheng, X Fan, CK Ling… - Advances in …, 2024 - proceedings.neurips.cc
The recent explosion of interest in multimodal applications has resulted in a wide selection
of datasets and methods for representing and integrating information from different …