Multimodal machine learning: A survey and taxonomy

T Baltrušaitis, C Ahuja… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Our experience of the world is multimodal: we see objects, hear sounds, feel texture, smell
odors, and taste flavors. Modality refers to the way in which something happens or is …

Foundations & trends in multimodal machine learning: Principles, challenges, and open questions

PP Liang, A Zadeh, LP Morency - ACM Computing Surveys, 2024 - dl.acm.org
Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …

Learning structured output representation using deep conditional generative models

K Sohn, H Lee, X Yan - Advances in neural information …, 2015 - proceedings.neurips.cc
Supervised deep learning has been successfully applied for many recognition problems in
machine learning and computer vision. Although it can approximate a complex many-to-one …

Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

MISA: Modality-invariant and -specific representations for multimodal sentiment analysis

D Hazarika, R Zimmermann, S Poria - Proceedings of the 28th ACM …, 2020 - dl.acm.org
Multimodal Sentiment Analysis is an active area of research that leverages multimodal
signals for affective understanding of user-generated videos. The predominant approach …

COMPLETER: Incomplete multi-view clustering via contrastive prediction

Y Lin, Y Gou, Z Liu, B Li, J Lv… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
In this paper, we study two challenging problems in incomplete multi-view clustering
analysis, namely, i) how to learn an informative and consistent representation among …

Defining inflammatory cell states in rheumatoid arthritis joint synovial tissues by integrating single-cell transcriptomics and mass cytometry

F Zhang, K Wei, K Slowikowski, CY Fonseka… - Nature …, 2019 - nature.com
To define the cell populations that drive joint inflammation in rheumatoid arthritis (RA), we
applied single-cell RNA sequencing (scRNA-seq), mass cytometry, bulk RNA sequencing …

Deep clustering: A comprehensive survey

Y Ren, J Pu, Z Yang, J Xu, G Li, X Pu… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Cluster analysis plays an indispensable role in machine learning and data mining. Learning
a good data representation is crucial for clustering algorithms. Recently, deep clustering …

Robust multi-view clustering with incomplete information

M Yang, Y Li, P Hu, J Bai, J Lv… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The success of existing multi-view clustering methods heavily relies on the assumption of
view consistency and instance completeness, referred to as the complete information …

Trusted multi-view classification with dynamic evidential fusion

Z Han, C Zhang, H Fu, JT Zhou - IEEE transactions on pattern …, 2022 - ieeexplore.ieee.org
Existing multi-view classification algorithms focus on promoting accuracy by exploiting
different views, typically integrating them into common representations for follow-up tasks …