Foundations & trends in multimodal machine learning: Principles, challenges, and open questions

PP Liang, A Zadeh, LP Morency - ACM Computing Surveys, 2024 - dl.acm.org
Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …

Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Deep clustering: A comprehensive survey

Y Ren, J Pu, Z Yang, J Xu, G Li, X Pu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Cluster analysis plays an indispensable role in machine learning and data mining. Learning
a good data representation is crucial for clustering algorithms. Recently, deep clustering …

Robust multi-view clustering with incomplete information

M Yang, Y Li, P Hu, J Bai, J Lv… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The success of existing multi-view clustering methods heavily relies on the assumption of
view consistency and instance completeness, referred to as the complete information …

Completer: Incomplete multi-view clustering via contrastive prediction

Y Lin, Y Gou, Z Liu, B Li, J Lv… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
In this paper, we study two challenging problems in incomplete multi-view clustering
analysis, namely, i) how to learn an informative and consistent representation among …

Dual contrastive prediction for incomplete multi-view representation learning

Y Lin, Y Gou, X Liu, J Bai, J Lv… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
In this article, we propose a unified framework to solve the following two challenging
problems in incomplete multi-view representation learning: i) how to learn a consistent …

Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions

PP Liang, A Zadeh, LP Morency - arXiv preprint arXiv:2209.03430, 2022 - arxiv.org
Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …

Trusted multi-view classification with dynamic evidential fusion

Z Han, C Zhang, H Fu, JT Zhou - IEEE Transactions on Pattern …, 2022 - ieeexplore.ieee.org
Existing multi-view classification algorithms focus on promoting accuracy by exploiting
different views, typically integrating them into common representations for follow-up tasks …

A comprehensive survey on multi-view clustering

U Fang, M Li, J Li, L Gao, T Jia… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The development of information gathering and extraction technology has led to the
popularity of multi-view data, which enables samples to be seen from numerous …

Tensorized bipartite graph learning for multi-view clustering

W Xia, Q Gao, Q Wang, X Gao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Despite the impressive clustering performance and efficiency in characterizing both the
relationship between the data and cluster structure, most existing graph-based multi-view …