- Academic Search

Y Mao, J Zhang, M **ang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract We propose an Explicit Conditional Multimodal Variational Auto-Encoder
(ECMVAE) for audio-visual segmentation (AVS), aiming to segment sound sources in the …

Lưu Trích dẫn Trích dẫn 35 bài viết Bài viết có liên quan Tất cả 5 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Beyond mahalanobis distance for textual ood detection

P Colombo, E Dadalto, G Staerman… - Advances in …, 2022 - proceedings.neurips.cc

As the number of AI systems keeps growing, it is fundamental to implement and develop
efficient control mechanisms to ensure the safe and proper functioning of machine learning …

Lưu Trích dẫn Trích dẫn 51 bài viết Bài viết có liên quan Tất cả 7 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] google.com

Smin: Semi-supervised multi-modal interaction network for conversational emotion recognition

Z Lian, B Liu, J Tao - IEEE Transactions on Affective Computing, 2022 - ieeexplore.ieee.org

Conversational emotion recognition is a crucial research topic in human-computer
interactions. Due to the heavy annotation cost and inevitable label ambiguity, collecting …

Lưu Trích dẫn Trích dẫn 68 bài viết Bài viết có liên quan Tất cả 5 phiên bản

Multimodal sentiment analysis with two-phase multi-task learning

B Yang, L Wu, J Zhu, B Shao, X Lin… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org

Multimodal Sentiment Analysis (MSA) is a challenging research area that studies sentiment
expressed from multiple heterogeneous modalities. Given those pre-trained language …

Lưu Trích dẫn Trích dẫn 57 bài viết Bài viết có liên quan Tất cả 2 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Infolm: A new metric to evaluate summarization & data2text generation

PJA Colombo, C Clavel, P Piantanida - Proceedings of the AAAI …, 2022 - ojs.aaai.org

Assessing the quality of natural language generation (NLG) systems through human
annotation is very expensive. Additionally, human annotation campaigns are time …

Lưu Trích dẫn Trích dẫn 55 bài viết Bài viết có liên quan Tất cả 8 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning disentangled textual representations via statistical measures of similarity

P Colombo, G Staerman, N Noiry… - arxiv preprint arxiv …, 2022 - arxiv.org

When working with textual data, a natural application of disentangled representations is fair
classification where the goal is to make predictions without being biased (or influenced) by …

Lưu Trích dẫn Trích dẫn 42 bài viết Bài viết có liên quan Tất cả 8 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Automatic text evaluation through the lens of Wasserstein barycenters

P Colombo, G Staerman, C Clavel… - arxiv preprint arxiv …, 2021 - arxiv.org

A new metric\texttt {BaryScore} to evaluate text generation based on deep contextualized
embeddings eg, BERT, Roberta, ELMo) is introduced. This metric is motivated by a new …

Lưu Trích dẫn Trích dẫn 55 bài viết Bài viết có liên quan Tất cả 10 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

What are the best systems? new perspectives on nlp benchmarking

P Colombo, N Noiry, E Irurozki… - Advances in neural …, 2022 - proceedings.neurips.cc

Abstract In Machine Learning, a benchmark refers to an ensemble of datasets associated
with one or multiple metrics together with a way to aggregate different systems …

Lưu Trích dẫn Trích dẫn 41 bài viết Bài viết có liên quan Tất cả 5 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] aclanthology.org

AMOA: Global acoustic feature enhanced modal-order-aware network for multimodal sentiment analysis

Z Li, Y Zhou, W Zhang, Y Liu, C Yang… - Proceedings of the …, 2022 - aclanthology.org

In recent years, multimodal sentiment analysis (MSA) has attracted more and more interest,
which aims to predict the sentiment polarity expressed in a video. Existing methods typically …

Lưu Trích dẫn Trích dẫn 21 bài viết Bài viết có liên quan Xem dạng HTML

Learning emotional prompt features with multiple views for visual emotion analysis

Q Xu, Y Wei, S Yuan, J Wu, L Wang, C Wu - Information Fusion, 2024 - Elsevier

Visual emotion analysis (VEA) aiming to detect the emotions behind images, has gained
increasing attention with the development of online social media. Recent studies in prompt …

Lưu Trích dẫn Trích dẫn 3 bài viết Bài viết có liên quan Tất cả 2 phiên bản

Tạo thông báo

Trích dẫn

Tìm kiếm nâng cao

Đã lưu vào Thư viện của tôi

Improving multimodal fusion via mutual dependency maximisation

Multimodal variational auto-encoder based audio-visual segmentation

Beyond mahalanobis distance for textual ood detection

Smin: Semi-supervised multi-modal interaction network for conversational emotion recognition

Multimodal sentiment analysis with two-phase multi-task learning

Infolm: A new metric to evaluate summarization & data2text generation

Learning disentangled textual representations via statistical measures of similarity

Automatic text evaluation through the lens of Wasserstein barycenters

What are the best systems? new perspectives on nlp benchmarking

AMOA: Global acoustic feature enhanced modal-order-aware network for multimodal sentiment analysis

Learning emotional prompt features with multiple views for visual emotion analysis