Multimodal pretraining, adaptation, and generation for recommendation: A survey
Personalized recommendation serves as a ubiquitous channel for users to discover
information tailored to their interests. However, traditional recommendation models primarily …
information tailored to their interests. However, traditional recommendation models primarily …
Marble: Music audio representation benchmark for universal evaluation
In the era of extensive intersection between art and Artificial Intelligence (AI), such as image
generation and fiction co-creation, AI for music remains relatively nascent, particularly in …
generation and fiction co-creation, AI for music remains relatively nascent, particularly in …
End-to-end modeling via information tree for one-shot natural language spatial video grounding
Natural language spatial video grounding aims to detect the relevant objects in video frames
with descriptive sentences as the query. In spite of the great advances, most existing …
with descriptive sentences as the query. In spite of the great advances, most existing …
On the effectiveness of speech self-supervised learning for music
Self-supervised learning (SSL) has shown promising results in various speech and natural
language processing applications. However, its efficacy in music information retrieval (MIR) …
language processing applications. However, its efficacy in music information retrieval (MIR) …
Contrastive balancing representation learning for heterogeneous dose-response curves estimation
Estimating the individuals' potential response to varying treatment doses is crucial for
decision-making in areas such as precision medicine and management science. Most …
decision-making in areas such as precision medicine and management science. Most …
Discover: Disentangled music representation learning for cover song identification
In the field of music information retrieval (MIR), cover song identification (CSI) is a
challenging task that aims to identify cover versions of a query song from a massive …
challenging task that aims to identify cover versions of a query song from a massive …
On the effect of data-augmentation on local embedding properties in the contrastive learning of music audio representations
Audio embeddings are crucial tools in understanding large catalogs of music. Typically
embeddings are evaluated on the basis of the performance they provide in a wide range of …
embeddings are evaluated on the basis of the performance they provide in a wide range of …
Pre-training strategies using contrastive learning and playlist information for music classification and similarity
In this work, we investigate an approach that relies on contrastive learning and music
metadata as a weak source of supervision to train music representation models. Recent …
metadata as a weak source of supervision to train music representation models. Recent …
Multimodal Pretraining and Generation for Recommendation: A Tutorial
Personalized recommendation stands as a ubiquitous channel for users to explore
information or items aligned with their interests. Nevertheless, prevailing recommendation …
information or items aligned with their interests. Nevertheless, prevailing recommendation …
Equivariant self-supervision for musical tempo estimation
E Quinton - arxiv preprint arxiv:2209.01478, 2022 - arxiv.org
Self-supervised methods have emerged as a promising avenue for representation learning
in the recent years since they alleviate the need for labeled datasets, which are scarce and …
in the recent years since they alleviate the need for labeled datasets, which are scarce and …