Mulan: A joint embedding of music audio and natural language

Q Huang, A Jansen, J Lee, R Ganti, JY Li… - arxiv preprint arxiv …, 2022 - arxiv.org
Music tagging and content-based retrieval systems have traditionally been constructed
using pre-defined ontologies covering a rigid set of music attributes or text queries. This …

[HTML][HTML] Investigating gender fairness of recommendation algorithms in the music domain

AB Melchiorre, N Rekabsaz… - Information Processing …, 2021 - Elsevier
Although recommender systems (RSs) play a crucial role in our society, previous studies
have revealed that the performance of RSs may considerably differ between groups of …

Multimodal pretraining, adaptation, and generation for recommendation: A survey

Q Liu, J Zhu, Y Yang, Q Dai, Z Du, XM Wu… - Proceedings of the 30th …, 2024 - dl.acm.org
Personalized recommendation serves as a ubiquitous channel for users to discover
information tailored to their interests. However, traditional recommendation models primarily …

Codified audio language modeling learns useful representations for music information retrieval

R Castellon, C Donahue, P Liang - arxiv preprint arxiv:2107.05677, 2021 - arxiv.org
We demonstrate that language models pre-trained on codified (discretely-encoded) music
audio learn representations that are useful for downstream MIR tasks. Specifically, we …

Music recommendation systems: Techniques, use cases, and challenges

M Schedl, P Knees, B McFee, D Bogdanov - Recommender systems …, 2021 - Springer
This chapter gives an introduction to music recommender systems, considering the unique
characteristics of the music domain. We take a user-centric perspective, by organizing our …

Supervised and unsupervised learning of audio representations for music understanding

MC McCallum, F Korzeniowski, S Oramas… - arxiv preprint arxiv …, 2022 - arxiv.org
In this work, we provide a broad comparative analysis of strategies for pre-training audio
understanding models for several tasks in the music domain, including labelling of genre …

Recommendation with generative models

Y Deldjoo, Z He, J McAuley, A Korikov… - arxiv preprint arxiv …, 2024 - arxiv.org
Generative models are a class of AI models capable of creating new instances of data by
learning and sampling from their statistical distributions. In recent years, these models have …

Learning music audio representations via weak language supervision

I Manco, E Benetos, E Quinton… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Audio representations for music information retrieval are typically learned via supervised
learning in a task-specific fashion. Although effective at producing state-of-the-art results …

Learning audio embeddings with user listening data for content-based music recommendation

K Chen, B Liang, X Ma, M Gu - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Personalized recommendation on new track releases has always been a challenging
problem in the music industry. To combat this problem, we first explore user listening history …

LARP: Language Audio Relational Pre-training for Cold-Start Playlist Continuation

R Salganik, X Liu, Y Ma, J Kang, TS Chua - Proceedings of the 30th ACM …, 2024 - dl.acm.org
As online music consumption increasingly shifts towards playlist-based listening, the task of
playlist continuation, in which an algorithm suggests songs to extend a playlist in a …