Music deep learning: deep learning methods for music signal processing—a review of the state-of-the-art

L Moysis, LA Iliadis, SP Sotiroudis, AD Boursianis… - Ieee …, 2023 - ieeexplore.ieee.org
The discipline of Deep Learning has been recognized for its strong computational tools,
which have been extensively used in data and signal processing, with innumerable …

Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arxiv preprint arxiv …, 2024 - arxiv.org
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Mert: Acoustic music understanding model with large-scale self-supervised training

Y Li, R Yuan, G Zhang, Y Ma, X Chen, H Yin… - arxiv preprint arxiv …, 2023 - arxiv.org
Self-supervised learning (SSL) has recently emerged as a promising paradigm for training
generalisable models on large-scale data in the fields of vision, text, and speech. Although …

Marble: Music audio representation benchmark for universal evaluation

R Yuan, Y Ma, Y Li, G Zhang, X Chen… - Advances in …, 2023 - proceedings.neurips.cc
In the era of extensive intersection between art and Artificial Intelligence (AI), such as image
generation and fiction co-creation, AI for music remains relatively nascent, particularly in …

LyEmoBERT: Classification of lyrics' emotion and recommendation using a pre-trained model

VR Revathy, AS Pillai, F Daneshfar - Procedia Computer Science, 2023 - Elsevier
Music plays a significant role in evoking human emotions. Thanks to the quick proliferation
of smartphones and mobile internet, music streaming applications and websites have made …

DISCO-10M: A large-scale music dataset

L Lanzendörfer, F Grötschla, E Funke… - Advances in Neural …, 2023 - proceedings.neurips.cc
Music datasets play a crucial role in advancing research in machine learning for music.
However, existing music datasets suffer from limited size, accessibility, and lack of audio …

MERT: Acoustic music understanding model with large-scale self-supervised training

LI Yizhi, R Yuan, G Zhang, Y Ma, X Chen… - The Twelfth …, 2023 - openreview.net
Self-supervised learning (SSL) has recently emerged as a promising paradigm for training
generalisable models on large-scale data in the fields of vision, text, and speech. Although …

A Multimodal Single-Branch Embedding Network for Recommendation in Cold-Start and Missing Modality Scenarios

C Ganhör, M Moscati, A Hausberger, S Nawaz… - Proceedings of the 18th …, 2024 - dl.acm.org
Most recommender systems adopt collaborative filtering (CF) and provide recommendations
based on past collective interactions. Therefore, the performance of CF algorithms degrades …

Enriching music descriptions with a finetuned-llm and metadata for text-to-music retrieval

SH Doh, M Lee, D Jeong, J Nam - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Text-to-Music Retrieval, finding music based on a given natural language query, plays a
pivotal role in content discovery within extensive music databases. To address this …

Are we there yet? a brief survey of music emotion prediction datasets, models and outstanding challenges

J Kang, D Herremans - arxiv preprint arxiv:2406.08809, 2024 - arxiv.org
Deep learning models for music have advanced drastically in recent years, but how good
are machine learning models at capturing emotion, and what challenges are researchers …