Fakemusiccaps: a dataset for detection and attribution of synthetic music generated via text-to-music models

L Comanducci, P Bestagini, S Tubaro - arxiv preprint arxiv:2409.10684, 2024 - arxiv.org
Text-To-Music (TTM) models have recently revolutionized the automatic music generation
research field. Specifically, by reaching superior performances to all previous state-of-the-art …

MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling

S Rouard, RS Roman, Y Adi, A Roebel - arxiv preprint arxiv:2501.01757, 2025 - arxiv.org
While most music generation models generate a mixture of stems (in mono or stereo), we
propose to train a multi-stem generative model with 3 stems (bass, drums and other) that …

Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer

S Hou, S Liu, R Yuan, W Xue, Y Shan, M Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
Despite the significant progress in controllable music generation and editing, challenges
remain in the quality and length of generated music due to the use of Mel-spectrogram …

The Interpretation Gap in Text-to-Music Generation Models

Y Zang, Y Zhang - arxiv preprint arxiv:2407.10328, 2024 - arxiv.org
Large-scale text-to-music generation models have significantly enhanced music creation
capabilities, offering unprecedented creative freedom. However, their ability to collaborate …

Improving Controllability and Editability for Pretrained Text-to-Music Generation Models

Y Zhang - arxiv preprint arxiv:2411.12641, 2024 - arxiv.org
The field of AI-assisted music creation has made significant strides, yet existing systems
often struggle to meet the demands of iterative and nuanced music production. These …