Diff-a-riff: Musical accompaniment co-creation via latent diffusion models

J Nistal, M Pasini, C Aouameur, M Grachten… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advancements in deep generative models present new opportunities for music
production but also pose challenges, such as high computational demands and limited …

Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning

Y Zhang, Y Ikemiya, W Choi, N Murata… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in text-to-music editing, which employ text queries to modify music (eg\by
changing its style or adjusting instrumental components), present unique challenges and …

Cocola: Coherence-oriented contrastive learning of musical audio representations

R Ciranni, G Mariani, M Mancusi, E Postolache… - arxiv preprint arxiv …, 2024 - arxiv.org
We present COCOLA (Coherence-Oriented Contrastive Learning for Audio), a contrastive
learning method for musical audio representations that captures the harmonic and rhythmic …

Naturalistic Music Decoding from EEG Data via Latent Diffusion Models

E Postolache, N Polouliakh, H Kitano… - arxiv preprint arxiv …, 2024 - arxiv.org
In this article, we explore the potential of using latent diffusion models, a family of powerful
generative models, for the task of reconstructing naturalistic music from …

Improving Musical Accompaniment Co-creation via Diffusion Transformers

J Nistal, M Pasini, S Lattner - arxiv preprint arxiv:2410.23005, 2024 - arxiv.org
Building upon Diff-A-Riff, a latent diffusion model for musical instrument accompaniment
generation, we present a series of improvements targeting quality, diversity, inference …

The Interpretation Gap in Text-to-Music Generation Models

Y Zang, Y Zhang - arxiv preprint arxiv:2407.10328, 2024 - arxiv.org
Large-scale text-to-music generation models have significantly enhanced music creation
capabilities, offering unprecedented creative freedom. However, their ability to collaborate …

Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems

J Zhang, J Zhuang, C **, G Li, Y Gu - arxiv preprint arxiv:2406.06959, 2024 - arxiv.org
The recent emergence of diffusion models has significantly advanced the precision of
learnable priors, presenting innovative avenues for addressing inverse problems. Since …

Harnessing the capabilities of Generative Models

G Mariani - 2024 - tesidottorato.depositolegale.it
Generative models have experienced significant advancements in recent years, driven by
the introduction of architectures such as Stable Diffusion, GPT-3, ChatGPT, and many …

From source separation to compositional music generation

E Postolache - 2024 - tesidottorato.depositolegale.it
This thesis proposes a journey into sound processing through deep learning, particularly
generative models, exploring the compositional structure of sound, which is layered in …