Hybrid transformers for music source separation

S Rouard, F Massa, A Défossez - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
A natural question arising in Music Source Separation (MSS) is whether long range
contextual information is useful, or whether local acoustic features are sufficient. In other …

Hybrid spectrogram and waveform source separation

A Défossez - arxiv preprint arxiv:2111.03600, 2021 - arxiv.org
Source separation models either work on the spectrogram or waveform domain. In this work,
we show how to perform end-to-end hybrid source separation, letting the model decide …

Music source separation with band-split RNN

Y Luo, J Yu - IEEE/ACM Transactions on Audio, Speech, and …, 2023 - ieeexplore.ieee.org
The performance of music source separation (MSS) models has been greatly improved in
recent years thanks to the development of novel neural network architectures and training …

Mert: Acoustic music understanding model with large-scale self-supervised training

Y Li, R Yuan, G Zhang, Y Ma, X Chen, H Yin… - arxiv preprint arxiv …, 2023 - arxiv.org
Self-supervised learning (SSL) has recently emerged as a promising paradigm for training
generalisable models on large-scale data in the fields of vision, text, and speech. Although …

Marble: Music audio representation benchmark for universal evaluation

R Yuan, Y Ma, Y Li, G Zhang, X Chen… - Advances in …, 2023 - proceedings.neurips.cc
In the era of extensive intersection between art and Artificial Intelligence (AI), such as image
generation and fiction co-creation, AI for music remains relatively nascent, particularly in …

Singfake: Singing voice deepfake detection

Y Zang, Y Zhang, M Heydari… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
The rise of singing voice synthesis presents critical challenges to artists and industry
stakeholders over unauthorized voice usage. Unlike synthesized speech, synthesized …

The Sound Demixing Challenge 2023$\unicode {x2013} $ Music Demixing Track

G Fabbro, S Uhlich, CH Lai, W Choi… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper summarizes the music demixing (MDX) track of the Sound Demixing Challenge
(SDX'23). We provide a summary of the challenge setup and introduce the task of robust …

Music source separation with band-split rope transformer

WT Lu, JC Wang, Q Kong… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Music source separation (MSS) aims to separate a music recording into multiple musically
distinct stems, such as vocals, bass, drums, and more. Recently, deep learning approaches …

Automatic music mixing with deep learning and out-of-domain data

MA Martínez-Ramírez, WH Liao, G Fabbro… - arxiv preprint arxiv …, 2022 - arxiv.org
Music mixing traditionally involves recording instruments in the form of clean, individual
tracks and blending them into a final mixture using audio effects and expert knowledge (eg …

Source separation of piano concertos using musically motivated augmentation techniques

Y Özer, M Müller - IEEE/ACM Transactions on Audio, Speech …, 2024 - ieeexplore.ieee.org
In this work, we address the novel and rarely considered source separation task of
decomposing piano concerto recordings into separate piano and orchestral tracks. Being a …