Music deep learning: deep learning methods for music signal processing—a review of the state-of-the-art

L Moysis, LA Iliadis, SP Sotiroudis, AD Boursianis… - Ieee …, 2023 - ieeexplore.ieee.org
The discipline of Deep Learning has been recognized for its strong computational tools,
which have been extensively used in data and signal processing, with innumerable …

Conditioned-U-Net: Introducing a control mechanism in the U-Net for multiple source separations

G Meseguer-Brocal, G Peeters - arxiv preprint arxiv:1907.01277, 2019 - arxiv.org
Data-driven models for audio source separation such as U-Net or Wave-U-Net are usually
models dedicated to and specifically trained for a single task, eg a particular instrument …

Multitrack music transcription with a time-frequency perceiver

WT Lu, JC Wang, YN Hung - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Multitrack music transcription aims to transcribe a music audio input into the musical notes of
multiple instruments simultaneously. It is a very challenging task that typically requires a …

Score-informed source separation of choral music

M Gover - 2020 - escholarship.mcgill.ca
La séparation de sources sonores consiste à extraire une ou plusieurs sources présentant
un attrait significatif d'un enregistrement contenant plusieurs sources sonores. Ces …

SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning

M Delcroix, JB Vázquez, T Ochiai… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
In many situations, we would like to hear desired sound events (SEs) while being able to
ignore interference. Target sound extraction (TSE) tackles this problem by estimating the …

Hierarchic temporal convolutional network with cross-domain encoder for music source separation

Y Hu, Y Chen, W Yang, L He… - IEEE Signal Processing …, 2022 - ieeexplore.ieee.org
Recently, the time-domain-based methods (ie, the method of modeling the raw waveform
directly) for audio source separation have shown tremendous potential. In this paper, we …

An introduction to signal processing for singing-voice analysis: High notes in the effort to automate the understanding of vocals in music

EJ Humphrey, S Reddy, P Seetharaman… - IEEE Signal …, 2018 - ieeexplore.ieee.org
Humans have devised a vast array of musical instruments, but the most prevalent instrument
remains the human voice. Thus, techniques for applying audio signal processing methods to …

An educational guide through the FMP notebooks for teaching and learning fundamentals of music processing

M Müller - Signals, 2021 - mdpi.com
This paper provides a guide through the FMP notebooks, a comprehensive collection of
educational material for teaching and learning fundamentals of music processing (FMP) with …

Deep learning approaches in topics of singing information processing

C Gupta, H Li, M Goto - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
Singing, the vocal productionof musical tones, is one of the most important elements of
music. Addressing the needs of real-world applications, the study of technologies related to …

Dilated convolution with dilated GRU for music source separation

JY Liu, YH Yang - arxiv preprint arxiv:1906.01203, 2019 - arxiv.org
Stacked dilated convolutions used in Wavenet have been shown effective for generating
high-quality audios. By replacing pooling/striding with dilation in convolution layers, they can …