A tutorial on deep learning for music information retrieval

K Choi, G Fazekas, K Cho, M Sandler - arXiv preprint arXiv:1709.04396, 2017 - arxiv.org
Following their success in Computer Vision and other areas, deep learning techniques have
recently become widely adopted in Music Information Retrieval (MIR) research. However …

Deep learning for audio signal processing

H Purwins, B Li, T Virtanen, J Schlüter… - IEEE Journal of …, 2019 - ieeexplore.ieee.org
Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …

CREPE: A convolutional representation for pitch estimation

JW Kim, J Salamon, P Li… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
The task of estimating the fundamental frequency of a monophonic sound recording, also
known as pitch tracking, is fundamental to audio processing with multiple applications in …

A Comprehensive Review on Music Transcription

B Bhattarai, J Lee - Applied Sciences, 2023 - mdpi.com
Music transcription is the process of transforming recorded sound of musical performances
into symbolic representations such as sheet music or MIDI files. Extensive research and …

High-resolution piano transcription with pedals by regressing onset and offset times

Q Kong, B Li, X Song, Y Wan… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Automatic music transcription (AMT) is the task of transcribing audio recordings into
symbolic representations. Recently, neural network-based methods have been applied to …

SampleCNN: End-to-end deep convolutional neural networks using very small filters for music classification

J Lee, J Park, KL Kim, J Nam - Applied Sciences, 2018 - mdpi.com
Convolutional Neural Networks (CNN) have been applied to diverse machine learning tasks
for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw …

Evaluation of CNN-based automatic music tagging models

M Won, A Ferraro, D Bogdanov, X Serra - arXiv preprint arXiv:2006.00751, 2020 - arxiv.org
Recent advances in deep learning accelerated the development of content-based automatic
music tagging systems. Music information retrieval (MIR) researchers proposed various …

GuitarSet: A Dataset for Guitar Transcription

Q Xi, RM Bittner, J Pauwels, X Ye, JP Bello - ISMIR, 2018 - ismir2018.ismir.net
The guitar is a popular instrument for a variety of reasons, including its ability to produce
polyphonic sound and its musical versatility. The resulting variability of sounds, however …

Data-driven harmonic filters for audio representation learning

M Won, S Chun, O Nieto, X Serra - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
We introduce a trainable front-end module for audio representation learning that exploits the
inherent harmonic structure of audio signals. The proposed architecture, composed of a set …

Multi-instrument automatic music transcription with self-attention-based instance segmentation

YT Wu, B Chen, L Su - IEEE/ACM Transactions on Audio …, 2020 - ieeexplore.ieee.org
Multi-instrument automatic music transcription (AMT) is a critical but less investigated
problem in the field of music information retrieval (MIR). With all the difficulties faced by …