Automatic music transcription: An overview

E Benetos, S Dixon, Z Duan… - IEEE Signal Processing …, 2018 - ieeexplore.ieee.org
The capability of transcribing music audio into music notation is a fascinating example of
human intelligence. It involves perception (analyzing complex auditory scenes), cognition …

A review of automatic drum transcription

CW Wu, C Dittmar, C Southall, R Vogl… - … on Audio, Speech …, 2018 - ieeexplore.ieee.org
In Western popular music, drums and percussion are an important means to emphasize and
shape the rhythm, often defining the musical style. If computers were able to analyze the …

Origins of music in credible signaling

SA Mehr, MM Krasnow, GA Bryant… - Behavioral and Brain …, 2021 - cambridge.org
Music comprises a diverse category of cognitive phenomena that likely represent both the
effects of psychological adaptations that are specific to music (eg, rhythmic entrainment) and …

Onsets and frames: Dual-objective piano transcription

C Hawthorne, E Elsen, J Song, A Roberts… - arxiv preprint arxiv …, 2017 - arxiv.org
We advance the state of the art in polyphonic piano music transcription by using a deep
convolutional and recurrent neural network which is trained to jointly predict onsets and …

Internet of musical things: Vision and challenges

L Turchet, C Fischione, G Essl, D Keller… - Ieee access, 2018 - ieeexplore.ieee.org
The Internet of Musical Things (IoMusT) is an emerging research field positioned at the
intersection of Internet of Things, new interfaces for musical expression, ubiquitous music …

[HTML][HTML] Speech emotion recognition using fusion of three multi-task learning-based classifiers: HSF-DNN, MS-CNN and LLD-RNN

Z Yao, Z Wang, W Liu, Y Liu, J Pan - Speech Communication, 2020 - Elsevier
Speech emotion recognition plays an increasingly important role in emotional computing
and is still a challenging task due to its complexity. In this study, we developed a framework …

High-resolution piano transcription with pedals by regressing onset and offset times

Q Kong, B Li, X Song, Y Wan… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Automatic music transcription (AMT) is the task of transcribing audio recordings into
symbolic representations. Recently, neural network-based methods have been applied to …

Learning features of music from scratch

J Thickstun, Z Harchaoui, S Kakade - arxiv preprint arxiv:1611.09827, 2016 - arxiv.org
This paper introduces a new large-scale music dataset, MusicNet, to serve as a source of
supervision and evaluation of machine learning methods for music research. MusicNet …

Novel audio features for music emotion recognition

R Panda, R Malheiro, RP Paiva - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
This work advances the music emotion recognition state-of-the-art by proposing novel
emotionally-relevant audio features. We reviewed the existing audio features implemented …

nnaudio: An on-the-fly gpu audio to spectrogram conversion toolbox using 1d convolutional neural networks

KW Cheuk, H Anderson, K Agres, D Herremans - IEEE Access, 2020 - ieeexplore.ieee.org
In this paper, we present nnAudio, a new neural network-based audio processing framework
with graphics processing unit (GPU) support that leverages 1D convolutional neural …