Automatic music transcription: An overview

E Benetos, S Dixon, Z Duan… - IEEE Signal Processing …, 2018 - ieeexplore.ieee.org
The capability of transcribing music audio into music notation is a fascinating example of
human intelligence. It involves perception (analyzing complex auditory scenes), cognition …

[BOOK][B] Audio source separation and speech enhancement

E Vincent, T Virtanen, S Gannot - 2018 - books.google.com
Learn the technology behind hearing aids, Siri, and Echo. Audio source separation and
speech enhancement aim to extract one or more source signals of interest from an audio …

A Comprehensive Review on Music Transcription

B Bhattarai, J Lee - Applied Sciences, 2023 - mdpi.com
Music transcription is the process of transforming recorded sound of musical performances
into symbolic representations such as sheet music or MIDI files. Extensive research and …

[PDF][PDF] Deep Salience Representations for F0 Estimation in Polyphonic Music.

RM Bittner, B McFee, J Salamon, P Li, JP Bello - ISMIR, 2017 - nemo.yonsei.ac.kr
PowerPoint presentation, page 1. Management Science Laboratory, 이태헌, 2023.07.09. Deep salience
representations for F0 estimation in polyphonic music. Bittner, Rachel M., et al. "Deep Salience …

Multi-instrument automatic music transcription with self-attention-based instance segmentation

YT Wu, B Chen, L Su - IEEE/ACM Transactions on Audio …, 2020 - ieeexplore.ieee.org
Multi-instrument automatic music transcription (AMT) is a critical but less investigated
problem in the field of music information retrieval (MIR). With all the difficulties faced by …

SpecTNT: A time-frequency transformer for music audio

WT Lu, JC Wang, M Won, K Choi, X Song - arXiv preprint arXiv …, 2021 - arxiv.org
Transformers have drawn attention in the MIR field for their remarkable performance shown
in natural language processing and computer vision. However, prior works in the audio …

Wave-shape function analysis: When cepstrum meets time–frequency analysis

CY Lin, L Su, HT Wu - Journal of Fourier Analysis and Applications, 2018 - Springer
We propose to combine cepstrum and nonlinear time–frequency (TF) analysis to study
multiple component oscillatory signals with time-varying frequency and amplitude and with …

A streamlined encoder/decoder architecture for melody extraction

TH Hsieh, L Su, YH Yang - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
Melody extraction in polyphonic musical audio is important for music signal processing. In
this paper, we propose a novel streamlined encoder/decoder network that is designed for …

Omnizart: A general toolbox for automatic music transcription

YT Wu, YJ Luo, TP Chen, I Wei, JY Hsu… - arXiv preprint arXiv …, 2021 - arxiv.org
We present and release Omnizart, a new Python library that provides a streamlined solution
to automatic music transcription (AMT). Omnizart encompasses modules that construct the …

Vocal melody extraction using patch-based CNN

L Su - 2018 IEEE international conference on acoustics …, 2018 - ieeexplore.ieee.org
A patch-based convolutional neural network (CNN) model presented in this paper for vocal
melody extraction in polyphonic music is inspired by object detection in image processing …