[HTML][HTML] A comprehensive review on music transcription

B Bhattarai, J Lee - Applied Sciences, 2023 - mdpi.com
Music transcription is the process of transforming recorded sound of musical performances
into symbolic representations such as sheet music or MIDI files. Extensive research and …

SpecTNT: A time-frequency transformer for music audio

WT Lu, JC Wang, M Won, K Choi, X Song - arxiv preprint arxiv …, 2021 - arxiv.org
Transformers have drawn attention in the MIR field for their remarkable performance shown
in natural language processing and computer vision. However, prior works in the audio …

[HTML][HTML] Joint detection and classification of singing voice melody using convolutional recurrent neural networks

S Kum, J Nam - Applied Sciences, 2019 - mdpi.com
Singing melody extraction essentially involves two tasks: one is detecting the activity of a
singing voice in polyphonic music, and the other is estimating the pitch of a singing voice in …

Multi-instrument automatic music transcription with self-attention-based instance segmentation

YT Wu, B Chen, L Su - IEEE/ACM Transactions on Audio …, 2020 - ieeexplore.ieee.org
Multi-instrument automatic music transcription (AMT) is a critical but less investigated
problem in the field of music information retrieval (MIR). With all the difficulties faced by …

A streamlined encoder/decoder architecture for melody extraction

TH Hsieh, L Su, YH Yang - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
Melody extraction in polyphonic musical audio is important for music signal processing. In
this paper, we propose a novel streamlined encoder/decoder network that is designed for …

MCSSME: multi-task contrastive learning for semi-supervised singing melody extraction from polyphonic music

S Yu - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Singing melody extraction is an important task in the field of music information retrieval
(MIR). The development of data-driven models for this task have achieved great successes …

Polyphonic music transcription with semantic segmentation

YT Wu, B Chen, L Su - ICASSP 2019-2019 IEEE International …, 2019 - ieeexplore.ieee.org
The multi-instrument transcription task refers to joint recognition of instrument and pitch of
every event in polyphonic music signals generated by one or more classes of music …

Pseudo-label transfer from frame-level to note-level in a teacher-student framework for singing transcription from polyphonic music

S Kum, J Lee, KL Kim, T Kim… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Lack of large-scale note-level labeled data is the major obstacle to singing transcription from
polyphonic music. We address the issue by using pseudo labels from vocal pitch estimation …

Vocal melody extraction via hrnet-based singing voice separation and encoder-decoder-based f0 estimation

Y Gao, X Zhang, W Li - Electronics, 2021 - mdpi.com
Vocal melody extraction is an important and challenging task in music information retrieval.
One main difficulty is that, most of the time, various instruments and singing voices are mixed …

Learning stage-wise gans for whistle extraction in time-frequency spectrograms

P Li, MA Roch, H Klinck, E Fleishman… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Whistle contour extraction aims to derive animal whistles from time-frequency spectrograms
as polylines. For toothed whales, whistle extraction results can serve as the basis for …