A comprehensive review on music transcription

B Bhattarai, J Lee - Applied Sciences, 2023 - mdpi.com
Music transcription is the process of transforming recorded sound of musical performances
into symbolic representations such as sheet music or MIDI files. Extensive research and …

Automatic piano transcription with hierarchical frequency-time transformer

K Toyama, T Akama, Y Ikemiya, Y Takida… - arXiv preprint arXiv …, 2023 - arxiv.org
Taking long-term spectral and temporal dependencies into account is essential for automatic
piano transcription. This is especially helpful when determining the precise onset and offset …

Beat transformer: Demixed beat and downbeat tracking with dilated self-attention

J Zhao, G Xia, Y Wang - arXiv preprint arXiv:2209.07140, 2022 - arxiv.org
We propose Beat Transformer, a novel Transformer encoder architecture for joint beat and
downbeat tracking. Different from previous models that track beats solely based on the …

Transformer-Based Beat Tracking With Low-Resolution Encoder and High-Resolution Decoder

T Cheng, M Goto - ISMIR, 2023 - staff.aist.go.jp
In this paper, we address the beat tracking task which is to predict beat times corresponding
to the input audio. Due to the long sequential inputs, it is still challenging to model the global …

TrOMR: Transformer-based polyphonic optical music recognition

Y Li, H Liu, Q Jin, M Cai, P Li - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Optical Music Recognition (OMR) is an important technology in music and has been
researched for a long time. Previous approaches for OMR are usually based on CNN for …

Piano transcription with harmonic attention

R Wu, X Wang, Y Li, W Xu… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Automatic Music Transcription (AMT) aims to convert music audio into digital sheet music.
Piano transcription is a popular but challenging subtask of AMT. For every piano pitch, the …

Weakly-supervised video anomaly detection via temporal resolution feature learning

S Peng, Y Cai, Z Yao, M Tan - Applied Intelligence, 2023 - Springer
Weakly supervised video anomaly detection (WS-VAD) is often formulated as a multiple
instance learning (MIL) problem. Snippet-level anomaly scores can be predicted using only …

Knowledge and data co-driven intelligent assessment of Chinese zither fingerings

W Zhao, S Wang, Y Zhao, J Wei, T Li - Displays, 2023 - Elsevier
The intelligent assessment of musical instrument fingerings can provide learners with timely
feedback to greatly improve learning efficiency and lay the foundation for distance teaching …

Fine-tuning music generation with reinforcement learning based on transformer

X Guo, H Xu, K Xu - … IEEE 16th International Conference on Anti …, 2022 - ieeexplore.ieee.org
Deep supervised learning is the most common way of automatically music generation.
However, this sort of model only learns probabilities from dataset, and such pattern does not …

A Two-Stage Audio-Visual Fusion Piano Transcription Model Based on the Attention Mechanism

Y Li, X Wang, R Wu, W Xu… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Piano transcription is a significant problem in the field of music information retrieval, aiming
to obtain symbolic representations of music from captured audio or visual signals. Previous …