Automatic chord estimation from audio: A review of the state of the art

M McVicar, R Santos-Rodríguez, Y Ni… - … /ACM Transactions on …, 2014 - ieeexplore.ieee.org
In this overview article, we review research on the task of Automatic Chord Estimation (ACE).
The major contributions from the last 14 years of research are summarized, with detailed …

Schubert Winterreise dataset: A multimodal scenario for music analysis

C Weiß, F Zalkow, V Arifi-Müller, M Müller… - Journal on Computing …, 2021 - dl.acm.org
This article presents a multimodal dataset comprising various representations and
annotations of Franz Schubert's song cycle Winterreise. Schubert's seminal work constitutes …

Automatic lyrics alignment and transcription in polyphonic music: Does background music help?

C Gupta, E Yılmaz, H Li - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Automatic lyrics alignment and transcription in polyphonic music are challenging tasks
because the singing vocals are corrupted by the background music. In this work, we propose …

Transfer learning of wav2vec 2.0 for automatic lyric transcription

L Ou, X Gu, Y Wang - arXiv preprint arXiv:2207.09747, 2022 - arxiv.org
Automatic speech recognition (ASR) has progressed significantly in recent years due to the
emergence of large-scale datasets and the self-supervised learning (SSL) paradigm …

Phoneme level lyrics alignment and text-informed singing voice separation

K Schulze-Forster, CSJ Doire… - … /ACM Transactions on …, 2021 - ieeexplore.ieee.org
The goal of singing voice separation is to recover the vocals signal from music mixtures.
State-of-the-art performance is achieved by deep neural networks trained in a supervised …

An introduction to signal processing for singing-voice analysis: High notes in the effort to automate the understanding of vocals in music

EJ Humphrey, S Reddy, P Seetharaman… - IEEE Signal …, 2018 - ieeexplore.ieee.org
Humans have devised a vast array of musical instruments, but the most prevalent instrument
remains the human voice. Thus, techniques for applying audio signal processing methods to …

Deep learning approaches in topics of singing information processing

C Gupta, H Li, M Goto - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
Singing, the vocal production of musical tones, is one of the most important elements of
music. Addressing the needs of real-world applications, the study of technologies related to …

MSTRE-Net: Multistreaming acoustic modeling for automatic lyrics transcription

E Demirel, S Ahlbäck, S Dixon - arXiv preprint arXiv:2108.02625, 2021 - arxiv.org
This paper makes several contributions to automatic lyrics transcription (ALT) research. Our
main contribution is a novel variant of the Multistreaming Time-Delay Neural Network …

Multilingual lyrics-to-audio alignment

A Vaglio, R Hennequin, M Moussallam… - International Society for …, 2020 - hal.science
Lyrics-to-audio alignment methods have recently reported impressive results, opening the
door to practical applications such as karaoke and within song navigation. However, most …

Automatic lyrics-to-audio alignment on polyphonic music using singing-adapted acoustic models

B Sharma, C Gupta, H Li, Y Wang - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
Lyrics-to-audio alignment is to automatically align the lyrical words with the mixed singing
audio (singing voice + musical accompaniment). Such alignment can be achieved with an …