Randomly weighted cnns for (music) audio classification

J Pons, X Serra - … 2019-2019 IEEE international conference on …, 2019 - ieeexplore.ieee.org
The computer vision literature shows that randomly weighted neural networks perform
reasonably as feature extractors. Following this idea, we study how non-trained (randomly …

Deep learning approaches in topics of singing information processing

C Gupta, H Li, M Goto - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
Singing, the vocal productionof musical tones, is one of the most important elements of
music. Addressing the needs of real-world applications, the study of technologies related to …

[PDF][PDF] Automatic Pronunciation Evaluation of Singing.

C Gupta, H Li, Y Wang - Interspeech, 2018 - isca-archive.org
In this work, we develop a strategy to automatically evaluate pronunciation of singing. We
apply singing-adapted automatic speech recognizer (ASR) in a two-stage approach for …

Score-informed networks for music performance assessment

J Huang, YN Hung, A Pati, SK Gururani… - arxiv preprint arxiv …, 2020 - arxiv.org
The assessment of music performances in most cases takes into account the underlying
musical score being performed. While there have been several automatic approaches for …

DeepDDK: A deep learning based oral-diadochokinesis analysis software

YY Wang, K Gao, AM Kloepper, Y Zhao… - 2019 IEEE EMBS …, 2019 - ieeexplore.ieee.org
Oromotor dysfunction caused by neurological disorders can result in significant speech and
swallowing impairments. Current diagnostic methods to assess oromotor function are …

Creating an a cappella singing audio dataset for automatic **gju singing evaluation research

R Gong, RC Repetto, X Serra - … of the 4th International Workshop on …, 2017 - dl.acm.org
The data-driven computational research on automatic **gju (also known as Bei**g or
Peking opera) singing evaluation lacks a suitable and comprehensive a cappella singing …

Towards reference-independent rhythm assessment of solo singing

C Gupta, J Li, H Li - 2021 Asia-Pacific Signal and Information …, 2021 - ieeexplore.ieee.org
Rhythm is an important aspect of singing in music information retrieval. From the principles
of music theory, note duration is related to the time signature of a song, therefore it provides …

End-to-end lyrics transcription informed by pitch and onset estimation

T Deng, E Nakamura, K Yoshii - Proceedings of the …, 2022 - repository.kulib.kyoto-u.ac.jp
This paper presents an automatic lyrics transcription (ALT) method for music recordings that
leverages the framewise semitone-level sung pitches estimated in a multi-task learning …

[PDF][PDF] Deep neural networks for music and audio tagging

J Pons Puig - 2019 - jordipons.me
Automatic music and audio tagging can help increase the retrieval and re-use possibilities of
many audio databases that remain poorly labeled. In this dissertation, we tackle the task of …

Towards an efficient deep learning model for musical onset detection

R Gong, X Serra - arxiv preprint arxiv:1806.06773, 2018 - arxiv.org
In this paper, we propose an efficient and reproducible deep learning model for musical
onset detection (MOD). We first review the state-of-the-art deep learning models for MOD …