Content-based music information retrieval (cb-mir) and its applications toward the music industry: A review

YVS Murthy, SG Koolagudi - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
A huge increase in the number of digital music tracks has created the necessity to develop
an automated tool to extract the useful information from these tracks. As this information has …

[PDF][PDF] Exploring data augmentation for improved singing voice detection with neural networks.

J Schlüter, T Grill - ISMIR, 2015 - ofai.at
In computer vision, state-of-the-art object recognition systems rely on label-preserving image
transformations such as scaling and rotation to augment the training datasets. The additional …

[PDF][PDF] Local interpretable model-agnostic explanations for music content analysis.

S Mishra, BL Sturm, S Dixon - ISMIR, 2017 - archives.ismir.net
The interpretability of a machine learning model is essential for gaining insight into model
behaviour. While some machine learning models (eg, decision trees) are transparent, the …

Singing voice detection: a survey

R Monir, D Kostrzewa, D Mrozek - Entropy, 2022 - mdpi.com
Singing voice detection or vocal detection is a classification task that determines whether
there is a singing voice in a given audio segment. This process is a crucial preprocessing …

Dali: A large dataset of synchronized audio, lyrics and notes, automatically created using teacher-student machine learning paradigm

G Meseguer-Brocal, A Cohen-Hadria… - arxiv preprint arxiv …, 2019 - arxiv.org
The goal of this paper is twofold. First, we introduce DALI, a large and rich multimodal
dataset containing 5358 audio tracks with their time-aligned vocal melody notes and lyrics at …

[PDF][PDF] Learning to Pinpoint Singing Voice from Weakly Labeled Examples.

J Schlüter - ISMIR, 2016 - ofai.at
Building an instrument detector usually requires temporally accurate ground truth that is
expensive to create. However, song-wise information on the presence of instruments is often …

Research on singing voice detection based on a long-term recurrent convolutional network with vocal separation and temporal smoothing

X Zhang, Y Yu, Y Gao, X Chen, W Li - Electronics, 2020 - mdpi.com
Singing voice detection or vocal detection is a classification task that determines whether a
given audio segment contains singing voices. This task plays a very important role in vocal …

On the reduction of false positives in singing voice detection

B Lehner, G Widmer… - 2014 IEEE international …, 2014 - ieeexplore.ieee.org
Motivated by the observation that one of the biggest problems in automatic singing voice
detection is the confusion of vocals with other pitch-continuous and pitch-varying …

Singing voice separation using a deep convolutional neural network trained by ideal binary mask and cross entropy

KWE Lin, BT Balamurali, E Koh, S Lui… - Neural Computing and …, 2020 - Springer
Separating a singing voice from its music accompaniment remains an important challenge in
the field of music information retrieval. We present a unique neural network approach …

A low-latency, real-time-capable singing voice detection method with LSTM recurrent neural networks

B Lehner, G Widmer, S Bock - 2015 23rd European signal …, 2015 - ieeexplore.ieee.org
Singing voice detection aims at identifying the regions in a music recording where at least
one person sings. This is a challenging problem that cannot be solved without analysing the …