Content-based music information retrieval (cb-mir) and its applications toward the music industry: A review
A huge increase in the number of digital music tracks has created the necessity to develop
an automated tool to extract the useful information from these tracks. As this information has …
an automated tool to extract the useful information from these tracks. As this information has …
[PDF][PDF] Exploring data augmentation for improved singing voice detection with neural networks.
In computer vision, state-of-the-art object recognition systems rely on label-preserving image
transformations such as scaling and rotation to augment the training datasets. The additional …
transformations such as scaling and rotation to augment the training datasets. The additional …
[PDF][PDF] Local interpretable model-agnostic explanations for music content analysis.
The interpretability of a machine learning model is essential for gaining insight into model
behaviour. While some machine learning models (eg, decision trees) are transparent, the …
behaviour. While some machine learning models (eg, decision trees) are transparent, the …
Singing voice detection: a survey
Singing voice detection or vocal detection is a classification task that determines whether
there is a singing voice in a given audio segment. This process is a crucial preprocessing …
there is a singing voice in a given audio segment. This process is a crucial preprocessing …
Dali: A large dataset of synchronized audio, lyrics and notes, automatically created using teacher-student machine learning paradigm
The goal of this paper is twofold. First, we introduce DALI, a large and rich multimodal
dataset containing 5358 audio tracks with their time-aligned vocal melody notes and lyrics at …
dataset containing 5358 audio tracks with their time-aligned vocal melody notes and lyrics at …
[PDF][PDF] Learning to Pinpoint Singing Voice from Weakly Labeled Examples.
J Schlüter - ISMIR, 2016 - ofai.at
Building an instrument detector usually requires temporally accurate ground truth that is
expensive to create. However, song-wise information on the presence of instruments is often …
expensive to create. However, song-wise information on the presence of instruments is often …
Research on singing voice detection based on a long-term recurrent convolutional network with vocal separation and temporal smoothing
Singing voice detection or vocal detection is a classification task that determines whether a
given audio segment contains singing voices. This task plays a very important role in vocal …
given audio segment contains singing voices. This task plays a very important role in vocal …
On the reduction of false positives in singing voice detection
Motivated by the observation that one of the biggest problems in automatic singing voice
detection is the confusion of vocals with other pitch-continuous and pitch-varying …
detection is the confusion of vocals with other pitch-continuous and pitch-varying …
Singing voice separation using a deep convolutional neural network trained by ideal binary mask and cross entropy
Separating a singing voice from its music accompaniment remains an important challenge in
the field of music information retrieval. We present a unique neural network approach …
the field of music information retrieval. We present a unique neural network approach …
A low-latency, real-time-capable singing voice detection method with LSTM recurrent neural networks
Singing voice detection aims at identifying the regions in a music recording where at least
one person sings. This is a challenging problem that cannot be solved without analysing the …
one person sings. This is a challenging problem that cannot be solved without analysing the …