The GTZAN dataset: Its contents, its faults, their effects on evaluation, and its future use

BL Sturm - arXiv preprint arXiv:1306.1461, 2013 - arxiv.org
The GTZAN dataset appears in at least 100 published works, and is the most-used public
dataset for evaluation in machine listening research for music genre recognition (MGR). Our …

A survey of evaluation in music genre recognition

BL Sturm - International Workshop on Adaptive Multimedia …, 2012 - Springer
Much work is focused upon music genre recognition (MGR) from audio recordings, symbolic
data, and other modalities. While reviews have been written of some of this work before, no …

Low-Rank Representation of Both Singing Voice and Music Accompaniment Via Learned Dictionaries.

YH Yang - ISMIR, 2013 - ismir2013.ismir.net
Recent research work has shown that the magnitude spectrogram of a song can be
considered as a superposition of a low-rank component and a sparse component, which …

A systematic evaluation of the bag-of-frames representation for music information retrieval

L Su, CCM Yeh, JY Liu, JC Wang… - IEEE Transactions on …, 2014 - ieeexplore.ieee.org
There has been an increasing attention on learning feature representations from the
complex, high-dimensional audio data applied in various music information retrieval (MIR) …

Codebook-based audio feature representation for music information retrieval

Y Vaizman, B McFee, G Lanckriet - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
Digital music has become prolific in the web in recent decades. Automated recommendation
systems are essential for users to discover music they love and for artists to reach …

A deep neural network for modeling music

P Zhang, X Zheng, W Zhang, S Li, S Qian… - Proceedings of the 5th …, 2015 - dl.acm.org
We propose a convolutional neural network architecture with k-max pooling layer for
semantic modeling of music. The aim of a music model is to analyze and represent the …

Classificaçao automática de textos por meio de aprendizado de máquina baseado em redes

RG Rossi - 2015 - bdtd.ibict.br
Nowadays a massive amount of textual data is produced and stored daily
in the form of e-mails, reports, articles, and posts on networks …

Predicting the Genre and Rating of a Movie Based on its Synopsis

V Battu, V Batchu, RRR Gangula… - Proceedings of the …, 2018 - aclanthology.org
Movies are one of the most prominent means of entertainment. The widespread use of the
Internet in recent times has led to large volumes of data related to movies being generated …

Dual-layer bag-of-frames model for music genre classification

CCM Yeh, L Su, YH Yang - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
This paper concerns the development of a music dictionary-based model for summarizing
local feature descriptors computed over time. Comparing to a holistic representation, this text …

Sparse cepstral codes and power scale for instrument identification

LF Yu, L Su, YH Yang - 2014 IEEE International Conference on …, 2014 - ieeexplore.ieee.org
This paper presents a novel feature representation called sparse cepstral codes for
instrument identification. We first motivate the approach by discussing why cepstrum is …