A survey of music visualization techniques

HB Lima, CGRD Santos, BS Meiguins - ACM Computing Surveys (CSUR …, 2021 - dl.acm.org
Music Information Research (MIR) comprises all the research topics involved in modeling
and understanding music. Visualizations are frequently adopted to convey better …

Speaker gender recognition based on deep neural networks and ResNet50

AA Alnuaim, M Zakariah, C Shashidhar… - Wireless …, 2022 - Wiley Online Library
Several speaker recognition algorithms failed to get the best results because of the wildly
varying datasets and feature sets for classification. Gender information helps reduce this …

End-to-end learning for music audio tagging at scale

J Pons, O Nieto, M Prockup, E Schmidt… - arxiv preprint arxiv …, 2017 - arxiv.org
The lack of data tends to limit the outcomes of deep learning research, particularly when
dealing with end-to-end learning stacks processing raw data such as waveforms. In this …

Musical instrument identification using deep learning approach

M Blaszke, B Kostek - Sensors, 2022 - mdpi.com
The work aims to propose a novel approach for automatically identifying all instruments
present in an audio excerpt using sets of individual convolutional neural networks (CNNs) …

SampleCNN: End-to-end deep convolutional neural networks using very small filters for music classification

J Lee, J Park, KL Kim, J Nam - Applied Sciences, 2018 - mdpi.com
Convolutional Neural Networks (CNN) have been applied to diverse machine learning tasks
for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw …

Spectrogram based multi-task audio classification

Y Zeng, H Mao, D Peng, Z Yi - Multimedia Tools and Applications, 2019 - Springer
Audio classification is regarded as a great challenge in pattern recognition. Although audio
classification tasks are always treated as independent tasks, tasks are essentially related to …

Deep learning for audio-based music classification and tagging: Teaching computers to distinguish rock from bach

J Nam, K Choi, J Lee, SY Chou… - IEEE signal processing …, 2018 - ieeexplore.ieee.org
Over the last decade, music-streaming services have grown dramatically. Pandora, one
company in the field, has pioneered and popularized streaming music by successfully …

Slow-fast auditory streams for audio recognition

E Kazakos, A Nagrani, A Zisserman… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
We propose a two-stream convolutional network for audio recognition, that operates on time-
frequency spectrogram inputs. Following similar success in visual recognition, we learn …

Upsampling artifacts in neural audio synthesis

J Pons, S Pascual, G Cengarle… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
A number of recent advances in neural audio synthesis rely on up-sampling layers, which
can introduce undesired artifacts. In computer vision, upsampling artifacts have been …

Sample-level CNN architectures for music auto-tagging using raw waveforms

T Kim, J Lee, J Nam - 2018 IEEE international conference on …, 2018 - ieeexplore.ieee.org
Recent work has shown that the end-to-end approach using convolutional neural network
(CNN) is effective in various types of machine learning tasks. For audio signals, the …