[PDF] librosa: Audio and music signal analysis in Python.

B McFee, C Raffel, D Liang, DPW Ellis, M McVicar… - SciPy, 2015 - academia.edu
This document describes version 0.4.0 of librosa: a Python package for audio and music
signal processing. At a high level, librosa provides implementations of a variety of common …

Onsets and frames: Dual-objective piano transcription

C Hawthorne, E Elsen, J Song, A Roberts… - arXiv preprint arXiv …, 2017 - arxiv.org
We advance the state of the art in polyphonic piano music transcription by using a deep
convolutional and recurrent neural network which is trained to jointly predict onsets and …

Transfer learning for music classification and regression tasks

K Choi, G Fazekas, M Sandler, K Cho - arXiv preprint arXiv:1703.09179, 2017 - arxiv.org
In this paper, we present a transfer learning approach for music classification and regression
tasks. We propose to use a pre-trained convnet feature, a concatenated feature vector using …

[BOOK] Learning-based methods for comparing sequences, with applications to audio-to-MIDI alignment and matching

C Raffel - 2016 - search.proquest.com
Sequences of feature vectors are a natural way of representing temporal data. Given a
database of sequences, a fundamental task is to find the database entry which is the most …

Cultural transmission of vocal dialect in the naked mole-rat

AJ Barker, G Veviurko, NC Bennett, DW Hart… - Science, 2021 - science.org
Naked mole-rats (Heterocephalus glaber) form some of the most cooperative groups in the
animal kingdom, living in multigenerational colonies under the control of a single breeding …

A comparison of deep learning methods for environmental sound detection

J Li, W Dai, F Metze, S Qu, S Das - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Environmental sound detection is a challenging application of machine learning because of
the noisy nature of the signal, and the small amount of (labeled) data that is typically …

Stacked convolutional and recurrent neural networks for bird audio detection

S Adavanne, K Drossos, E Çakir… - 2017 25th European …, 2017 - ieeexplore.ieee.org
This paper studies the detection of bird calls in audio segments using stacked convolutional
and recurrent neural networks. Data augmentation by blocks mixing and domain adaptation …

Sound event detection in multichannel audio using spatial and harmonic features

S Adavanne, G Parascandolo, P Pertilä… - arXiv preprint arXiv …, 2017 - arxiv.org
In this paper, we propose the use of spatial and harmonic features in combination with long
short term memory (LSTM) recurrent neural network (RNN) for automatic sound event …

Smart music player integrating facial emotion recognition and music mood recommendation

S Gilda, H Zafar, C Soni… - … conference on wireless …, 2017 - ieeexplore.ieee.org
Songs, as a medium of expression, have always been a popular choice to depict and
understand human emotions. Reliable emotion based classification systems can go a long …

An interpretable deep learning model for automatic sound classification

P Zinemanas, M Rocamora, M Miron, F Font, X Serra - Electronics, 2021 - mdpi.com
Deep learning models have improved cutting-edge technologies in many research areas,
but their black-box structure makes it difficult to understand their inner workings and the …