[PDF] librosa: Audio and music signal analysis in python.
This document describes version 0.4.0 of librosa: a Python package for audio and music
signal processing. At a high level, librosa provides implementations of a variety of common …
Onsets and frames: Dual-objective piano transcription
We advance the state of the art in polyphonic piano music transcription by using a deep
convolutional and recurrent neural network which is trained to jointly predict onsets and …
Transfer learning for music classification and regression tasks
In this paper, we present a transfer learning approach for music classification and regression
tasks. We propose to use a pre-trained convnet feature, a concatenated feature vector using …
[BOOK] Learning-based methods for comparing sequences, with applications to audio-to-midi alignment and matching
C Raffel - 2016 - search.proquest.com
Sequences of feature vectors are a natural way of representing temporal data. Given a
database of sequences, a fundamental task is to find the database entry which is the most …
Cultural transmission of vocal dialect in the naked mole-rat
Naked mole-rats (Heterocephalus glaber) form some of the most cooperative groups in the
animal kingdom, living in multigenerational colonies under the control of a single breeding …
A comparison of deep learning methods for environmental sound detection
Environmental sound detection is a challenging application of machine learning because of
the noisy nature of the signal, and the small amount of (labeled) data that is typically …
Stacked convolutional and recurrent neural networks for bird audio detection
This paper studies the detection of bird calls in audio segments using stacked convolutional
and recurrent neural networks. Data augmentation by blocks mixing and domain adaptation …
Sound event detection in multichannel audio using spatial and harmonic features
In this paper, we propose the use of spatial and harmonic features in combination with long
short term memory (LSTM) recurrent neural network (RNN) for automatic sound event …
Smart music player integrating facial emotion recognition and music mood recommendation
Songs, as a medium of expression, have always been a popular choice to depict and
understand human emotions. Reliable emotion based classification systems can go a long …
An interpretable deep learning model for automatic sound classification
Deep learning models have improved cutting-edge technologies in many research areas,
but their black-box structure makes it difficult to understand their inner workings and the …