- Academic Search

K Koutini, J Schlüter, H Eghbal-Zadeh… - arxiv preprint arxiv …, 2021 - arxiv.org

The great success of transformer-based models in natural language processing (NLP) has
led to various attempts at adapting these architectures to other domains such as vision and …

Save Cite Cited by 285 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] justinsalamon.com

Look, listen, and learn more: Design choices for deep audio embeddings

AL Cramer, HH Wu, J Salamon… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

A considerable challenge in applying deep learning to audio classification is the scarcity of
labeled data. An increasingly popular solution is to learn deep audio embeddings from large …

Save Cite Cited by 399 Related articles All 5 versions Free GPT-4

[Free GPT-4]

[PDF] ieee.org

Music deep learning: deep learning methods for music signal processing—a review of the state-of-the-art

L Moysis, LA Iliadis, SP Sotiroudis, AD Boursianis… - Ieee …, 2023 - ieeexplore.ieee.org

The discipline of Deep Learning has been recognized for its strong computational tools,
which have been extensively used in data and signal processing, with innumerable …

Save Cite Cited by 31 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] ieee.org

Strong labeling of sound events using crowdsourced weak labels and annotator competence estimation

I Martín-Morató, A Mesaros - IEEE/ACM transactions on audio …, 2023 - ieeexplore.ieee.org

Crowdsourcing is a popular tool for collecting large amounts of annotated data, but the
specific format of the strong labels necessary for sound event detection is not easily …

Save Cite Cited by 56 Related articles All 5 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Masked spectrogram prediction for self-supervised audio pre-training

D Chong, H Wang, P Zhou… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

Transformer-based models attain excellent results and generalize well when trained on
sufficient amounts of data. However, constrained by the limited data available in the audio …

Save Cite Cited by 61 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

K Koutini, H Eghbal-zadeh… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org

In this paper, we study the performance of variants of well-known Convolutional Neural
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive …

Save Cite Cited by 62 Related articles All 4 versions Free GPT-4

Multi-instrument automatic music transcription with self-attention-based instance segmentation

YT Wu, B Chen, L Su - IEEE/ACM Transactions on Audio …, 2020 - ieeexplore.ieee.org

Multi-instrument automatic music transcription (AMT) is a critical but less investigated
problem in the field of music information retrieval (MIR). With all the difficulties faced by …

Save Cite Cited by 68 Related articles All 4 versions Free GPT-4

On the application of deep learning and multifractal techniques to classify emotions and instruments using Indian Classical Music

S Nag, M Basu, S Sanyal, A Banerjee… - Physica A: Statistical …, 2022 - Elsevier

Music is often considered as the language of emotions. The way it stimulates the emotional
appraisal across people from different communities, culture and demographics has long …

Save Cite Cited by 36 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Training sound event detection with soft labels from crowdsourced annotations

I Martín-Morató, M Harju, P Ahokas… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

In this paper, we study the use of soft labels to train a system for sound event detection
(SED). Soft labels can result from annotations which account for human uncertainty about …

Save Cite Cited by 24 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

An attention mechanism for musical instrument recognition

S Gururani, M Sharma, A Lerch - arxiv preprint arxiv:1907.04294, 2019 - arxiv.org

While the automatic recognition of musical instruments has seen significant progress, the
task is still considered hard for music featuring multiple instruments as opposed to single …

Save Cite Cited by 68 Related articles All 4 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

OpenMIC-2018: An Open Data-set for Multiple Instrument Recognition.

Efficient training of audio transformers with patchout

Look, listen, and learn more: Design choices for deep audio embeddings

Music deep learning: deep learning methods for music signal processing—a review of the state-of-the-art

Strong labeling of sound events using crowdsourced weak labels and annotator competence estimation

Masked spectrogram prediction for self-supervised audio pre-training

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

Multi-instrument automatic music transcription with self-attention-based instance segmentation

On the application of deep learning and multifractal techniques to classify emotions and instruments using Indian Classical Music

Training sound event detection with soft labels from crowdsourced annotations

An attention mechanism for musical instrument recognition