A squeeze-and-excitation and transformer-based cross-task model for environmental sound recognition

J Bai, J Chen, M Wang, MS Ayub… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Environmental sound recognition (ESR) is an emerging research topic in audio pattern
recognition. Many tasks are presented to resort to computational models for ESR in real-life …

How robust are audio embeddings for polyphonic sound event tagging?

J Abeßer, S Grollmisch, M Müller - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
Sound classification algorithms are challenged by the natural variability of everyday sounds,
particularly for large sound class taxonomies. In order to be applicable in real-life …

Multimodal urban sound tagging with spatiotemporal context

J Bai, J Chen, M Wang - IEEE Transactions on Cognitive and …, 2022 - ieeexplore.ieee.org
Noise pollution significantly affects our daily life and urban development. Urban sound
tagging (UST) has attracted much attention recently, which aims to analyze and monitor …

A strongly-labelled polyphonic dataset of urban sounds with spatiotemporal context

K Ooi, KN Watcharasupat, S Peksi… - 2021 Asia-Pacific …, 2021 - ieeexplore.ieee.org
This paper introduces SINGA:PURA, a strongly labelled polyphonic urban sound dataset
with spatiotemporal context. The data were collected via several recording units deployed …

A squeeze-and-excitation and transformer based cross-task system for environmental sound recognition

J Bai, J Chen, M Wang, MS Ayub - arXiv preprint arXiv:2203.08350, 2022 - arxiv.org
Environmental sound recognition (ESR) is an emerging research topic in audio pattern
recognition. Many tasks are presented to resort to computational models for ESR in real-life …

[CITATION][C] Improving an urban noise tagging system based on gated convolutional neural networks

鄭蕙心