UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection

Y **ao, RK Das - arxiv preprint arxiv:2407.03657, 2024 - arxiv.org
This work explores class-incremental learning (CIL) for sound event detection (SED),
advancing adaptability towards real-world scenarios. CIL's success in domains like …

Tf-mamba: A time-frequency network for sound source localization

Y **ao, RK Das - arxiv preprint arxiv:2409.05034, 2024 - arxiv.org
Sound source localization (SSL) determines the position of sound sources using multi-
channel audio data. It is commonly used to improve speech enhancement and separation …

Mixstyle based Domain Generalization for Sound Event Detection with Heterogeneous Training Data

Y **ao, H Yin, J Bai, RK Das - arxiv preprint arxiv:2407.03654, 2024 - arxiv.org
This work explores domain generalization (DG) for sound event detection (SED), advancing
adaptability towards real-world scenarios. Our approach employs a mean-teacher …

Advancing Continual Learning for Robust Deepfake Audio Classification

F Dong, Q Tang, Y Bai, Z Wang - arxiv preprint arxiv:2407.10108, 2024 - arxiv.org
The emergence of new spoofing attacks poses an increasing challenge to audio security.
Current detection methods often falter when faced with unseen spoofing attacks. Traditional …

Class-Incremental Learning for Sound Event Localization and Detection

R Pandey, M Mulimani, A Politis, A Mesaros - arxiv preprint arxiv …, 2024 - arxiv.org
This paper investigates the feasibility of class-incremental learning (CIL) for Sound Event
Localization and Detection (SELD) tasks. The method features an incremental learner that …

Dark Experience for Incremental Keyword Spotting

T Peng, Y **ao - arxiv preprint arxiv:2409.08153, 2024 - arxiv.org
Spoken keyword spotting (KWS) is crucial for identifying keywords within audio inputs and is
widely used in applications like Apple Siri and Google Home, particularly on edge devices …