ACCDOA: Activity-coupled cartesian direction of arrival representation for sound event localization and detection K Shimada, Y Koyama, N Takahashi, S Takahashi, Y Mitsufuji ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021 | 113 | 2021 |
Multi-accdoa: Localizing and detecting overlapping sounds from the same class with auxiliary duplicating permutation invariant training K Shimada, Y Koyama, S Takahashi, N Takahashi, E Tsunoo, Y Mitsufuji ICASSP 2022-2022 IEEE international conference on acoustics, speech and …, 2022 | 92 | 2022 |
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events A Politis, K Shimada, P Sudarsanam, S Adavanne, D Krause, Y Koyama, ... arXiv preprint arXiv:2206.01948, 2022 | 89 | 2022 |
Unsupervised speech enhancement based on multichannel NMF-informed beamforming for noise-robust automatic speech recognition K Shimada, Y Bando, M Mimura, K Itoyama, K Yoshii, T Kawahara IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (5), 960-971, 2019 | 65 | 2019 |
STARSS23: An audio-visual dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events K Shimada, A Politis, P Sudarsanam, DA Krause, K Uchida, S Adavanne, ... Advances in Neural Information Processing Systems 36, 2024 | 40 | 2024 |
Ensemble of ACCDOA-and EINV2-based systems with D3Nets and impulse response simulation for sound event localization and detection K Shimada, N Takahashi, Y Koyama, S Takahashi, E Tsunoo, ... arXiv preprint arXiv:2106.10806, 2021 | 29 | 2021 |
Metric learning with background noise class for few-shot detection of rare sound events K Shimada, Y Koyama, A Inoue ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 28 | 2020 |
Sound event localization and detection using activity-coupled cartesian DOA vector and RD3Net K Shimada, N Takahashi, S Takahashi, Y Mitsufuji arXiv preprint arXiv:2006.12014, 2020 | 21 | 2020 |
Unsupervised beamforming based on multichannel nonnegative matrix factorization for noisy speech recognition K Shimada, Y Bando, M Mimura, K Itoyama, K Yoshii, T Kawahara 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 17 | 2018 |
Diffusion-based speech enhancement with joint generative and predictive decoders H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 16 | 2024 |
An attention-based approach to hierarchical multi-label music instrument classification Z Zhong, M Hirano, K Shimada, K Tateishi, S Takahashi, Y Mitsufuji ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 15 | 2023 |
Spatial data augmentation with simulated room impulse responses for sound event localization and detection Y Koyama, K Shigemi, M Takahashi, K Shimada, N Takahashi, E Tsunoo, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 14 | 2022 |
Hq-vae: Hierarchical discrete representation learning with variational bayes Y Takida, Y Ikemiya, T Shibuya, K Shimada, W Choi, CH Lai, N Murata, ... arXiv preprint arXiv:2401.00365, 2023 | 9 | 2023 |
Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection R Falcón-Pérez, K Shimada, Y Koyama, S Takahashi, Y Mitsufuji ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 7 | 2022 |
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023 A Politis, K Shimada, P Sudarsanam, A Hakala, S Takahashi, DA Krause, ... Mar, 2023 | 6 | 2023 |
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition. M Mimura, Y Bando, K Shimada, S Sakai, K Yoshii, T Kawahara INTERSPEECH, 2451-2455, 2017 | 5 | 2017 |
Zero-and few-shot sound event localization and detection K Shimada, K Uchida, Y Koyama, T Shibuya, S Takahashi, Y Mitsufuji, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Extending audio masked autoencoders toward audio restoration Z Zhong, H Shi, M Hirano, K Shimada, K Tateishi, T Shibuya, S Takahashi, ... 2023 IEEE Workshop on Applications of Signal Processing to Audio and …, 2023 | 4 | 2023 |
Diffusion-based signal refiner for speech separation M Hirano, K Shimada, Y Koyama, S Takahashi, Y Mitsufuji arXiv preprint arXiv:2305.05857, 2023 | 3 | 2023 |
CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation Y Chen, K Shimada, C Simon, Y Ikemiya, T Shibuya, Y Mitsufuji arXiv preprint arXiv:2501.02786, 2025 | | 2025 |