フォロー
Parthasaarathy Sudarsanam
Parthasaarathy Sudarsanam
Tampere University
確認したメール アドレス: tuni.fi
タイトル
引用先
引用先
Recursive speech separation for unknown number of speakers
N Takahashi, S Parthasaarathy, N Goswami, Y Mitsufuji
arXiv preprint arXiv:1904.03065, 2019
1042019
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
A Politis, K Shimada, P Sudarsanam, S Adavanne, D Krause, Y Koyama, ...
arXiv preprint arXiv:2206.01948, 2022
892022
Clotho-aqa: A crowdsourced dataset for audio question answering
S Lipping, P Sudarsanam, K Drossos, T Virtanen
2022 30th European Signal Processing Conference (EUSIPCO), 1140-1144, 2022
542022
STARSS23: An audio-visual dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events
K Shimada, A Politis, P Sudarsanam, DA Krause, K Uchida, S Adavanne, ...
Advances in Neural Information Processing Systems 36, 2024
402024
Improving voice separation by incorporating end-to-end speech recognition
N Takahashi, MK Singh, S Basak, P Sudarsanam, S Ganapathy, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
252020
Assessment of self-attention on learned features for sound event localization and detection
P Sudarsanam, A Politis, K Drossos
arXiv preprint arXiv:2107.09388, 2021
202021
STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023
A Politis, K Shimada, P Sudarsanam, A Hakala, S Takahashi, DA Krause, ...
Mar, 2023
62023
Attention-Based Methods For Audio Question Answering
P Sudarsanam, T Virtanen
2023 31st European Signal Processing Conference (EUSIPCO), 750-754, 2023
32023
Baseline models and evaluation of sound event localization and detection with distance estimation in DCASE 2024 Challenge
DDG Aparicio, A Politis, PA Sudarsanam, K Shimada, D Krause, K Uchida, ...
Workshop on Detection and Classification of Acoustic Scenes and Events, 41-45, 2024
2024
AVCAPS: AN AUDIO-VISUAL DATASET WITH MODALITY-SPECIFIC CAPTIONS
P Sudarsanam, I Martín-Morató, A Hakala, T Virtanen
Toward an Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events Kazuki Shimada1, Archontis Politis2, Parthasaarathy …
K Shimada, A Politis, P Sudarsanam, D Krause, N Takahashi, ...
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–11