Byol for audio: Self-supervised learning for general-purpose audio representation D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino 2021 International Joint Conference on Neural Networks (IJCNN), 1-8, 2021 | 196 | 2021 |
ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions N Harada, D Niizumi, D Takeuchi, Y Ohishi, M Yasuda, S Saito arXiv preprint arXiv:2106.02369, 2021 | 183 | 2021 |
Speech enhancement using self-adaptation and multi-head self-attention Y Koizumi, K Yatabe, M Delcroix, Y Masuyama, D Takeuchi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 156 | 2020 |
First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline N Harada, D Niizumi, Y Ohishi, D Takeuchi, M Yasuda 2023 31st European Signal Processing Conference (EUSIPCO), 191-195, 2023 | 72 | 2023 |
Masked spectrogram modeling using masked autoencoders for learning general-purpose audio representation D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino HEAR: Holistic Evaluation of Audio Representations, 1-24, 2022 | 62 | 2022 |
Real-time speech enhancement using equilibriated RNN D Takeuchi, K Yatabe, Y Koizumi, Y Oikawa, N Harada ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 61 | 2020 |
BYOL for audio: Exploring pre-trained general-purpose audio representations D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 137-151, 2022 | 57 | 2022 |
Audio captioning using pre-trained large-scale language model guided by audio-based similar caption retrieval Y Koizumi, Y Ohishi, D Niizumi, D Takeuchi, M Yasuda arXiv preprint arXiv:2012.07331, 2020 | 47 | 2020 |
Masked modeling duo: Learning representations by encouraging both networks to model the input D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 32 | 2023 |
Effects of word-frequency based pre-and post-processings for audio captioning D Takeuchi, Y Koizumi, Y Ohishi, N Harada, K Kashino arXiv preprint arXiv:2009.11436, 2020 | 31 | 2020 |
The NTT DCASE2020 challenge task 6 system: Automated audio captioning with keywords and sentence length estimation Y Koizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino arXiv preprint arXiv:2007.00225, 2020 | 30 | 2020 |
Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement D Takeuchi, K Yatabe, Y Koizumi, Y Oikawa, N Harada ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 21 | 2019 |
Source directivity approximation for finite-difference time-domain simulation by estimating initial value D Takeuchi, K Yatabe, Y Oikawa The Journal of the Acoustical Society of America 145 (4), 2638-2649, 2019 | 21 | 2019 |
Conceptbeam: Concept driven target speech extraction Y Ohishi, M Delcroix, T Ochiai, S Araki, D Takeuchi, D Niizumi, A Kimura, ... Proceedings of the 30th ACM International Conference on Multimedia, 4252-4260, 2022 | 19 | 2022 |
Invertible DNN-based nonlinear time-frequency transform for speech enhancement D Takeuchi, K Yatabe, Y Koizumi, Y Oikawa, N Harada ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 15 | 2020 |
Effect of spectrogram resolution on deep-neural-network-based speech enhancement D Takeuchi, K Yatabe, Y Koizumi, Y Oikawa, N Harada Acoustical Science and Technology 41 (5), 769-775, 2020 | 11 | 2020 |
Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 9 | 2024 |
Audio difference captioning utilizing similarity-discrepancy disentanglement D Takeuchi, Y Ohishi, D Niizumi, N Harada, K Kashino arXiv preprint arXiv:2308.11923, 2023 | 9 | 2023 |
Composing general audio representation by fusing multilayer features of a pre-trained model D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino 2022 30th European Signal Processing Conference (EUSIPCO), 200-204, 2022 | 8 | 2022 |
Parametric approximation of piano sound based on Kautz model with sparse linear prediction K Kobayashi, D Takeuchi, M Iwamoto, K Yatabe, Y Oikawa 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 8 | 2018 |