Daiki Takeuchi
NTT Communication Science Laboratories
Verified email at hco.ntt.co.jp
Title / Cited by / Year
BYOL for audio: Self-supervised learning for general-purpose audio representation
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
2021 International Joint Conference on Neural Networks (IJCNN), 1-8, 2021
Cited by: 196; Year: 2021
ToyADMOS2: Another dataset of miniature-machine operating sounds for anomalous sound detection under domain shift conditions
N Harada, D Niizumi, D Takeuchi, Y Ohishi, M Yasuda, S Saito
arXiv preprint arXiv:2106.02369, 2021
Cited by: 183; Year: 2021
Speech enhancement using self-adaptation and multi-head self-attention
Y Koizumi, K Yatabe, M Delcroix, Y Masuyama, D Takeuchi
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 156; Year: 2020
First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline
N Harada, D Niizumi, Y Ohishi, D Takeuchi, M Yasuda
2023 31st European Signal Processing Conference (EUSIPCO), 191-195, 2023
Cited by: 72; Year: 2023
Masked spectrogram modeling using masked autoencoders for learning general-purpose audio representation
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
HEAR: Holistic Evaluation of Audio Representations, 1-24, 2022
Cited by: 62; Year: 2022
Real-time speech enhancement using equilibriated RNN
D Takeuchi, K Yatabe, Y Koizumi, Y Oikawa, N Harada
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 61; Year: 2020
BYOL for audio: Exploring pre-trained general-purpose audio representations
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 137-151, 2022
Cited by: 57; Year: 2022
Audio captioning using pre-trained large-scale language model guided by audio-based similar caption retrieval
Y Koizumi, Y Ohishi, D Niizumi, D Takeuchi, M Yasuda
arXiv preprint arXiv:2012.07331, 2020
Cited by: 47; Year: 2020
Masked modeling duo: Learning representations by encouraging both networks to model the input
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 32; Year: 2023
Effects of word-frequency based pre- and post-processings for audio captioning
D Takeuchi, Y Koizumi, Y Ohishi, N Harada, K Kashino
arXiv preprint arXiv:2009.11436, 2020
Cited by: 31; Year: 2020
The NTT DCASE2020 challenge task 6 system: Automated audio captioning with keywords and sentence length estimation
Y Koizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
arXiv preprint arXiv:2007.00225, 2020
Cited by: 30; Year: 2020
Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement
D Takeuchi, K Yatabe, Y Koizumi, Y Oikawa, N Harada
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 21; Year: 2019
Source directivity approximation for finite-difference time-domain simulation by estimating initial value
D Takeuchi, K Yatabe, Y Oikawa
The Journal of the Acoustical Society of America 145 (4), 2638-2649, 2019
Cited by: 21; Year: 2019
ConceptBeam: Concept driven target speech extraction
Y Ohishi, M Delcroix, T Ochiai, S Araki, D Takeuchi, D Niizumi, A Kimura, ...
Proceedings of the 30th ACM International Conference on Multimedia, 4252-4260, 2022
Cited by: 19; Year: 2022
Invertible DNN-based nonlinear time-frequency transform for speech enhancement
D Takeuchi, K Yatabe, Y Koizumi, Y Oikawa, N Harada
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 15; Year: 2020
Effect of spectrogram resolution on deep-neural-network-based speech enhancement
D Takeuchi, K Yatabe, Y Koizumi, Y Oikawa, N Harada
Acoustical Science and Technology 41 (5), 769-775, 2020
Cited by: 11; Year: 2020
Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 9; Year: 2024
Audio difference captioning utilizing similarity-discrepancy disentanglement
D Takeuchi, Y Ohishi, D Niizumi, N Harada, K Kashino
arXiv preprint arXiv:2308.11923, 2023
Cited by: 9; Year: 2023
Composing general audio representation by fusing multilayer features of a pre-trained model
D Niizumi, D Takeuchi, Y Ohishi, N Harada, K Kashino
2022 30th European Signal Processing Conference (EUSIPCO), 200-204, 2022
Cited by: 8; Year: 2022
Parametric approximation of piano sound based on Kautz model with sparse linear prediction
K Kobayashi, D Takeuchi, M Iwamoto, K Yatabe, Y Oikawa
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 8; Year: 2018