ติดตาม
Naohiro Tawara
Naohiro Tawara
NTT Corporation
ยืนยันอีเมลแล้วที่ ieee.org
ชื่อ
อ้างโดย
อ้างโดย
ปี
Improving speaker discrimination of target speech extraction with time-domain speakerbeam
M Delcroix, T Ochiai, K Zmolikova, K Kinoshita, N Tawara, T Nakatani, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1362020
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds
K Kinoshita, M Delcroix, N Tawara
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1022021
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech
K Kinoshita, M Delcroix, N Tawara
arXiv preprint arXiv:2105.09040, 2021
652021
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder.
N Tawara, T Kobayashi, T Ogawa
Interspeech, 86-90, 2019
432019
Speaker invariant feature extraction for zero-resource languages with adversarial learning
T Tsuchiya, N Tawara, T Ogawa, T Kobayashi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
372018
Frame-level phoneme-invariant speaker embedding for text-independent speaker recognition on extremely short utterances
N Tawara, A Ogawa, T Iwata, M Delcroix, T Ogawa
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
332020
Age-vox-celeb: Multi-modal corpus for facial and speech estimation
N Tawara, A Ogawa, Y Kitagishi, H Kamiyama
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
282021
Language model domain adaptation via recurrent neural networks with domain-shared and domain-specific representations
T Moriokal, N Tawara, T Ogawa, A Ogawa, T Iwata, T Kobayashi
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
102018
Sequential fish catch forecasting using Bayesian state space models
Y Kokaki, N Tawara, T Kobayashi, K Hashimoto, T Ogawa
2018 24th International Conference on Pattern Recognition (ICPR), 776-781, 2018
92018
Ntt speaker diarization system for chime-7: multi-domain, multi-microphone end-to-end and vector clustering diarization
N Tawara, M Delcroix, A Ando, A Ogawa
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
82024
Speaker age estimation using age-dependent insensitive loss
Y Kitagishi, H Kamiyama, A Ando, N Tawara, T Mori, S Kobashikawa
2020 Asia-Pacific Signal and Information Processing Association Annual …, 2020
82020
A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions
N Tawara, T Ogawa, T Kobayashi
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
82015
Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering
N Tawara, T Ogawa, S Watanabe, T Kobayashi
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
82012
Multi-stream extension of variational Bayesian HMM clustering (MS-VBx) for combined end-to-end and vector clustering-based diarization
M Delcroix, N Tawara, M Diez, F Landini, A Silnova, A Ogawa, T Nakatani, ...
arXiv preprint arXiv:2305.13580, 2023
72023
NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge
N Kamo, N Tawara, A Ando, T Kano, H Sato, R Ikeshita, T Moriya, ...
arXiv preprint arXiv:2409.05554, 2024
62024
Blstm-based confidence estimation for end-to-end speech recognition
A Ogawa, N Tawara, T Kano, M Delcroix
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
62021
Noise-robust attention learning for end-to-end speech recognition
Y Higuchi, N Tawara, A Ogawa, T Iwata, T Kobayashi, T Ogawa
2020 28th European Signal Processing Conference (EUSIPCO), 311-315, 2021
62021
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages.
Y Higuchi, N Tawara, T Kobayashi, T Ogawa
INTERSPEECH, 266-270, 2019
62019
Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model.
N Tawara, S Watanabe, T Ogawa, T Kobayashi
INTERSPEECH, 2905-2908, 2011
62011
Language Model Data Augmentation Based on Text Domain Transfer.
A Ogawa, N Tawara, M Delcroix
INTERSPEECH, 4926-4930, 2020
52020
ระบบไม่สามารถดำเนินการได้ในขณะนี้ โปรดลองใหม่อีกครั้งในภายหลัง
บทความ 1–20