Voice conversion in high-order eigen space using deep belief nets. T Nakashika, R Takashima, T Takiguchi, Y Ariki Interspeech, 369-372, 2013 | 164 | 2013 |
Voice conversion using RNN pre-trained by recurrent temporal restricted Boltzmann machines T Nakashika, T Takiguchi, Y Ariki IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (3), 580-587, 2014 | 101 | 2014 |
Non-parallel training in voice conversion using an adaptive restricted Boltzmann machine T Nakashika, T Takiguchi, Y Minami IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (11 …, 2016 | 81 | 2016 |
High-order sequence modeling using speaker-dependent recurrent temporal restricted Boltzmann machines for voice conversion. T Nakashika, T Takiguchi, Y Ariki Interspeech, 2278-2282, 2014 | 70 | 2014 |
Voice conversion based on non-negative matrix factorization using phoneme-categorized dictionary R Aihara, T Nakashika, T Takiguchi, Y Ariki 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 47 | 2014 |
Voice conversion based on speaker-dependent restricted Boltzmann machines T Nakashika, T Takiguchi, Y Ariki IEICE TRANSACTIONS on Information and Systems 97 (6), 1403-1410, 2014 | 41 | 2014 |
Local-feature-map Integration Using Convolutional Neural Networks for Music Genre Classification. T Nakashika, C Garcia, T Takiguchi Interspeech, 1752-1755, 2012 | 40 | 2012 |
Dysarthric speech recognition using a convolutive bottleneck network T Nakashika, T Yoshioka, T Takiguchi, Y Ariki, S Duffner, C Garcia 2014 12th International Conference on Signal Processing (ICSP), 505-509, 2014 | 35 | 2014 |
Feature extraction using pre-trained convolutive bottleneck nets for dysarthric speech recognition Y Takashima, T Nakashika, T Takiguchi, Y Ariki 2015 23rd European signal processing conference (EUSIPCO), 1411-1415, 2015 | 28 | 2015 |
Voice conversion using speaker-dependent conditional restricted Boltzmann machine T Nakashika, T Takiguchi, Y Ariki EURASIP Journal on Audio, Speech, and Music Processing 2015, 1-12, 2015 | 26 | 2015 |
STFT spectral loss for training a neural speech waveform model S Takaki, T Nakashika, X Wang, J Yamagishi ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 22 | 2019 |
Content-based image retrieval using rotation-invariant histograms of oriented gradients J Chen, T Nakashika, T Takiguchi, Y Ariki Proceedings of the 5th ACM on International Conference on Multimedia …, 2015 | 18 | 2015 |
High-frequency restoration using deep belief nets for super-resolution T Nakashika, T Takiguchi, Y Ariki 2013 International Conference on Signal-Image Technology & Internet-Based …, 2013 | 14 | 2013 |
Complex-valued restricted Boltzmann machine for direct learning of frequency spectra T Nakashika, S Takaki, J Yamagishi Interspeech 2017, 4021-4025, 2017 | 13 | 2017 |
Convolutive bottleneck network with dropout for dysarthric speech recognition T Nakashika, T Yoshioka, T Takiguchi, Y Ariki, S Duffner, C Garcia Transactions on Machine Learning and Artificial Intelligence 2, 1-15, 2014 | 12 | 2014 |
Parallel-data-free many-to-many voice conversion using an adaptive restricted Boltzmann machine T Nakashika, T Takiguchi, Y Ariki MLSLP 2015, 1-4, 2015 | 11 | 2015 |
Error correction of automatic speech recognition based on normalized web distance. E Byambakhishig, K Tanaka, R Aihara, T Nakashika, T Takiguchi, Y Ariki Interspeech, 2852-2856, 2014 | 11 | 2014 |
Complex-Valued Variational Autoencoder: A Novel Deep Generative Model for Direct Representation of Complex Spectra. T Nakashika Interspeech, 2002-2006, 2020 | 8 | 2020 |
Small-parallel exemplar-based voice conversion in noisy environments using affine non-negative matrix factorization R Aihara, T Fujii, T Nakashika, T Takiguchi, Y Ariki EURASIP Journal on Audio, Speech, and Music Processing 2015, 1-9, 2015 | 8 | 2015 |
Voice conversion in time-invariant speaker-independent space T Nakashika, T Takiguchi, Y Ariki 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 8 | 2014 |