FragmentVC: Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention. YY Lin, CM Chien, JH Lin, H Lee, L Lee. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 86 | 2021 |
Investigating on incorporating pretrained and learnable speaker representations for multi-speaker multi-style text-to-speech. CM Chien, JH Lin, C Huang, P Hsu, H Lee. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 84 | 2021 |
S2VC: A framework for any-to-any voice conversion with self-supervised pretrained representations. J Lin, YY Lin, CM Chien, H Lee. arXiv preprint arXiv:2104.02901, 2021 | 71 | 2021 |
Hierarchical prosody modeling for non-autoregressive speech synthesis. CM Chien, H Lee. 2021 IEEE Spoken Language Technology Workshop (SLT), 446-453, 2021 | 37 | 2021 |
What do self-supervised speech models know about words? A Pasad, CM Chien, S Settle, K Livescu. Transactions of the Association for Computational Linguistics 12, 372-391, 2024 | 28 | 2024 |
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module. A Gabryś, G Huybrechts, MS Ribeiro, CM Chien, J Roth, G Comini, ... ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 25 | 2022 |
Toward joint language modeling for speech units and text. JC Chou, CM Chien, WN Hsu, K Livescu, A Babu, A Conneau, A Baevski, ... arXiv preprint arXiv:2310.08715, 2023 | 15 | 2023 |
On the Evaluation of Speech Foundation Models for Spoken Language Understanding. S Arora, A Pasad, CM Chien, J Han, R Sharma, J Jung, H Dhamyal, ... arXiv preprint arXiv:2406.10083, 2024 | 6 | 2024 |
AV2WAV: Diffusion-Based Re-Synthesis from Continuous Self-Supervised Features for Audio-Visual Speech Enhancement. JC Chou, CM Chien, K Livescu. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Few-Shot Spoken Language Understanding Via Joint Speech-Text Models. CM Chien, M Zhang, JC Chou, K Livescu. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 4 | 2023 |
Dynamic-SUPERB Phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks. C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ... arXiv preprint arXiv:2411.05361, 2024 | 3 | 2024 |
Learning Fine-Grained Controllability on Speech Generation via Efficient Fine-Tuning. CM Chien, A Tjandra, A Vyas, M Le, B Shi, WN Hsu. arXiv preprint arXiv:2406.06251, 2024 | 1 | 2024 |