Sledovat
Chung-Ming Chien
Název
Citace
Citace
Rok
Fragmentvc: Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
YY Lin, CM Chien, JH Lin, H Lee, L Lee
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
862021
Investigating on incorporating pretrained and learnable speaker representations for multi-speaker multi-style text-to-speech
CM Chien, JH Lin, C Huang, P Hsu, H Lee
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
842021
S2VC: A framework for any-to-any voice conversion with self-supervised pretrained representations
J Lin, YY Lin, CM Chien, H Lee
arXiv preprint arXiv:2104.02901, 2021
712021
Hierarchical prosody modeling for non-autoregressive speech synthesis
CM Chien, H Lee
2021 IEEE Spoken Language Technology Workshop (SLT), 446-453, 2021
372021
What do self-supervised speech models know about words?
A Pasad, CM Chien, S Settle, K Livescu
Transactions of the Association for Computational Linguistics 12, 372-391, 2024
282024
Voice filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module
A Gabryś, G Huybrechts, MS Ribeiro, CM Chien, J Roth, G Comini, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
252022
Toward joint language modeling for speech units and text
JC Chou, CM Chien, WN Hsu, K Livescu, A Babu, A Conneau, A Baevski, ...
arXiv preprint arXiv:2310.08715, 2023
152023
On the Evaluation of Speech Foundation Models for Spoken Language Understanding
S Arora, A Pasad, CM Chien, J Han, R Sharma, J Jung, H Dhamyal, ...
arXiv preprint arXiv:2406.10083, 2024
62024
AV2WAV: Diffusion-Based Re-Synthesis from Continuous Self-Supervised Features for Audio-Visual Speech Enhancement
JC Chou, CM Chien, K Livescu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Few-Shot Spoken Language Understanding Via Joint Speech-Text Models
CM Chien, M Zhang, JC Chou, K Livescu
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
42023
Dynamic-superb phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks
C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ...
arXiv preprint arXiv:2411.05361, 2024
32024
Learning Fine-Grained Controllability on Speech Generation via Efficient Fine-Tuning
CM Chien, A Tjandra, A Vyas, M Le, B Shi, WN Hsu
arXiv preprint arXiv:2406.06251, 2024
12024
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–12