MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation

C Liu, H Wang, J Zhao, S Zhao, H Bu, X Xu… - arxiv preprint arxiv …, 2025 - arxiv.org
The technology for generating music from textual descriptions has seen rapid
advancements. However, evaluating text-to-music (TTM) systems remains a significant …

FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching

H Wang, S Liu, L Meng, J Li, Y Yang, S Zhao… - arxiv preprint arxiv …, 2025 - arxiv.org
To advance continuous-valued token modeling and temporal-coherence enforcement, we
propose FELLE, an autoregressive model that integrates language modeling with token …