Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback
While textless Spoken Language Models (SLMs) have shown potential in end-to-end
speech-to-speech modeling, they still lag behind text-based Large Language Models …
Data-Centric Improvements for Enhancing Multi-Modal Understanding in Spoken Conversation Modeling
Conversational assistants are increasingly popular across diverse real-world applications,
highlighting the need for advanced multimodal speech modeling. Speech, as a natural …