STEMM: Self-learning with speech-text manifold mixup for speech translation Q Fang, R Ye, L Li, Y Feng, M Wang ACL 2022, 2022 | 99 | 2022 |
Bayling: Bridging cross-lingual alignment and instruction following through interactive translation for large language models S Zhang, Q Fang, Z Zhang, Z Ma, Y Zhou, L Huang, M Bu, S Gui, Y Chen, ... arXiv preprint arXiv:2306.10968, 2023 | 78* | 2023 |
Neural machine translation with phrase-level universal visual representations Q Fang, Y Feng ACL 2022, 2022 | 42 | 2022 |
Llama-omni: Seamless speech interaction with large language models Q Fang, S Guo, Y Zhou, Z Ma, S Zhang, Y Feng ICLR 2025, 2024 | 30 | 2024 |
CMOT: Cross-modal mixup via optimal transport for speech translation Y Zhou, Q Fang, Y Feng ACL 2023, 2023 | 25 | 2023 |
Understanding and bridging the modality gap for speech translation Q Fang, Y Feng ACL 2023, 2023 | 25 | 2023 |
StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning S Zhang, Q Fang, S Guo, Z Ma, M Zhang, Y Feng ACL 2024, 2024 | 12 | 2024 |
Back translation for speech-to-text translation without transcripts Q Fang, Y Feng ACL 2023, 2023 | 12 | 2023 |
A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation Z Ma, Q Fang, S Zhang, S Guo, Y Feng, M Zhang ACL 2024, 2024 | 9 | 2024 |
DASpeech: Directed acyclic transformer for fast and high-quality speech-to-speech translation Q Fang, Y Zhou, Y Feng NeurIPS 2023, 2023 | 8 | 2023 |
Bridging the gap between synthetic and authentic images for multimodal machine translation W Guo, Q Fang, D Yu, Y Feng EMNLP 2023, 2023 | 8 | 2023 |
Low-resource neural machine translation with cross-modal alignment Z Yang, Q Fang, Y Feng EMNLP 2022, 2022 | 5 | 2022 |
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token S Zhang, Q Fang, Z Yang, Y Feng ICLR 2025, 2025 | 1 | 2025 |
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? Q Fang, S Zhang, Z Ma, M Zhang, Y Feng ACL 2024, 2024 | 1 | 2024 |
CTC-based Non-autoregressive Textless Speech-to-Speech Translation Q Fang, Z Ma, Y Zhou, M Zhang, Y Feng ACL 2024, 2024 | 1 | 2024 |
BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment S Zhang, K Zhang, Q Fang, S Guo, Y Zhou, X Liu, Y Feng arXiv preprint arXiv:2411.16300, 2024 | | 2024 |
Beyond Language: Empowering Unsupervised Machine Translation with Cross-modal Alignment Z Yang, Q Fang, Y Feng | | |