Folgen
Ruibin Yuan
Ruibin Yuan
HKUST
Bestätigte E-Mail-Adresse bei andrew.cmu.edu
Titel
Zitiert von
Zitiert von
Jahr
Mmmu: A massive multi-discipline multimodal understanding and reasoning benchmark for expert agi
X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ...
CVPR 2024 Best Paper Nomination, 2023
5422023
Mert: Acoustic music understanding model with large-scale self-supervised training
Y Li, R Yuan, G Zhang, Y Ma, X Chen, H Yin, C Lin, A Ragni, E Benetos, ...
ICLR 2024, 2023
1022023
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
J Zhan, J Dai, J Ye, Y Zhou, D Zhang, Z Liu, X Zhang, R Yuan, G Zhang, ...
ACL 2024, 2024
872024
Rq-rag: Learning to refine queries for retrieval augmented generation
CM Chan, C Xu, R Yuan, H Luo, W Xue, Y Guo, J Fu
COLM 2024, 2024
432024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ...
ACL 2024, 2024
342024
Chinese open instruction generalist: A preliminary release
G Zhang, Y Shi, R Liu, R Yuan, Y Li, S Dong, Y Shu, Z Li, Z Wang, C Lin, ...
arXiv preprint arXiv:2304.07987, 2023
28*2023
Map-neo: Highly capable and transparent bilingual large language model series
G Zhang, S Qu, J Liu, C Zhang, C Lin, CL Yu, D Pan, E Cheng, J Liu, ...
arXiv preprint arXiv:2405.19327, 2024
242024
Map-music2vec: A simple and effective baseline for self-supervised music audio representation learning
Y Li, R Yuan, G Zhang, Y Ma, C Lin, X Chen, A Ragni, H Yin, Z Hu, H He, ...
arXiv preprint arXiv:2212.02508, 2022
242022
Marble: Music audio representation benchmark for universal evaluation
R Yuan, Y Ma, Y Li, G Zhang, X Chen, H Yin, Y Liu, J Huang, Z Tian, ...
NeurIPS 2023, 2024
232024
Lyricwhiz: Robust multilingual lyrics transcription by whispering to chatgpt
L Zhuo, R Yuan, J Pan, Y Ma, Y Li, G Zhang, S Liu, R Dannenberg, J Fu, ...
ISMIR 2023, 2023
22*2023
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Q Deng, Q Yang, R Yuan, Y Huang, Y Wang, X Liu, Z Tian, J Pan, ...
ISMIR 2024, 2024
15*2024
LLMs Meet Multimodal Generation and Editing: A Survey
Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu, R Yuan, Y Xing, W Wang, ...
arXiv preprint arXiv:2405.19334, 2024
142024
Foundation models for music: A survey
Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis, C Donahue, C Lin, ...
arXiv preprint arXiv:2408.14340, 2024
122024
Chinese tiny llm: Pretraining a chinese-centric large language model
X Du, Z Yu, S Gao, D Pan, Y Cheng, Z Ma, R Yuan, X Qu, J Liu, T Zheng, ...
COLM 2024, 2024
122024
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
Y Bai, X Du, Y Liang, Y Jin, Z Liu, J Zhou, T Zheng, X Zhang, N Ma, ...
NAACL, 2024
102024
On the effectiveness of speech self-supervised learning for music
Y Ma, R Yuan, Y Li, G Zhang, X Chen, H Yin, C Lin, E Benetos, A Ragni, ...
ISMIR 2023, 2023
102023
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
Y Li, G Zhang, X Qu, J Li, Z Li, Z Wang, H Li, R Yuan, Y Ma, K Zhang, ...
ACL 2024, 2024
72024
Cmmmu: A chinese massive multi-discipline multimodal understanding benchmark
G Zhang, X Du, B Chen, Y Liang, T Luo, T Zheng, K Zhu, Y Cheng, C Xu, ...
arXiv preprint arXiv:2401.11944, 2024
72024
Omnibench: Towards the future of universal omni-language models
Y Li, G Zhang, Y Ma, R Yuan, K Zhu, H Guo, Y Liang, J Liu, Z Wang, ...
arXiv preprint arXiv:2409.15272, 2024
62024
Vidmuse: A simple video-to-music generation framework with long-short-term modeling
Z Tian, Z Liu, R Yuan, J Pan, Q Liu, X Tan, Q Chen, W Xue, Y Guo
arXiv preprint arXiv:2406.04321, 2024
62024
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20