Stebėti
Guangtao Zeng
Guangtao Zeng
Patvirtintas el. paštas mymail.sutd.edu.sg
Pavadinimas
Cituota
Cituota
Metai
Tinyllama: An open-source small language model
P Zhang*, G Zeng*, T Wang, W Lu
arXiv preprint arXiv:2401.02385, 2024
3092024
MedDialog: Large-scale medical dialogue datasets
G Zeng, W Yang, Z Ju, Y Yang, S Wang, R Zhang, M Zhou, J Zeng, ...
Proceedings of the 2020 conference on empirical methods in natural language …, 2020
1942020
Long context transfer from language to vision
P Zhang, K Zhang, B Li, G Zeng, J Yang, Y Zhang, Z Wang, H Tan, C Li, ...
arXiv preprint arXiv:2406.16852, 2024
822024
On the generation of medical dialogs for COVID-19
M Zhou, Z Li, B Tan, G Zeng, W Yang, X He, Z Ju, S Chakravorty, S Chen, ...
Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021
60*2021
Towards a mechanistic interpretation of multi-step reasoning capabilities of language models
Y Hou, J Li, Y Fei, A Stolfo, W Zhou, G Zeng, A Bosselut, M Sachan
arXiv preprint arXiv:2310.14491, 2023
272023
One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning
G Zeng*, P Zhang*, W Lu
arXiv preprint arXiv:2305.17682, 2023
222023
Regmix: Data mixture as regression for language model pre-training
Q Liu, X Zheng, N Muennighoff, G Zeng, L Dou, T Pang, J Jiang, M Lin
arXiv preprint arXiv:2407.01492, 2024
182024
Sailor: Open language models for south-east asia
L Dou, Q Liu, G Zeng, J Guo, J Zhou, W Lu, M Lin
arXiv preprint arXiv:2404.03608, 2024
182024
Beta-rec: Build, evaluate and tune automated recommender systems
Z Meng, R McCreadie, C Macdonald, I Ounis, S Liu, Y Wu, X Wang, ...
Proceedings of the 14th ACM Conference on Recommender Systems, 588-590, 2020
162020
Mhpp: Exploring the capabilities and limitations of language models beyond basic code generation
J Dai, J Lu, Y Feng, D Huang, G Zeng, R Ruan, M Cheng, H Tan, Z Guo
arXiv preprint arXiv:2405.11430, 2024
62024
Unsupervised non-transferable text classification
G Zeng, W Lu
arXiv preprint arXiv:2210.12651, 2022
42022
Scaling up Masked Diffusion Models on Text
S Nie, F Zhu, C Du, T Pang, Q Liu, G Zeng, M Lin, C Li
arXiv preprint arXiv:2410.18514, 2024
32024
Effi-code: Unleashing code efficiency in language models
D Huang, G Zeng, J Dai, M Luo, H Weng, Y Qing, H Cui, Z Guo, JM Zhang
arXiv preprint arXiv:2410.10209, 2024
32024
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
M Shen, G Zeng, Z Qi, ZW Hong, Z Chen, W Lu, G Wornell, S Das, D Cox, ...
arXiv preprint arXiv:2502.02508, 2025
12025
SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages
J Guo, L Dou, G Zeng, S Kok, W Lu, Q Liu
arXiv preprint arXiv:2412.01186, 2024
12024
Sistema negali atlikti operacijos. Bandykite vėliau dar kartą.
Straipsniai 1–15