TinyLlama: An open-source small language model P Zhang*, G Zeng*, T Wang, W Lu arXiv preprint arXiv:2401.02385, 2024 | 272 | 2024 |
MedDialog: Large-scale medical dialogue datasets G Zeng, W Yang, Z Ju, Y Yang, S Wang, R Zhang, M Zhou, J Zeng, ... Proceedings of the 2020 conference on empirical methods in natural language …, 2020 | 189 | 2020 |
Long context transfer from language to vision P Zhang, K Zhang, B Li, G Zeng, J Yang, Y Zhang, Z Wang, H Tan, C Li, ... arXiv preprint arXiv:2406.16852, 2024 | 64 | 2024 |
On the generation of medical dialogs for COVID-19 M Zhou, Z Li, B Tan, G Zeng, W Yang, X He, Z Ju, S Chakravorty, S Chen, ... Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 58 | 2021 |
One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning G Zeng*, P Zhang*, W Lu arXiv preprint arXiv:2305.17682, 2023 | 22 | 2023 |
Towards a mechanistic interpretation of multi-step reasoning capabilities of language models Y Hou, J Li, Y Fei, A Stolfo, W Zhou, G Zeng, A Bosselut, M Sachan arXiv preprint arXiv:2310.14491, 2023 | 19 | 2023 |
Beta-rec: Build, evaluate and tune automated recommender systems Z Meng, R McCreadie, C Macdonald, I Ounis, S Liu, Y Wu, X Wang, ... Proceedings of the 14th ACM Conference on Recommender Systems, 588-590, 2020 | 15 | 2020 |
RegMix: Data mixture as regression for language model pre-training Q Liu, X Zheng, N Muennighoff, G Zeng, L Dou, T Pang, J Jiang, M Lin arXiv preprint arXiv:2407.01492, 2024 | 10 | 2024 |
Sailor: Open Language Models for South-East Asia L Dou, Q Liu, G Zeng, J Guo, J Zhou, W Lu, M Lin arXiv preprint arXiv:2404.03608, 2024 | 8 | 2024 |
MHPP: Exploring the capabilities and limitations of language models beyond basic code generation J Dai, J Lu, Y Feng, D Huang, G Zeng, R Ruan, M Cheng, H Tan, Z Guo arXiv preprint arXiv:2405.11430, 2024 | 7 | 2024 |
Unsupervised non-transferable text classification G Zeng, W Lu arXiv preprint arXiv:2210.12651, 2022 | 3 | 2022 |
Effi-Code: Unleashing Code Efficiency in Language Models D Huang, G Zeng, J Dai, M Luo, H Weng, Y Qing, H Cui, Z Guo, JM Zhang arXiv preprint arXiv:2410.10209, 2024 | 2 | 2024 |
SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages J Guo, L Dou, G Zeng, S Kok, W Lu, Q Liu arXiv preprint arXiv:2412.01186, 2024 | | 2024 |
Scaling up Masked Diffusion Models on Text S Nie, F Zhu, C Du, T Pang, Q Liu, G Zeng, M Lin, C Li arXiv preprint arXiv:2410.18514, 2024 | | 2024 |