DiffusionBERT: Improving generative masked language models with diffusion models Z He, T Sun, K Wang, X Huang, X Qiu arXiv preprint arXiv:2211.15029, 2022 | 103 | 2022 |
BBTv2: Towards a Gradient-Free Future with Large Language Models T Sun, Z He, H Qian, Y Zhou, XJ Huang, X Qiu Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 95* | 2022 |
MOSS: An Open Conversational Large Language Model T Sun, X Zhang, Z He, P Li, Q Cheng, X Liu, H Yan, Y Shao, Q Tang, ... Machine Intelligence Research, 1-18, 2024 | 84* | 2024 |
Can AI assistants know what they don't know? Q Cheng, T Sun, X Liu, W Zhang, Z Yin, S Li, L Li, Z He, K Chen, X Qiu arXiv preprint arXiv:2401.13275, 2024 | 32 | 2024 |
Multitask pre-training of modular prompt for Chinese few-shot learning T Sun, Z He, Q Zhu, X Qiu, X Huang arXiv preprint arXiv:2210.07565, 2022 | 26 | 2022 |
Dictionary learning improves patch-free circuit discovery in mechanistic interpretability: A case study on Othello-GPT Z He, X Ge, Q Tang, T Sun, Q Cheng, X Qiu arXiv preprint arXiv:2402.12201, 2024 | 16* | 2024 |
Competition for gradient-free tuning of large language models: approaches, results, current challenges and future directions T Cao, L Chen, D Zhang, T Sun, Z He, X Qiu, X Xu, H Zhang National Science Review 10 (6), nwad124, 2023 | 4 | 2023 |
Llama Scope: Extracting millions of features from Llama-3.1-8B with sparse autoencoders Z He, W Shu, X Ge, L Chen, J Wang, Y Zhou, F Liu, Q Guo, X Huang, ... arXiv preprint arXiv:2410.20526, 2024 | 3 | 2024 |
Automatically Identifying Local and Global Circuits with Linear Computation Graphs X Ge, F Zhu, W Shu, J Wang, Z He, X Qiu arXiv preprint arXiv:2405.13868, 2024 | 3 | 2024 |
Towards Universality: Studying mechanistic similarity across language model architectures J Wang, X Ge, W Shu, Q Tang, Y Zhou, Z He, X Qiu arXiv preprint arXiv:2410.06672, 2024 | 2 | 2024 |
Generate Point Clouds with Multiscale Details from Graph-Represented Structures X Yang, Z He, C Jin arXiv preprint arXiv:2112.06433, 2021 | | 2021 |