Damai Dai

引用次数

	总计	2020 年至今
引用	3951	3933
h 指数	19	18
i10 指数	26	26

2700

1350

675

2025

201920202021202220232024202511 29 64 130 741 2625 332

开放获取的出版物数量

查看全部

10 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Baobao CHANGPeking University在 pku.edu.cn 的电子邮件经过验证
Xu SunAssociate Professor, Peking University在 pku.edu.cn 的电子邮件经过验证
Qingxiu DongPeking University在 stu.pku.edu.cn 的电子邮件经过验证
Tianyu LiuAlibaba在 pku.edu.cn 的电子邮件经过验证
Li DongMicrosoft Research在 microsoft.com 的电子邮件经过验证
Furu WeiPartner Research Manager, Microsoft Research在 microsoft.com 的电子邮件经过验证
Shuming MaMicrosoft Research Asia在 microsoft.com 的电子邮件经过验证
Peiyi WangPeking University在 stu.pku.edu.cn 的电子邮件经过验证
Fuli Luo（罗福莉）Peking University在 pku.edu.cn 的电子邮件经过验证
Wei LiBeijing Language and Culture University在 blcu.edu.cn 的电子邮件经过验证

关注

Damai Dai

其他姓名代达劢

Peking University, DeepSeek AI

在 pku.edu.cn 的电子邮件经过验证

Deep Learning Natural Language Processing Large Language Model Mixture-of-Experts


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
A survey on in-context learning Q Dong, L Li, D Dai, C Zheng, J Ma, R Li, H Xia, J Xu, Z Wu, B Chang, ... Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024	1480	2024
Knowledge neurons in pretrained transformers D Dai, L Dong, Y Hao, Z Sui, C Baobao, F Wei Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	527	2022
Why can GPT learn in-context? language models implicitly perform gradient descent as meta-optimizers D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei Findings of the Association for Computational Linguistics: ACL 2023, 4005-4019, 2023	381*	2023
Deepseek llm: Scaling open-source language models with longtermism X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024	199	2024
Math-shepherd: Verify and reinforce llms step-by-step without human annotations P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	148*	2024
Deepseekmoe: Towards ultimate expert specialization in mixture-of-experts language models D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ... Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	143	2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ... arXiv preprint arXiv:2406.11931, 2024	122*	2024
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning L Wang, L Li, D Dai, D Chen, H Zhou, F Meng, J Zhou, X Sun (EMNLP 2023 Best Long Paper) Proceedings of the 2023 Conference on Empirical …, 2023	120	2023
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model DeepSeek-AI, A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Dengr, ... arXiv preprint arXiv:2405.04434, 2024	115	2024
Calibrating Factual Knowledge in Pretrained Language Models Q Dong, D Dai, Y Song, J Xu, Z Sui, L Li Findings of the Association for Computational Linguistics: EMNLP 2022, 2022	113	2022
On the representation collapse of sparse mixture of experts Z Chi, L Dong, S Huang, D Dai, S Ma, B Patra, S Singhal, P Bajaj, X Song, ... Advances in Neural Information Processing Systems 35, 34600-34613, 2022	83	2022
Preliminary study on the construction of Chinese medical knowledge graph O Byambasuren, Y Yang, Z Sui, D Dai, B Chang, S Li, H Zan Journal of Chinese Information Processing 33 (10), 1-9, 2019	81*	2019
Livebot: Generating live video comments based on visual and textual contexts S Ma, L Cui, D Dai, F Wei, X Sun Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6810-6817, 2019	68	2019
Learning to control the fine-grained sentiment for story ending generation F Luo, D Dai, P Yang, T Liu, B Chang, Z Sui, X Sun Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019	64	2019
StableMoE: Stable Routing Strategy for Mixture of Experts D Dai, L Dong, S Ma, B Zheng, Z Sui, B Chang, F Wei Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	59	2022
Deepseek-v3 technical report A Liu, B Feng, B Xue, B Wang, B Wu, C Lu, C Zhao, C Deng, C Zhang, ... arXiv preprint arXiv:2412.19437, 2024	25	2024
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions D Dai, H Zheng, F Luo, P Yang, T Liu, Z Sui, B Chang Proceedings of the 6th ACL Workshop on Representation Learning for NLP …, 2021	20	2021
Sememe prediction: Learning semantic knowledge from unstructured textual wiki descriptions W Li, X Ren, D Dai, Y Wu, H Wang, X Sun arXiv preprint arXiv:1808.05437, 2018	20	2018
Hierarchical Curriculum Learning for AMR Parsing P Wang, L Chen, T Liu, D Dai, Y Cao, B Chang, Z Sui Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	19	2022
Behind the scenes: An exploration of trigger biases problem in few-shot event classification P Wang, R Xun, T Liu, D Dai, B Chang, Z Sui Proceedings of the 30th ACM International Conference on Information …, 2021	17	2021

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

关注

引用次数

合著作者