Damai Dai

Cited by

	All	Since 2020
Citations	4579	4557
h-index	19	19
i10-index	30	30

Citations per year (chart): 2020: 29, 2021: 64, 2022: 132, 2023: 766, 2024: 2638, 2025: 907

Public access

10 articles available, 0 articles not available (based on funding mandates)

Co-authors

Baobao CHANG, Peking University. Verified email at pku.edu.cn
Xu Sun, Associate Professor, Peking University. Verified email at pku.edu.cn
Qingxiu Dong, Peking University. Verified email at stu.pku.edu.cn
Tianyu Liu, Alibaba. Verified email at pku.edu.cn
Li Dong, Microsoft Research. Verified email at microsoft.com
Furu Wei, Partner Research Manager, Microsoft Research. Verified email at microsoft.com
Peiyi Wang, Peking University. Verified email at stu.pku.edu.cn
Shuming Ma, Microsoft Research Asia. Verified email at microsoft.com
Fuli Luo（罗福莉）, Peking University. Verified email at pku.edu.cn
Wei Li, Beijing Language and Culture University. Verified email at blcu.edu.cn


Damai Dai

Other names: 代达劢

Peking University, DeepSeek AI

Verified email at pku.edu.cn

Deep Learning, Natural Language Processing, Large Language Model, Mixture-of-Experts


Title	Cited by	Year
A survey on in-context learning. Q Dong, L Li, D Dai, C Zheng, J Ma, R Li, H Xia, J Xu, Z Wu, B Chang, ... Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024	1554	2024
Knowledge neurons in pretrained transformers. D Dai, L Dong, Y Hao, Z Sui, B Chang, F Wei. Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	554	2022
Why can GPT learn in-context? Language models implicitly perform gradient descent as meta-optimizers. D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei. Findings of the Association for Computational Linguistics: ACL 2023, 4005-4019, 2023	405	2023
DeepSeek LLM: Scaling open-source language models with longtermism. X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024	231	2024
Math-Shepherd: Verify and reinforce LLMs step-by-step without human annotations. P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui. Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	203*	2024
DeepSeek-R1: Incentivizing reasoning capability in LLMs via reinforcement learning. D Guo, D Yang, H Zhang, J Song, R Zhang, R Xu, Q Zhu, S Ma, P Wang, ... arXiv preprint arXiv:2501.12948, 2025	180	2025
DeepSeekMoE: Towards ultimate expert specialization in mixture-of-experts language models. D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ... Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	176	2024
DeepSeek-Coder-V2: Breaking the barrier of closed-source models in code intelligence. Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ... arXiv preprint arXiv:2406.11931, 2024	161*	2024
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning. L Wang, L Li, D Dai, D Chen, H Zhou, F Meng, J Zhou, X Sun. (EMNLP 2023 Best Long Paper) Proceedings of the 2023 Conference on Empirical …, 2023	140	2023
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model. DeepSeek-AI, A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Deng, ... arXiv preprint arXiv:2405.04434, 2024	138	2024
DeepSeek-V3 technical report. A Liu, B Feng, B Xue, B Wang, B Wu, C Lu, C Zhao, C Deng, C Zhang, ... arXiv preprint arXiv:2412.19437, 2024	135	2024
Calibrating Factual Knowledge in Pretrained Language Models. Q Dong, D Dai, Y Song, J Xu, Z Sui, L Li. Findings of the Association for Computational Linguistics: EMNLP 2022, 2022	109	2022
On the representation collapse of sparse mixture of experts. Z Chi, L Dong, S Huang, D Dai, S Ma, B Patra, S Singhal, P Bajaj, X Song, ... Advances in Neural Information Processing Systems 35, 34600-34613, 2022	89	2022
Preliminary study on the construction of Chinese medical knowledge graph. O Byambasuren, Y Yang, Z Sui, D Dai, B Chang, S Li, H Zan. Journal of Chinese Information Processing 33 (10), 1-9, 2019	79*	2019
LiveBot: Generating live video comments based on visual and textual contexts. S Ma, L Cui, D Dai, F Wei, X Sun. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6810-6817, 2019	68	2019
StableMoE: Stable Routing Strategy for Mixture of Experts. D Dai, L Dong, S Ma, B Zheng, Z Sui, B Chang, F Wei. Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	65	2022
Learning to control the fine-grained sentiment for story ending generation. F Luo, D Dai, P Yang, T Liu, B Chang, Z Sui, X Sun. Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019	64	2019
Sememe prediction: Learning semantic knowledge from unstructured textual wiki descriptions. W Li, X Ren, D Dai, Y Wu, H Wang, X Sun. arXiv preprint arXiv:1808.05437, 2018	21	2018
Inductively Representing Out-of-Knowledge-Graph Entities by Optimal Estimation Under Translational Assumptions. D Dai, H Zheng, F Luo, P Yang, T Liu, Z Sui, B Chang. Proceedings of the 6th ACL Workshop on Representation Learning for NLP …, 2021	20	2021
Hierarchical Curriculum Learning for AMR Parsing. P Wang, L Chen, T Liu, D Dai, Y Cao, B Chang, Z Sui. Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022	19	2022


Articles 1–20
