A survey on in-context learning Q Dong, L Li, D Dai, C Zheng, J Ma, R Li, H Xia, J Xu, Z Wu, T Liu, ... arXiv preprint arXiv:2301.00234, 2022 | 1361 | 2022 |
Knowledge neurons in pretrained transformers D Dai, L Dong, Y Hao, Z Sui, B Chang, F Wei arXiv preprint arXiv:2104.08696, 2021 | 526 | 2021 |
Large language models are not fair evaluators P Wang, L Li, L Chen, Z Cai, D Zhu, B Lin, Y Cao, Q Liu, T Liu, Z Sui arXiv preprint arXiv:2305.17926, 2023 | 377 | 2023 |
Why can GPT learn in-context? Language models implicitly perform gradient descent as meta-optimizers D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei arXiv preprint arXiv:2212.10559, 2022 | 361 | 2022 |
Table-to-text generation by structure-aware seq2seq learning T Liu, K Wang, L Sha, B Chang, Z Sui Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 317 | 2018 |
Jointly extracting event triggers and arguments by dependency-bridge RNN and tensor-based argument interaction L Sha, F Qian, B Chang, Z Sui Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 299 | 2018 |
CBLUE: A Chinese biomedical language understanding evaluation benchmark N Zhang, M Chen, Z Bi, X Liang, L Li, X Shang, K Yin, C Tan, J Xu, ... arXiv preprint arXiv:2106.08087, 2021 | 207 | 2021 |
Towards time-aware knowledge graph completion T Jiang, T Liu, T Ge, L Sha, B Chang, S Li, Z Sui Proceedings of COLING 2016, the 26th International Conference on …, 2016 | 193 | 2016 |
A dual reinforcement learning framework for unsupervised text style transfer F Luo, P Li, J Zhou, P Yang, B Chang, Z Sui, X Sun arXiv preprint arXiv:1905.10060, 2019 | 192 | 2019 |
A soft-label method for noise-tolerant distantly supervised relation extraction T Liu, K Wang, B Chang, Z Sui Proceedings of the 2017 conference on empirical methods in natural language …, 2017 | 164 | 2017 |
Implicit discourse relation classification via multi-task neural networks Y Liu, S Li, X Zhang, Z Sui Proceedings of the AAAI conference on artificial intelligence 30 (1), 2016 | 150 | 2016 |
Encoding temporal information for time-aware link prediction T Jiang, T Liu, T Ge, L Sha, S Li, B Chang, Z Sui Proceedings of the 2016 conference on empirical methods in natural language …, 2016 | 147 | 2016 |
DeepSeekMoE: Towards ultimate expert specialization in mixture-of-experts language models D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ... arXiv preprint arXiv:2401.06066, 2024 | 142 | 2024 |
Order-planning neural text generation from structured data L Sha, L Mou, T Liu, P Poupart, S Li, B Chang, Z Sui Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 128 | 2018 |
A token-level reference-free hallucination detection benchmark for free-form text generation T Liu, Y Zhang, C Brockett, Y Mao, Z Sui, W Chen, B Dolan arXiv preprint arXiv:2104.08704, 2021 | 126 | 2021 |
Incorporating glosses into neural word sense disambiguation F Luo, T Liu, Q Xia, B Chang, Z Sui arXiv preprint arXiv:1805.08028, 2018 | 116 | 2018 |
Calibrating factual knowledge in pretrained language models Q Dong, D Dai, Y Song, J Xu, Z Sui, L Li arXiv preprint arXiv:2210.03329, 2022 | 113 | 2022 |
Math-Shepherd: Verify and reinforce LLMs step-by-step without human annotations P Wang, L Li, Z Shao, R Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | 112 | 2024 |
Reading and thinking: Re-read LSTM unit for textual entailment recognition L Sha, B Chang, Z Sui, S Li Proceedings of COLING 2016, the 26th International Conference on …, 2016 | 93 | 2016 |
XGPT: Cross-modal generative pre-training for image captioning Q Xia, H Huang, N Duan, D Zhang, L Ji, Z Sui, E Cui, T Bharti, M Zhou Natural Language Processing and Chinese Computing: 10th CCF International …, 2021 | 82 | 2021 |