Follow
Zhihong Shao
Zhihong Shao
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Cited by
Year
Critic: Large language models can self-correct with tool-interactive critiquing
Z Gou, Z Shao, Y Gong, Y Shen, Y Yang, N Duan, W Chen
ICLR 2024, 2023
292*2023
Deepseekmath: Pushing the limits of mathematical reasoning in open language models
Z Shao, P Wang, Q Zhu, R Xu, J Song, X Bi, H Zhang, M Zhang, YK Li, ...
arXiv preprint arXiv:2402.03300, 2024
196*2024
Deepseek llm: Scaling open-source language models with longtermism
X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ...
arXiv preprint arXiv:2401.02954, 2024
1952024
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
Z Shao, Y Gong, Y Shen, M Huang, N Duan, W Chen
Findings of EMNLP 2023, 2023
1652023
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Z Shao*, Z Gou*, Y Gong, Y Yang, M Huang, N Duan, W Chen
ICLR 2024, 2023
151*2023
Math-shepherd: Verify and reinforce llms step-by-step without human annotations
P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui
ACL 2024, 2023
139*2023
Long and diverse text generation with planning-based hierarchical variational model
Z Shao, M Huang, J Wen, W Xu, X Zhu
EMNLP 2019, 2019
1312019
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Q Zhu*, D Guo*, Z Shao*, D Yang*, P Wang, R Xu, Y Wu, Y Li, H Gao, ...
arXiv preprint arXiv:2406.11931, 2024
120*2024
Deepseek-v2: A strong, economical, and efficient mixture-of-experts language model
A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Dengr, C Ruan, D Dai, ...
arXiv preprint arXiv:2405.04434, 2024
1132024
Synthetic prompting: Generating chain-of-thought demonstrations for large language models
Z Shao, Y Gong, Y Shen, M Huang, N Duan, W Chen
ICML 2023, 2023
942023
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
H Xin, D Guo, Z Shao, Z Ren, Q Zhu, B Liu, C Ruan, W Li, X Liang
NeurIPS 2024 Workshop MATH-AI, 2024
252024
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
H Xin*, ZZ Ren*, J Song*, Z Shao*, W Zhao, H Wang, B Liu, L Zhang, X Lu, ...
ICLR 2025, 2024
23*2024
Deepseek-v3 technical report
A Liu, B Feng, B Xue, B Wang, B Wu, C Lu, C Zhao, C Deng, C Zhang, ...
arXiv preprint arXiv:2412.19437, 2024
202024
AdvExpander: Generating Natural Language Adversarial Examples by Expanding Text
Z Shao, Z Wu, M Huang
TASLP 2021, 2020
142020
Chaining Simultaneous Thoughts for Numerical Reasoning
Z Shao, F Huang, M Huang
Findings of EMNLP 2022, 2022
132022
Answering Open-Domain Multi-Answer Questions via a Recall-then-Verify Framework
Z Shao, M Huang
ACL 2022, 2021
122021
Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning
D Guo, D Yang, H Zhang, J Song, R Zhang, R Xu, Q Zhu, S Ma, P Wang, ...
arXiv preprint arXiv:2501.12948, 2025
6*2025
A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering
Z Shao, L Shang, Q Liu, M Huang
ACL 2021, 2021
62021
Learning Task Decomposition to Assist Humans in Competitive Programming
J Wen, R Zhong, P Ke, Z Shao, H Wang, M Huang
ACL 2024, 2024
32024
Cotk: An open-source toolkit for fast development and fair evaluation of text generation
F Huang, D Wan, Z Shao, P Ke, J Guan, Y Niu, X Zhu, M Huang
arXiv preprint arXiv:2002.00583, 2020
32020
The system can't perform the operation now. Try again later.
Articles 1–20