Urmăriți
Zhihong Shao
Zhihong Shao
Adresă de e-mail confirmată pe mails.tsinghua.edu.cn - Pagina de pornire
Titlu
Citat de
Citat de
Anul
Critic: Large language models can self-correct with tool-interactive critiquing
Z Gou, Z Shao, Y Gong, Y Shen, Y Yang, N Duan, W Chen
ICLR 2024, 2023
337*2023
Deepseekmath: Pushing the limits of mathematical reasoning in open language models
Z Shao, P Wang, Q Zhu, R Xu, J Song, X Bi, H Zhang, M Zhang, YK Li, ...
arXiv preprint arXiv:2402.03300, 2024
303*2024
Deepseek llm: Scaling open-source language models with longtermism
X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ...
arXiv preprint arXiv:2401.02954, 2024
2442024
Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning
D Guo, D Yang, H Zhang, J Song, R Zhang, R Xu, Q Zhu, S Ma, P Wang, ...
arXiv preprint arXiv:2501.12948, 2025
2402025
Math-shepherd: Verify and reinforce llms step-by-step without human annotations
P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui
ACL 2024, 2023
217*2023
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
Z Shao, Y Gong, Y Shen, M Huang, N Duan, W Chen
Findings of EMNLP 2023, 2023
1812023
Deepseek-v3 technical report
A Liu, B Feng, B Xue, B Wang, B Wu, C Lu, C Zhao, C Deng, C Zhang, ...
arXiv preprint arXiv:2412.19437, 2024
1742024
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Z Shao*, Z Gou*, Y Gong, Y Yang, M Huang, N Duan, W Chen
ICLR 2024, 2023
174*2023
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Q Zhu*, D Guo*, Z Shao*, D Yang*, P Wang, R Xu, Y Wu, Y Li, H Gao, ...
arXiv preprint arXiv:2406.11931, 2024
168*2024
Deepseek-v2: A strong, economical, and efficient mixture-of-experts language model
A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Dengr, C Ruan, D Dai, ...
arXiv preprint arXiv:2405.04434, 2024
1512024
Long and diverse text generation with planning-based hierarchical variational model
Z Shao, M Huang, J Wen, W Xu, X Zhu
EMNLP 2019, 2019
1322019
Synthetic prompting: Generating chain-of-thought demonstrations for large language models
Z Shao, Y Gong, Y Shen, M Huang, N Duan, W Chen
ICML 2023, 2023
972023
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
H Xin, D Guo, Z Shao, Z Ren, Q Zhu, B Liu, C Ruan, W Li, X Liang
NeurIPS 2024 Workshop MATH-AI, 2024
452024
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
H Xin*, ZZ Ren*, J Song*, Z Shao*, W Zhao, H Wang, B Liu, L Zhang, X Lu, ...
ICLR 2025, 2024
38*2024
AdvExpander: Generating Natural Language Adversarial Examples by Expanding Text
Z Shao, Z Wu, M Huang
TASLP 2021, 2020
152020
Chaining Simultaneous Thoughts for Numerical Reasoning
Z Shao, F Huang, M Huang
Findings of EMNLP 2022, 2022
142022
Answering Open-Domain Multi-Answer Questions via a Recall-then-Verify Framework
Z Shao, M Huang
ACL 2022, 2021
132021
A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering
Z Shao, L Shang, Q Liu, M Huang
ACL 2021, 2021
62021
Learning Task Decomposition to Assist Humans in Competitive Programming
J Wen, R Zhong, P Ke, Z Shao, H Wang, M Huang
ACL 2024, 2024
42024
Cotk: An open-source toolkit for fast development and fair evaluation of text generation
F Huang, D Wan, Z Shao, P Ke, J Guan, Y Niu, X Zhu, M Huang
arXiv preprint arXiv:2002.00583, 2020
32020
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–20