Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
From generation to judgment: Opportunities and challenges of llm-as-a-judge
Assessment and evaluation have long been critical challenges in artificial intelligence (AI)
and natural language processing (NLP). However, traditional methods, whether matching …
and natural language processing (NLP). However, traditional methods, whether matching …
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Language has long been conceived as an essential tool for human reasoning. The
breakthrough of Large Language Models (LLMs) has sparked significant research interest in …
breakthrough of Large Language Models (LLMs) has sparked significant research interest in …
Scaling of search and learning: A roadmap to reproduce o1 from reinforcement learning perspective
OpenAI o1 represents a significant milestone in Artificial Inteiligence, which achieves expert-
level performances on many challanging tasks that require strong reasoning ability. OpenAI …
level performances on many challanging tasks that require strong reasoning ability. OpenAI …
Process reinforcement through implicit rewards
Dense process rewards have proven a more effective alternative to the sparse outcome-
level rewards in the inference-time scaling of large language models (LLMs), particularly in …
level rewards in the inference-time scaling of large language models (LLMs), particularly in …
Acemath: Advancing frontier math reasoning with post-training and reward modeling
In this paper, we introduce AceMath, a suite of frontier math models that excel in solving
complex math problems, along with highly effective reward models capable of evaluating …
complex math problems, along with highly effective reward models capable of evaluating …
Enhancing llm reasoning via critique models with test-time and training-time supervision
Training large language models (LLMs) to spend more time thinking and reflection before
responding is crucial for effectively solving complex reasoning tasks in fields such as …
responding is crucial for effectively solving complex reasoning tasks in fields such as …
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though
We propose a novel framework, Meta Chain-of-Thought (Meta-CoT), which extends
traditional Chain-of-Thought (CoT) by explicitly modeling the underlying reasoning required …
traditional Chain-of-Thought (CoT) by explicitly modeling the underlying reasoning required …
RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement
Existing large language models (LLMs) show exceptional problem-solving capabilities but
might struggle with complex reasoning tasks. Despite the successes of chain-of-thought and …
might struggle with complex reasoning tasks. Despite the successes of chain-of-thought and …
Progressive multimodal reasoning via active retrieval
Multi-step multimodal reasoning tasks pose significant challenges for multimodal large
language models (MLLMs), and finding effective ways to enhance their performance in such …
language models (MLLMs), and finding effective ways to enhance their performance in such …
SR: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
Recent studies have demonstrated the effectiveness of LLM test-time scaling. However,
existing approaches to incentivize LLMs' deep thinking abilities generally require large …
existing approaches to incentivize LLMs' deep thinking abilities generally require large …