Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Self-discover: Large language models self-compose reasoning structures
We introduce SELF-DISCOVER, a general framework for LLMs to self-discover the task-
intrinsic reasoning structures to tackle complex reasoning problems that are challenging for …
intrinsic reasoning structures to tackle complex reasoning problems that are challenging for …
Key-point-driven data synthesis with its enhancement on mathematical reasoning
Large language models (LLMs) have shown great potential in complex reasoning tasks, yet
their performance is often hampered by the scarcity of high-quality and reasoning-focused …
their performance is often hampered by the scarcity of high-quality and reasoning-focused …
Determinants of llm-assisted decision-making
E Eigner, T Händler - arxiv preprint arxiv:2402.17385, 2024 - arxiv.org
Decision-making is a fundamental capability in everyday life. Large Language Models
(LLMs) provide multifaceted support in enhancing human decision-making processes …
(LLMs) provide multifaceted support in enhancing human decision-making processes …
Open-Ethical AI: Advancements in Open-Source Human-Centric Neural Language Models
This survey summarises the most recent methods for building and assessing helpful, honest,
and harmless neural language models, considering small, medium, and large-size models …
and harmless neural language models, considering small, medium, and large-size models …
Darg: Dynamic evaluation of large language models via adaptive reasoning graph
The current paradigm of evaluating Large Language Models (LLMs) through static
benchmarks comes with significant limitations, such as vulnerability to data contamination …
benchmarks comes with significant limitations, such as vulnerability to data contamination …
Dyval: Dynamic evaluation of large language models for reasoning tasks
Large language models (LLMs) have achieved remarkable performance in various
evaluation benchmarks. However, concerns are raised about potential data contamination in …
evaluation benchmarks. However, concerns are raised about potential data contamination in …
Graph-enhanced large language models in asynchronous plan reasoning
Planning is a fundamental property of human intelligence. Reasoning about asynchronous
plans is challenging since it requires sequential and parallel planning to optimize time costs …
plans is challenging since it requires sequential and parallel planning to optimize time costs …
Exposing limitations of language model agents in sequential-task compositions on the web
Language model agents (LMA) recently emerged as a promising paradigm on muti-step
decision making tasks, often outperforming humans and other reinforcement learning …
decision making tasks, often outperforming humans and other reinforcement learning …
VipAct: Visual-perception enhancement via specialized vlm agent collaboration and tool-use
While vision-language models (VLMs) have demonstrated remarkable performance across
various tasks combining textual and visual information, they continue to struggle with fine …
various tasks combining textual and visual information, they continue to struggle with fine …
When reasoning meets information aggregation: A case study with sports narratives
Reasoning is most powerful when an LLM accurately aggregates relevant information. We
examine the critical role of information aggregation in reasoning by requiring the LLM to …
examine the critical role of information aggregation in reasoning by requiring the LLM to …