Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A Survey on LLM-as-a-Judge
Accurate and consistent evaluation is crucial for decision-making across numerous fields,
yet it remains a challenging task due to inherent subjectivity, variability, and scale. Large …
yet it remains a challenging task due to inherent subjectivity, variability, and scale. Large …
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
With the success of 2D diffusion models, 2D AIGC content has already transformed our lives.
Recently, this success has been extended to 3D AIGC, with state-of-the-art methods …
Recently, this success has been extended to 3D AIGC, with state-of-the-art methods …
The AI Agent Index
Leading AI developers and startups are increasingly deploying agentic AI systems that can
plan and execute complex tasks with limited human involvement. However, there is currently …
plan and execute complex tasks with limited human involvement. However, there is currently …
Mobilesafetybench: Evaluating safety of autonomous agents in mobile device control
Autonomous agents powered by large language models (LLMs) show promising potential in
assistive tasks across various domains, including mobile device control. As these agents …
assistive tasks across various domains, including mobile device control. As these agents …
AI Cyber Risk Benchmark: Automated Exploitation Capabilities
We introduce a new benchmark for assessing AI models' capabilities and risks in automated
software exploitation, focusing on their ability to detect and exploit vulnerabilities in real …
software exploitation, focusing on their ability to detect and exploit vulnerabilities in real …
G-Safeguard: A Topology-Guided Security Lens and Treatment on LLM-based Multi-agent Systems
Large Language Model (LLM)-based Multi-agent Systems (MAS) have demonstrated
remarkable capabilities in various complex tasks, ranging from collaborative problem …
remarkable capabilities in various complex tasks, ranging from collaborative problem …
The Science of Evaluating Foundation Models
The emergent phenomena of large foundation models have revolutionized natural language
processing. However, evaluating these models presents significant challenges due to their …
processing. However, evaluating these models presents significant challenges due to their …
SoK: Unifying Cybersecurity and Cybersafety of Multimodal Foundation Models with an Information Theory Approach
Multimodal foundation models (MFMs) represent a significant advancement in artificial
intelligence, combining diverse data modalities to enhance learning and understanding …
intelligence, combining diverse data modalities to enhance learning and understanding …
The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1
The rapid development of large reasoning models, such as OpenAI-o3 and DeepSeek-R1,
has led to significant improvements in complex reasoning over non-reasoning large …
has led to significant improvements in complex reasoning over non-reasoning large …
Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers
A trained Large Language Model (LLM) contains much of human knowledge. Yet, it is
difficult to gauge the extent or accuracy of that knowledge, as LLMs do not always``know …
difficult to gauge the extent or accuracy of that knowledge, as LLMs do not always``know …