OVERTHINKING: Slowdown Attacks on Reasoning LLMs

A Kumar, J Roh, A Naseh, M Karpinska, M Iyyer… - arXiv preprint arXiv …, 2025 - arxiv.org
We increase overhead for applications that rely on reasoning LLMs: we force models to
spend an amplified number of reasoning tokens, i.e., "overthink", to respond to the user query …