OVERTHINKING: Slowdown Attacks on Reasoning LLMs
We increase overhead for applications that rely on reasoning LLMs-we force models to
spend an amplified number of reasoning tokens, ie," overthink", to respond to the user query …
spend an amplified number of reasoning tokens, ie," overthink", to respond to the user query …