Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection
Marinelli, Ryan, Pichlmeier, Josef, Bisztray, Tamas
–arXiv.org Artificial Intelligence
In this work, we propose a metric called "Number of Thoughts (NofT)" to determine the difficulty of tasks pre-prompting and support Large Language Models (LLMs) in production contexts. By setting thresholds based on the number of thoughts, this metric can discern the difficulty of prompts and support more effective prompt routing. A 2% decrease in latency is achieved when routing prompts from the MathInstruct dataset through quantized, distilled versions of Deepseek with 1.7 billion, 7 billion, and 14 billion parameters. Moreover, this metric can be used to detect adversarial prompts used in prompt injection attacks with high efficacy. The Number of Thoughts can inform a classifier that achieves 95% accuracy in adversarial prompt detection.
arXiv.org Artificial Intelligence
Mar-27-2025
- Country:
- Asia > Thailand
- Europe
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Norway > Eastern Norway
- Oslo (0.04)
- Germany > Bavaria
- North America > United States
- Florida > Miami-Dade County > Miami (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Information Technology (0.46)
- Technology: