Gradient Short-Circuit: Efficient Out-of-Distribution Detection via Feature Intervention
Gu, Jiawei, Qiao, Ziyue, Li, Zechao
–arXiv.org Artificial Intelligence
Out-of-Distribution (OOD) detection is critical for safely deploying deep models in open-world environments, where inputs may lie outside the training distribution. During inference on a model trained exclusively with In-Distribution (ID) data, we observe a salient gradient phenomenon: around an ID sample, the local gradient directions for "enhancing" that sample's predicted class remain relatively consistent, whereas OOD samples--unseen in training--exhibit disorganized or conflicting gradient directions in the same neighborhood. Motivated by this observation, we propose an inference-stage technique to short-circuit those feature coordinates that spurious gradients exploit to inflate OOD confidence, while leaving ID classification largely intact. T o circumvent the expense of recom-puting the logits after this gradient short-circuit, we further introduce a local first-order approximation that accurately captures the post-modification outputs without a second forward pass. Experiments on standard OOD benchmarks show our approach yields substantial improvements. Moreover, the method is lightweight and requires minimal changes to the standard inference pipeline, offering a practical path toward robust OOD detection in real-world applications.
arXiv.org Artificial Intelligence
Jul-8-2025
- Genre:
- Research Report (1.00)
- Technology:
- Information Technology
- Data Science > Data Mining (0.93)
- Artificial Intelligence
- Vision (0.93)
- Representation & Reasoning (0.93)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Performance Analysis > Accuracy (0.94)
- Information Technology