Zero-Direction Probing: A Linear-Algebraic Framework for Deep Analysis of Large-Language-Model Drift

Aug-12-2025–arXiv.org Machine Learning

We present Zero-Direction Probing (ZDP), a theory-only framework for detecting model drift from null directions of transformer activations without task labels or output evaluations. Under assumptions A1–A6, we prove: (i) the Variance–Leak Theorem, (ii) Fisher Null-Conservation, (iii) a Rank–Leak bound for low-rank updates, and (iv) a logarithmic-regret guarantee for online null-space trackers. We derive a Spectral Null-Leakage (SNL) metric with non-asymptotic tail bounds and a concentration inequality, yielding a-priori thresholds for drift under a Gaussian null model. These results show that monitoring right/left null spaces of layer activations and their Fisher geometry provides concrete, testable guarantees on representational change.
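The abstract does not define the SNL metric, so the following is only a minimal illustrative sketch of the general idea of monitoring a null space of layer activations, not the paper's method. It assumes activations are stacked as rows of a matrix, takes the right null space of a reference batch via SVD, and reports the fraction of a new batch's energy that falls into that subspace. The function names `null_basis` and `leakage`, the tolerance, and the synthetic data are all hypothetical.

```python
import numpy as np

def null_basis(acts, rel_tol=1e-8):
    """Orthonormal basis for the right null space of an (n_samples, d) matrix.

    Hypothetical helper: directions whose singular value is below
    rel_tol * (largest singular value) are treated as null directions.
    """
    _, s, vt = np.linalg.svd(acts, full_matrices=True)
    rank = int(np.sum(s > rel_tol * s[0])) if s.size else 0
    return vt[rank:].T  # shape (d, d - rank)

def leakage(acts_new, basis):
    """Fraction of the squared Frobenius norm of acts_new that lies in the
    reference null space. An illustrative stand-in, not the paper's SNL."""
    total = np.sum(acts_new ** 2)
    if total == 0 or basis.shape[1] == 0:
        return 0.0
    return float(np.sum((acts_new @ basis) ** 2) / total)

# Synthetic example: reference activations live in a rank-4 subspace of R^16.
rng = np.random.default_rng(0)
d, r, n = 16, 4, 200
mix = rng.standard_normal((r, d))
ref = rng.standard_normal((n, r)) @ mix
basis = null_basis(ref)

same = rng.standard_normal((n, r)) @ mix            # same subspace, no drift
drifted = same + 0.1 * rng.standard_normal((n, d))  # small full-rank perturbation

leak_same = leakage(same, basis)
leak_drifted = leakage(drifted, basis)
print(leak_same, leak_drifted)
```

In this toy setting the undrifted batch leaks essentially nothing into the reference null space, while the perturbed batch leaks a small but clearly nonzero fraction. A threshold on such a quantity would have to come from a null model such as the Gaussian one the abstract mentions; none is assumed here.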

large language model, machine learning, natural language

Country:
- North America > United States > North Carolina > Mecklenburg County > Charlotte (0.04)

Genre:
- Research Report (0.69)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.65)
  - Machine Learning
    - Neural Networks (0.68)
    - Performance Analysis > Accuracy (0.46)
