Analysing the Generalisation and Reliability of Steering Vectors Daniel Tan

Neural Information Processing Systems 

However, the reliability and generalisation properties of this approach are unknown.