Linear Representation Transferability Hypothesis: Leveraging Small Models to Steer Large Models

Open in new window