Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms

Open in new window