Eliciting Language Model Behaviors with Investigator Agents
