Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails
