Towards Ethical Multi-Agent Systems of Large Language Models: A Mechanistic Interpretability Perspective

Open in new window