SPARTA ALIGNMENT: Collectively Aligning Multiple Language Models through Combat
