Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

Neural Information Processing Systems 

Large language model systems face significant security risks from maliciously crafted messages that aim to overwrite the system's original instructions or leak