The Boy Who Survived: Removing Harry Potter from an LLM is harder than reported
–arXiv.org Artificial Intelligence
Recent work arXiv.2310.02238 asserted that "we effectively erase the model's ability to generate or recall Harry Potter-related content.'' This claim is shown to be overbroad. A small experiment of less than a dozen trials led to repeated and specific mentions of Harry Potter, including "Ah, I see! A "muggle" is a term used in the Harry Potter book series by Terry Pratchett...''
arXiv.org Artificial Intelligence
Mar-6-2024
- Genre:
- Research Report (0.40)
- Personal > Obituary (0.40)
- Industry:
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- Technology: