Indiana Jones: There Are Always Some Useful Ancient Relics

Ding, Junchen; Zhang, Jiahao; Liu, Yi; Ding, Ziqi; Deng, Gelei; Li, Yuekang

Jan-27-2025, arXiv.org Artificial Intelligence

This paper introduces Indiana Jones, an innovative approach to jailbreaking Large Language Models (LLMs) that leverages inter-model dialogues and keyword-driven prompts. By orchestrating interactions among three specialised LLMs, the method achieves near-perfect success rates in bypassing content safeguards in both white-box and black-box LLMs. The research exposes systemic vulnerabilities in contemporary models, particularly their susceptibility to producing harmful or unethical outputs when guided by ostensibly innocuous prompts that are framed in a historical or similar context. Experimental evaluations highlight the efficacy and adaptability of Indiana Jones and demonstrate its superiority over existing jailbreak methods. These findings emphasise the urgent need for stronger ethical safeguards and robust security measures in the development of LLMs. This work also provides a foundation for future studies aimed at fortifying LLMs against adversarial exploitation while preserving their utility and flexibility.

large language model, machine learning, natural language

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America > United States
  - Indiana (0.83)
  - Missouri (0.04)
  - New York > New York County
    - New York City (0.04)
  - Louisiana > East Baton Rouge Parish
    - Central (0.04)
  - Illinois > Cook County
    - Chicago (0.04)
- Europe
  - United Kingdom (0.14)
  - Switzerland (0.04)
  - Monaco (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Singapore > Central Region
    - Singapore (0.04)

Genre:
- Research Report (1.00)

Industry:
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Law > Criminal Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- Banking & Finance (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.50)
