Appendix
–Neural Information Processing Systems
The DeceptionBench is designed as a research benchmark to systematically study deception behaviors in LLMs, fostering a deeper understanding of their decision-making processes in real-world scenarios. Our primary intent is to provide a standardized, transparent tool for the research community to evaluate and improve LLMs' ethical alignment, not to enable or encourage deceptive practices. To prevent potential misuse by malicious actors, we commit to publicly releasing all evaluation data under an open license. This transparency ensures that DeceptionBench's methodology and outcomes are subject to scrutiny, replication, and improvement by the research community, reducing the risk of hidden exploitation. By prioritizing openness, we aim to advance responsible AI development while safeguarding against misuse in harmful contexts. The field of Large Language Models (LLMs) has undergone remarkable evolution in recent years, reshaping the landscape of natural language processing.
Neural Information Processing Systems
Jun-17-2026, 09:29:13 GMT
- Genre:
- Research Report (0.67)
- Personal > Interview (0.46)
- Industry:
- Leisure & Entertainment (1.00)
- Law (1.00)
- Government > Tax (0.93)
- Media > Film (0.68)
- Banking & Finance (0.68)
- Marketing (0.68)
- Health & Medicine
- Pharmaceuticals & Biotechnology (0.69)
- Therapeutic Area
- Psychiatry/Psychology (0.68)
- Immunology (0.47)
- Vaccines (0.47)
- Technology: