Fraud-R1: A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements
Yang, Shu, Zhu, Shenzhe, Wu, Zeyu, Wang, Keyu, Yao, Junchi, Wu, Junchao, Hu, Lijie, Li, Mengdi, Wong, Derek F., Wang, Di
arXiv.org Artificial Intelligence
We introduce Fraud-R1, a benchmark designed to evaluate LLMs' ability to defend against internet fraud and phishing in dynamic, real-world scenarios. Fraud-R1 comprises 8,564 fraud cases sourced from phishing scams, fake job postings, social media, and news, categorized into 5 major fraud types. Unlike previous benchmarks, Fraud-R1 introduces a multi-round evaluation pipeline to assess LLMs' resistance to fraud at different stages, including credibility building, urgency creation, and emotional manipulation. Furthermore, we evaluate 15 LLMs under two settings: (1) Helpful-Assistant, where the LLM provides general decision-making assistance, and (2) Role-play, where the model assumes a specific persona, a setup widely used in real-world agent-based interactions. Our evaluation reveals significant challenges in defending against fraud and phishing inducement, especially in role-play settings and on fake job postings. Additionally, we observe a substantial performance gap between Chinese and English, underscoring the need for improved multilingual fraud detection capabilities.
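The multi-round pipeline described above can be sketched as a simple escalation loop: each fraud case is re-presented with increasingly persuasive framing, and a judge checks whether the model still refuses. This is an illustrative sketch only, not the authors' code; the stage names follow the abstract, while `escalate_prompt`, `is_defended`, and the stub model are hypothetical.

```python
# Hypothetical sketch of a Fraud-R1-style multi-round evaluation loop.
# Stages follow the abstract; all function names here are assumptions.

ROUNDS = ["base", "credibility_building", "urgency_creation", "emotional_manipulation"]

def escalate_prompt(case: str, stage: str) -> str:
    """Augment a fraud case with stage-specific persuasion (stub)."""
    return f"[{stage}] {case}"

def is_defended(response: str) -> bool:
    """Toy judge: treat an explicit refusal or scam warning as a defense."""
    text = response.lower()
    return "decline" in text or "scam" in text

def evaluate(model, case: str) -> dict:
    """Run one fraud case through all rounds; stop once the model is induced."""
    results = {}
    for stage in ROUNDS:
        response = model(escalate_prompt(case, stage))
        defended = is_defended(response)
        results[stage] = defended
        if not defended:  # model complied with the fraud; later rounds are moot
            break
    return results

# Usage: a stub model that resists early rounds but folds under
# emotional manipulation, mirroring the staged escalation being tested.
def stub_model(prompt: str) -> str:
    if "emotional_manipulation" in prompt:
        return "Sure, here is my bank account number."
    return "I must decline; this looks like a scam."

print(evaluate(stub_model, "Urgent: verify your account to claim your prize"))
```

In a real harness the stub judge would be replaced by a stronger classifier or LLM judge, and the run would be repeated under both the Helpful-Assistant and Role-play settings.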
Feb-18-2025
- Country:
  - Asia > China > Hong Kong (0.27)
  - North America > United States (1.00)
- Genre:
- Research Report > New Finding (0.45)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Law Enforcement & Public Safety > Fraud (1.00)