Discussion of Evaluation Methodologies

Apr-25-2026, 01:14:47 GMT–Neural Information Processing Systems

In previous research, there are plenty of arguments about textual backdoor evaluation, including diverse metrics and experiment settings. These valuable discussions motivate us to construct a rigorous benchmark and we highly appreciate their efforts. In this section, we briefly summarize existing opinions and provide a more detailed discussion on this topic. Table 9 summarizes the attackers OpenBackdoorimplements. Effectiveness Besides the mainstream ASR (also called LFR [20]) and CACC metrics, there are also other effectiveness metrics. Shen et al. [46] proposed to count the number of inserted triggers that can successfully flip the label. However, although inserting more triggers could benefit attack strength, the triggers also corrupt the sentences gradually, so it is also possible that the poisoned samples become "adversarial", and we can hardly distinguish. Shen et al. [45] also mentioned this issue, and they advised calculating the ASR difference between a poisoned model and a clean model as an effectiveness metric.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Apr-25-2026, 01:14:47 GMT

Conferences PDF

Add feedback

Industry:
- Information Technology > Security & Privacy (0.48)
- Media > Film (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (0.69)
  - Machine Learning > Neural Networks (0.46)

Duplicate Docs Excel Report

Title
2052b3e0617ecb2ce9474a6feaf422b3-Supplemental-Datasets_and_Benchmarks.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found