HowShouldPre-TrainedLanguageModelsBe Fine-TunedTowardsAdversarialRobustness?

Neural Information Processing Systems 

Attack algorithms [19, 7, 76, 40, 2] aim to maliciously generate adversarial examples to fool a victim model, while adversarial defense aims at building robustness against them.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found