Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models
–Neural Information Processing Systems
Therefore, if we could properly restrict the PLM's adaptation to the moderate-fitting stage, the model would neglect the backdoor triggers but still achieve satisfying performance on the original task.
Oct-1-2025, 22:22:58 GMT
- Country:
  - Asia
  - Europe
    - Ireland > Leinster
      - County Dublin > Dublin (0.04)
    - Italy > Sicily
      - Palermo (0.04)
    - Romania > Sud - Muntenia Development Region
      - Giurgiu County > Giurgiu (0.04)
  - North America
    - Canada > Quebec
      - Montreal (0.04)
    - Dominican Republic (0.04)
    - United States
      - California
        - Los Angeles County > Long Beach (0.14)
        - San Diego County > San Diego (0.04)
      - Illinois > Champaign County
        - Urbana (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Washington > King County
        - Seattle (0.04)
  - Oceania > Australia
    - New South Wales > Sydney (0.04)
- Genre:
  - Research Report > New Finding (0.46)
- Industry:
  - Information Technology > Security & Privacy (0.72)
- Technology: