Benchmarking Robustness to Adversarial Image Obfuscations
Neural Information Processing Systems
Automated content filtering and moderation is an important tool that allows online platforms to build thriving user communities that facilitate cooperation and prevent abuse. Unfortunately, resourceful actors try to bypass automated filters in a bid to post content that violates platform policies and codes of conduct. To reach this goal, these malicious actors may obfuscate policy-violating images (e.g.
Mar-27-2025, 09:31:50 GMT
- Genre:
  - Research Report > New Finding (0.45)
- Industry:
  - Automobiles & Trucks (0.67)
  - Information Technology (0.92)
  - Law Enforcement & Public Safety (0.67)
  - Transportation
    - Freight & Logistics Services (0.67)
    - Ground > Road (0.92)
- Technology: