Obliviator Reveals the Cost of Nonlinear Guardedness in Concept Erasure
–Neural Information Processing Systems
Concept erasure aims to remove unwanted attributes, such as social or demographic factors, from learned representations, while preserving their task-relevant utility. While the goal of concept erasure is protection against all adversaries, existing methods remain vulnerable to nonlinear ones. This vulnerability arises from their failure to fully capture the complex, nonlinear statistical dependencies between learned representations and unwanted attributes. Moreover, although the existence of a trade-off between utility and erasure is expected, its progression during the erasure process, i.e., the cost of erasure, remains unstudied. In this work, we introduce Obliviator, a post-hoc erasure method designed to fully capture nonlinear statistical dependencies.
Neural Information Processing Systems
Jun-22-2026, 21:48:05 GMT
- Genre:
- Overview (0.67)
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Industry:
- Information Technology > Security & Privacy (0.46)
- Government (0.45)
- Technology: