Use Perturbations when Learning from Explanations
Neural Information Processing Systems
Machine learning from explanations (MLX) is an approach to learning that uses human-provided explanations of relevant or irrelevant features for each input to ensure that model predictions are right for the right reasons. Existing MLX approaches rely on local model interpretation methods and require strong model smoothing to align model and human explanations, leading to sub-optimal performance.
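To make the "local model interpretation" style of MLX concrete, here is a minimal sketch of one way such an objective can look: a task loss plus a penalty on the model's input gradient at features a human marked as irrelevant. This illustrates the kind of existing approach the abstract describes, not the method this paper proposes. The logistic model, the function name `mlx_loss`, the mask encoding, and the weight `lam` are all assumptions made for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def mlx_loss(w, b, X, y, irrelevant_mask, lam=1.0):
    """Cross-entropy plus an input-gradient penalty on masked features.

    Hypothetical encoding: irrelevant_mask[i, j] = 1 means the annotator
    marked feature j of example i as irrelevant to the label.
    """
    p = sigmoid(X @ w + b)  # predicted probability of class 1
    eps = 1e-12             # guards against log(0)
    ce = -np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))
    # For a logistic model, d(log p)/dx = (1 - p) * w, one row per example.
    input_grad = (1.0 - p)[:, None] * w[None, :]
    # Penalise gradient mass only where the human said "irrelevant".
    penalty = np.mean(np.sum((irrelevant_mask * input_grad) ** 2, axis=1))
    return ce + lam * penalty, ce, penalty

# Toy usage: the second feature is marked irrelevant for both examples.
X = np.array([[1.0, 2.0], [0.5, -1.0]])
y = np.array([1.0, 0.0])
mask = np.array([[0, 1], [0, 1]])
total, ce, penalty = mlx_loss(np.array([1.0, 0.5]), 0.0, X, y, mask, lam=2.0)
```

Because the penalty constrains only the gradient at each training point, it is a purely local constraint, which is consistent with the abstract's remark that such methods need strong model smoothing before model and human explanations align.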
Feb-11-2026, 23:28:03 GMT
- Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- Industry:
- Health & Medicine
- Therapeutic Area (0.93)
- Diagnostic Medicine > Imaging (0.46)