Use Perturbations when Learning from Explanations

Neural Information Processing Systems 

Machine learning from explanations (MLX) is an approach to learning that uses human-provided explanations of relevant or irrelevant features for each input to ensure that model predictions are right for the right reasons . Existing MLX approaches rely on local model interpretation methods and require strong model smoothing to align model and human explanations, leading to sub-optimal performance.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found