Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation