Reward Engineering for Generating Semi-structured Explanation