Enforcing Reasoning in Visual Commonsense Reasoning