Online Knowledge Distillation with Reward Guidance