Adaptive Teaching in Heterogeneous Agents: Balancing Surprise in Sparse Reward Scenarios