Extending the reward structure in reinforcement learning: an interview with Tanmay Ambadkar