Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments