Beyond expected value: geometric mean optimization for long-term policy performance in reinforcement learning
