Harnessing Bounded-Support Evolution Strategies for Policy Refinement