A Hierarchical Two-tier Approach to Hyper-parameter Optimization in Reinforcement Learning