Hyper-parameter optimization based on soft actor critic and hierarchical mixture regularization