D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks