Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem