A policy gradient approach for optimization of smooth risk measures