Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing