Balancing SoC in Battery Cells using Safe Action Perturbations