Policy Optimization for Robust Average Cost MDPs