Lyapunov-based Safe Policy Optimization for Continuous Control