Online Optimization for Offline Safe Reinforcement Learning