Are PPO-ed Language Models Hackable?