Towards Cost-Effective Reward Guided Text Generation