Functional Optimization Reinforcement Learning for Real-Time Bidding