Incrementality Bidding via Reinforcement Learning under Mixed and Delayed Rewards