Reinforcement Learning on Computational Resource Allocation of Cloud-based Wireless Networks