Optimisation of Resource Allocation in Heterogeneous Wireless Networks Using Deep Reinforcement Learning