Deep Reinforcement Learning with Discrete Normalized Advantage Functions for Resource Management in Network Slicing