Learning Deterministic Policy with Target for Power Control in Wireless Networks