A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents