Rich State Observations Empower Reinforcement Learning to Surpass PID: A Drone Ball Balancing Study