Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction