Automatic Treatment Planning using Reinforcement Learning for High-dose-rate Prostate Brachytherapy