Markowitz Meets Bellman: Knowledge-distilled Reinforcement Learning for Portfolio Management