Multi-objective Reinforcement Learning through Continuous Pareto Manifold Approximation