SOE: Sample-Efficient Robot Policy Self-Improvement via On-Manifold Exploration