SLIM: Sim-to-Real Legged Instructive Manipulation via Long-Horizon Visuomotor Learning