Efficient Language-instructed Skill Acquisition via Reward-Policy Co-Evolution