VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model