See and Think: Embodied Agent in Virtual Environment