From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons