Beyond Unimodal Boundaries: Generative Recommendation with Multimodal Semantics