Node-Based Editing for Multimodal Generation of Text, Audio, Image, and Video