Environment-Aware Channel Inference via Cross-Modal Flow: From Multimodal Sensing to Wireless Channels