Learning Cell-Aware Hierarchical Multi-Modal Representations for Robust Molecular Modeling