Neuro-Inspired Information-Theoretic Hierarchical Perception for Multimodal Learning