M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference