Unifying 2D and 3D Vision-Language Understanding