Unified Supervision For Vision-Language Modeling in 3D Computed Tomography