Bridging Information-Theoretic and Geometric Compression in Language Models