Theoretical Refinement of CLIP by Utilizing Linear Structure of Optimal Similarity