Subspace Optimization for Large Language Models with Convergence Guarantees