DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging