DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging
