Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective

Open in new window