Large Multimodal Model Compression via Efficient Pruning and Distillation at AntGroup

Open in new window