Scalable iterative pruning of large language and vision models using block coordinate descent
