Layer-adaptive Structured Pruning Guided by Latency
