SpeedLimit: Neural Architecture Search for Quantized Transformer Models
