Cheaply Estimating Inference Efficiency Metrics for Autoregressive Transformer Models