Large Language Model Inference with Lexical Shortlisting