O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models