SIRIUS : Contexual Sparisty with Correction for Efficient LLMs