KVCacheis1BitPerChannel: EfficientLarge LanguageModelInferencewithCoupledQuantization