LoCoCo: Dropping In Convolutions for Long Context Compression

Open in new window