Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models

Open in new window