Efficient Long Sequence Modeling via State Space Augmented Transformer
