Trained Mamba Emulates Online Gradient Descent in In-Context Linear Regression

Open in new window