DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

Open in new window