RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models

Open in new window