LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior