Whats in a Video: Factorized Autoregressive Decoding for Online Dense Video Captioning

Open in new window