End-to-end Dense Video Captioning as Sequence Generation
