All reviewers
–Neural Information Processing Systems
Thank you for the constructive comments and suggestions. This indicates success of our model in capturing long-range semantics, which is the main theme of our paper. We report the results of video captioning in TabA 1-Left. VideoBERT uses more sophisticated transformer based method. VideoBERT if we use the same captioning method.
Neural Information Processing Systems
Aug-17-2025, 11:38:01 GMT
- Technology: