Temporal Video-Language Alignment Network for Reward Shaping in Reinforcement Learning

Open in new window