Look, Remember and Reason: Grounded reasoning in videos with language models