Visual Reference Resolution using Attention Memory for Visual Dialog

Paul Hongsuck Seo, Andreas Lehrmann, Bohyung Han, Leonid Sigal

Neural Information Processing Systems 
