Two Body Problem: Collaborative Visual Task Completion