Analyzing the Attention Heads for Pronoun Disambiguation in Context-aware Machine Translation Models