Improving Rare Word Translation With Dictionaries and Attention Masking