Quantifying the Plausibility of Context Reliance in Neural Machine Translation