On Reference (In-)Determinacy in Natural Language Inference