Position: Ensuring mutual privacy is necessary for effective external evaluation of proprietary AI systems