Generating and Detecting True Ambiguity: A Forgotten Danger in DNN Supervision Testing