Beyond the Pixels: VLM-based Evaluation of Identity Preservation in Reference-Guided Synthesis