Per-Domain Generalizing Policies: On Validation Instances and Scaling Behavior