Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation