ShoeFit: ANew Dataset and Dual-image-stream DiT Framework for Virtual Footwear Try-On
–Neural Information Processing Systems
Virtual footwear try-on (VFTON), a critical yet underexplored area in virtual try-on (VTON), aims to synthesize faithful try-on results given diverse footwear and model (1) Data Scarimages while maintaining 3D consistency and texture authenticity. Unlike convenwith difficult matchtional garment-focused VTON methods, VFTON presents unique challenges due to (1) Data Scarcity, which arises from the difficulty of perfectly matching product shoes with models wearing the identical ones, (2) Viewpoint Misalignment, where the target foot pose and source shoe views are always misaligned, leading to incomplete texture information and detail distortion, and (3) Background-induced iewpoint Color Distortion, where complex material of footwear interacts with environmental lighting, causing unintended color contamination.
Neural Information Processing Systems
Jun-16-2026, 00:23:15 GMT
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Information Technology > Security & Privacy (0.46)
- Technology: