From Scan to Action: Leveraging Realistic Scans for Embodied Scene Understanding