MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation