Being-M0.5: A Real-Time Controllable Vision-Language-Motion Model