Reasoning-Aware Multimodal Fusion for Hateful Video Detection