A Multimodal Framework for Deepfake Detection