Multi-Task Cross-Modality Attention-Fusion for 2D Object Detection