Towards Self-Supervised High Level Sensor Fusion