Self-Supervised Generation of Spatial Audio for 360° Video

Pedro Morgado, Nuno Vasconcelos, Timothy Langlois, Oliver Wang

Neural Information Processing Systems 

During training, ground-truth spatial audio serves as self-supervision, and a mixed-down mono track forms the input to our network.
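As a rough illustration of this training setup, the sketch below builds one (input, target) pair from a multichannel spatial recording. It is not the authors' pipeline: the channel-averaging mix-down rule and the names `mix_down_to_mono` and `make_training_pair` are assumptions made only for this example, and a real system may derive the mono track differently.

```python
from typing import List, Tuple

# Hypothetical representation: a clip is a list of channels,
# each channel a list of samples of equal length.
Audio = List[List[float]]


def mix_down_to_mono(spatial: Audio) -> List[float]:
    """Collapse a multichannel clip to mono by averaging channels.

    Averaging is an assumed mix-down rule chosen for illustration only.
    """
    if not spatial:
        raise ValueError("spatial audio must have at least one channel")
    num_samples = len(spatial[0])
    if any(len(channel) != num_samples for channel in spatial):
        raise ValueError("all channels must have the same length")
    num_channels = len(spatial)
    return [
        sum(channel[i] for channel in spatial) / num_channels
        for i in range(num_samples)
    ]


def make_training_pair(spatial: Audio) -> Tuple[List[float], Audio]:
    """Return (mono input, spatial target) for one training example.

    The ground-truth spatial clip is its own supervision signal, so no
    manual labels are needed.
    """
    return mix_down_to_mono(spatial), spatial


# Toy two-channel clip with three samples per channel.
spatial_clip = [[0.5, 1.0, -1.0], [0.5, 0.0, 1.0]]
mono_input, target = make_training_pair(spatial_clip)
```

Here `mono_input` would be fed to the network and `target` compared against its output, so the recording itself supplies the supervision.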