Towards In-context Scene Understanding

Neural Information Processing Systems

The resulting Hummingbird model, suitably prompted, performs various scene understanding tasks without modification while approaching the performance of specialists that have been finetuned for each task.







bit2bit: 1-bit quanta video reconstruction by self-supervised photon location prediction

Neural Information Processing Systems

This leads to the proposal of a novel self-supervised solution based on a masked loss function. We evaluate our method on both simulated and real data. On simulated data from a conventional video, we achieve a mean PSNR of 34.35 dB with extremely photon-sparse binary input (<0.06 photons per pixel per frame).
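
For readers unfamiliar with the metric, the following is a minimal sketch of how a mean PSNR figure of this kind can be computed. It is not the authors' evaluation code. The function names `psnr` and `mean_psnr`, the representation of each frame as a flat list of pixel values in [0, 1], and the choice to average PSNR per frame are all assumptions made for illustration.

```python
import math


def psnr(reference, estimate, data_range=1.0):
    """Peak signal-to-noise ratio in dB between two equally sized frames.

    Frames are flat sequences of pixel values (an assumption for this sketch);
    data_range is the maximum possible pixel value.
    """
    if len(reference) != len(estimate):
        raise ValueError("frames must have the same number of pixels")
    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return float("inf")  # identical frames
    return 10.0 * math.log10(data_range ** 2 / mse)


def mean_psnr(reference_video, estimated_video, data_range=1.0):
    """Average of per-frame PSNR values (assumed averaging convention)."""
    scores = [
        psnr(ref, est, data_range)
        for ref, est in zip(reference_video, estimated_video)
    ]
    return sum(scores) / len(scores)
```

Under these assumptions, a reconstruction that is off by 0.1 at every pixel on a [0, 1] scale has a mean squared error of 0.01 and therefore a PSNR of about 20 dB.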