Goto

Collaborating Authors

 unmodnet



UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging

Neural Information Processing Systems

A conventional camera often suffers from over-or under-exposure when recording a real-world scene with a very high dynamic range (HDR). In contrast, a modulo camera with a Markov random field (MRF) based unwrapping algorithm can theoretically accomplish unbounded dynamic range but shows degenerate performances when there are modulus-intensity ambiguity, strong local contrast, and color misalignment. In this paper, we reformulate the modulo image unwrapping problem into a series of binary labeling problems and propose a modulo edge-aware model, named as UnModNet, to iteratively estimate the binary rollover masks of the modulo image for unwrapping. Experimental results show that our approach can generate 12-bit HDR images from 8-bit modulo images reliably, and runs much faster than the previous MRF-based algorithm thanks to the GPU acceleration.


Supplementary Material UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging Chu Zhou 1 Hang Zhao 2 Jin Han 1 Chang Xu

Neural Information Processing Systems

We could apply a binary search to achieve this, as shown in Algorithm 1 below. The formation of a spike can be expressed as an "accumulate-fire-reset" cycle: The This signal also resets the corresponding accumulator, in which all the electric charges are drained ( i.e ., resets Specifically, the sensor checks the accumulators periodically within a fixed interval.



Review for NeurIPS paper: UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging

Neural Information Processing Systems

Weaknesses: My primary concern with this paper is that the problem it is addressing is *extremely* niche --- Modulo cameras are a somewhat obscure problem even within the realm of the computational imaging community. If I was reviewing this paper for a computational imaging/photography conference, I would be more charitable towards this paper. But this subject is unlikely to be of interest to the general NeurIPS audience, and this paper seems unlikely to reach its intended audience if presented at NeurIPS. And the specifics of this neural network architecture are so specifically tailored to this particular problem that I'm not sure what a general ML researcher could come away from this paper with, nor am I convinced that this is a problem that should be popularized with ML researchers as, again, a solution to this problem has limited practical value given that modulo cameras are still a largely hypothetical concept. My other concern with this paper (which would be a significant concern even if I were reviewing this paper in a computational imaging conference) is that the baseline evaluation is misleading.


Review for NeurIPS paper: UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging

Neural Information Processing Systems

The submission has received two positive and two negative reviews. The post-rebuttal discussion has not lead to convergence, and the opinion of the reviewers remain split. The concerns of the "negative" reviewers are: 1) The application is too niche (R1). However, the topic of the paper falls into NeurIPS call for papers, as it is related to low-level computer vision, compressed sensing, deep neural architectures. The authors rebut that the results in [55] were cherry-picked and that they use the code from [55], while fixing the parameters.


UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging

Neural Information Processing Systems

A conventional camera often suffers from over- or under-exposure when recording a real-world scene with a very high dynamic range (HDR). In contrast, a modulo camera with a Markov random field (MRF) based unwrapping algorithm can theoretically accomplish unbounded dynamic range but shows degenerate performances when there are modulus-intensity ambiguity, strong local contrast, and color misalignment. In this paper, we reformulate the modulo image unwrapping problem into a series of binary labeling problems and propose a modulo edge-aware model, named as UnModNet, to iteratively estimate the binary rollover masks of the modulo image for unwrapping. Experimental results show that our approach can generate 12-bit HDR images from 8-bit modulo images reliably, and runs much faster than the previous MRF-based algorithm thanks to the GPU acceleration.