Reviews: The Synthesis of XNOR Recurrent Neural Networks with Stochastic Logic

Neural Information Processing Systems 

This paper introduces interesting stochastic finite state machine based methods to approximate nonlinear activation functions including hyperbolic tangent and sigmoid functions. A fully binary model of LSTM (both weights and hidden states are binary) is constructed in which XNOR operations are used to perform all the multiplications in the gate and state computations. Empirical results show that the proposed binary LSTM model can dramatically reduce the computational lost while without sacrificing latency or accuracy comparing with existing methods. In the rebuttal, concerns from the reviewers are carefully addressed, e.g., adding an FPGA based implementation. However, some of them are still lack of sufficient details and discussions, in particular, the cost of stochastic computing, and the memory movement cost.