Appendix A Implementation of Taylor Expansion on Unit Hamming Sphere

Neural Information Processing Systems 

Follow the discussion in Section 3.1.2 In this section, we continue the discussion in Section 3.3 and obtain the form used in (25). The architecture of the discriminator is shown in Table 3. Exponential Linear Units [ One of the issue with BLEU is that in the case that a higher order n-gram precision of a sentence is 0, then the BLEU score will be 0, resulting in severely underestimation. This is due to the fact that BLEU is calculated by the geometric mean of precision. Sentences in the COCO dataset have a maximum length of 24 tokens and a vocabulary of 4.6k Training and validation data both consist of 10k sentences.