Reviews: Mixtape: Breaking the Softmax Bottleneck Efficiently

Neural Information Processing Systems 

This paper proposes techniques to deal with the softmax bottleneck problem.

Pros
• Experimental results show strong performance in language modeling and machine translation.

Cons
• The writing could be improved by making the paper self-contained.

The paper represents solid work. The reviewers pointed out clarity issues.