Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient Yewen Li