Learning Two layer Networks with Multinomial Activation and High Thresholds