Are generative deep models for novelty detection truly better?