A Simple Weight Decay Can Improve Generalization