version.53

Neural Information Processing Systems 

Why the escort transform:This transform is a natural choice due to its simplicity, and has a history in the physics28 literature [2]. Empirical evidence in SL shows that escort withp=2 performs better than softmax in MNIST and29 CIFAR-10[5]. Learning rate analysis for EPG:It is easy to establish thatkθtkp is finitely bounded from above and below.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found