version.53
–Neural Information Processing Systems
Why the escort transform:This transform is a natural choice due to its simplicity, and has a history in the physics28 literature [2]. Empirical evidence in SL shows that escort withp=2 performs better than softmax in MNIST and29 CIFAR-10[5]. Learning rate analysis for EPG:It is easy to establish thatkθtkp is finitely bounded from above and below.
Neural Information Processing Systems
Feb-11-2026, 02:02:54 GMT
- Technology: