A Details of based algorithms

Neural Information Processing Systems 

Here, if we use the model's last layer denoted by