NuTrea: Neural Tree Search for Context-guided Multi-hop KGQA (Supplement)

Neural Information Processing Systems 

The initial learning rate is set to 0.0005 and is decayed by an exponential learning rate scheduler with a decay rate of 0.99. We use the KL divergence as the loss function.
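For concreteness, the schedule and loss can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the released implementation: the function names, the unit of a decay step (for example, one epoch), the direction of the divergence (target distribution against predicted distribution), and the `eps` clamp are all hypothetical. Only the values 0.0005 and 0.99 come from the text.

```python
import math

INITIAL_LR = 0.0005  # initial learning rate, as stated in the text
DECAY_RATE = 0.99    # exponential decay rate, as stated in the text


def exponential_lr(step, initial_lr=INITIAL_LR, decay_rate=DECAY_RATE):
    """Learning rate after `step` decay steps.

    Assumption: what counts as one step (epoch or iteration) is not
    specified in the text.
    """
    return initial_lr * decay_rate ** step


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted) for two discrete distributions.

    Assumption: both inputs are probability vectors over the same
    candidate set. Terms with zero target mass contribute nothing, and
    predicted probabilities are clamped at `eps` to avoid log(0).
    """
    return sum(
        t * (math.log(t) - math.log(max(p, eps)))
        for t, p in zip(target, predicted)
        if t > 0.0
    )
```

Under these assumptions, `exponential_lr(k)` equals 0.0005 multiplied by 0.99 raised to the power `k`, and `kl_divergence` is zero when the predicted distribution matches the target exactly.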