Appendix: Online Learning in Contextual Bandits using Gated Linear Networks

Marcus Hutter