Supplementary Materials Evaluation beyond Task Performance: Analyzing Concepts in AlphaZero playing Hex
–Neural Information Processing Systems
Appendix A reports implementation details, hyperparameters and compute requirements. Appendix B gives more details on each concept introduced in the main body of the paper. Appendix C demonstrates how AlphaZero often wastes moves. Appendix D has additional results across the different architectures. We use agents trained by Jones [5]. See Table 1 for hyperparameters and relative agent strengths.
Neural Information Processing Systems
Feb-9-2025, 21:54:08 GMT