Appendix A Attention Mechanism

In this section, we give a brief overview of the most commonly used attention mechanisms.
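
As a concrete reference point, the following is a minimal sketch of scaled dot-product attention, one widely used attention mechanism. It is an illustration only and is not claimed to be the exact variant used in this work; the function names `softmax` and `scaled_dot_product_attention`, the list-of-lists data layout, and the toy inputs are assumptions made for this example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """For each query, return an attention-weighted average of the values.

    queries: list of n_q vectors, each of length d
    keys:    list of n_k vectors, each of length d
    values:  list of n_k vectors, each of length d_v
    (Hypothetical layout chosen for illustration.)
    """
    d = len(keys[0])
    d_v = len(values[0])
    outputs = []
    for q in queries:
        # Similarity of the query to every key, scaled by sqrt(d).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys
        ]
        # Normalize the similarities into weights that sum to one.
        weights = softmax(scores)
        # Weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values)) for j in range(d_v)
        ]
        outputs.append(out)
    return outputs


# Toy usage: two keys with one-hot values, two queries.
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[1.0, 0.0], [0.0, 1.0]]
queries = [[0.0, 0.0], [10.0, 0.0]]
result = scaled_dot_product_attention(queries, keys, values)
```

In this toy setup, a query that is equally similar to both keys averages the two values, while a query strongly aligned with the first key places almost all of its weight on the first value.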

Neural Information Processing Systems 

Comparisons of AHGP with the GPU-based baselines are presented in Figure 1 and Tables 8, 9, and 10. The results are similar to those of the comparisons with the CPU-based baselines in the main section.