Appendix - Hard-Attention for Scalable Image Classification