FlashAttention on a Napkin: A Diagrammatic Approach to Deep Learning IO-Awareness