Paying Attention to Facts: Quantifying the Knowledge Capacity of Attention Layers