Reviews: Deep Model Transferability from Attribution Maps

Neural Information Processing Systems 

The transferabilities computed by Taskonomy have practical value (they have been constructed and shown to reduce the need for supervision through transfer learning), but Taskonomy's method is computationally expensive. The gold standard is therefore to reproduce Taskonomy's affinity matrix at lower computational cost. I thus see the comparison between the attribution-map transferability matrix and Taskonomy's (Figure 4) as valid, and I take it to be the main point of the paper. However, I don't understand why or how the comparison between the SVCCA and attribution-map similarity matrices (Figure 3) is useful. What exactly is the value of the SVCCA-based similarity matrix? Why doesn't Figure 3 compare the attribution-map matrix against Taskonomy's affinity matrix (after it has been made symmetric)?
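
To make the suggested comparison concrete, here is a minimal sketch of what I have in mind. It is only an illustration under my own assumptions: the affinity matrix is symmetrized by averaging it with its transpose, agreement is measured with a Spearman rank correlation over the off-diagonal entries, and the matrices hold toy placeholder values, not numbers from the paper or from Taskonomy. The function names are hypothetical, and the authors may prefer a different symmetrization or agreement measure.

```python
import numpy as np

def symmetrize(affinity):
    """Average an asymmetric affinity matrix with its transpose (one possible choice)."""
    affinity = np.asarray(affinity, dtype=float)
    return (affinity + affinity.T) / 2.0

def upper_triangle(matrix):
    """Return the off-diagonal upper-triangle entries as a flat vector."""
    rows, cols = np.triu_indices(matrix.shape[0], k=1)
    return matrix[rows, cols]

def spearman(x, y):
    """Spearman rank correlation; this simple version assumes no tied values."""
    rank_x = np.argsort(np.argsort(x))
    rank_y = np.argsort(np.argsort(y))
    return float(np.corrcoef(rank_x, rank_y)[0, 1])

# Toy placeholder values for three tasks, NOT results from the paper.
taskonomy_affinity = np.array([
    [1.0, 0.8, 0.1],
    [0.6, 1.0, 0.3],
    [0.2, 0.5, 1.0],
])
attribution_similarity = np.array([
    [1.0, 0.9, 0.2],
    [0.9, 1.0, 0.5],
    [0.2, 0.5, 1.0],
])

# Symmetrize the affinity matrix, then compare pairwise task rankings.
symmetric_affinity = symmetrize(taskonomy_affinity)
score = spearman(upper_triangle(symmetric_affinity),
                 upper_triangle(attribution_similarity))
print(score)
```

A rank-based score is used here because the two matrices need not share a scale; only the ordering of task pairs is compared.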