Dynamic Sparse Training of Diagonally Sparse Networks