The Role of Sparsity for Length Generalization in Transformers