Diversity-driven Data Selection for Language Model Tuning through Sparse Autoencoder