SALE : Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling