Kernel Deformed Exponential Families for Sparse Continuous Attention