Learning Linear Attention in Polynomial Time