Linear Attention for Efficient Bidirectional Sequence Modeling