Learning Monotonic Attention in Transducer for Streaming Generation