Effective internal language model training and fusion for factorized transducer model