Reinforcement Learning from Bagged Reward: A Transformer-based Approach for Instance-Level Reward Redistribution