Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning