Imperfect Digital Twin Assisted Low Cost Reinforcement Training for Multi-UAV Networks