EditGen: Harnessing Cross-Attention Control for Instruction-Based Auto-Regressive Audio Editing