Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits