3D-Aware Image compositing with Language Instructions