NeRF-Insert: 3D Local Editing with Multimodal Control Signals