Can LLM find the green circle? Investigation and Human-guided tool manipulation for compositional generalization