See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm