Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning

Open in new window