Tool Learning with Foundation Models

Qin, Yujia, Hu, Shengding, Lin, Yankai, Chen, Weize, Ding, Ning, Cui, Ganqu, Zeng, Zheni, Huang, Yufei, Xiao, Chaojun, Han, Chi, Fung, Yi Ren, Su, Yusheng, Wang, Huadong, Qian, Cheng, Tian, Runchu, Zhu, Kunlun, Liang, Shihao, Shen, Xingyu, Xu, Bokai, Zhang, Zhen, Ye, Yining, Li, Bowen, Tang, Ziwei, Yi, Jing, Zhu, Yuzhang, Dai, Zhenning, Yan, Lan, Cong, Xin, Lu, Yaxi, Zhao, Weilin, Huang, Yuxiang, Yan, Junxi, Han, Xu, Sun, Xian, Li, Dahai, Phang, Jason, Yang, Cheng, Wu, Tongshuang, Ji, Heng, Liu, Zhiyuan, Sun, Maosong

Jun-15-2023–arXiv.org Artificial Intelligence

Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced accuracy, efficiency, and automation in problem-solving. Despite its immense potential, there is still a lack of a comprehensive understanding of key challenges, opportunities, and future endeavors in this field. To this end, we present a systematic investigation of tool learning in this paper. We first introduce the background of tool learning, including its cognitive origins, the paradigm shift of foundation models, and the complementary roles of tools and models. Then we recapitulate existing tool learning research into tool-augmented and tool-oriented learning. We formulate a general tool learning framework: starting from understanding the user instruction, models should learn to decompose a complex task into several subtasks, dynamically adjust their plan through reasoning, and effectively conquer each sub-task by selecting appropriate tools. We also discuss how to train models for improved tool-use capabilities and facilitate the generalization in tool learning. Considering the lack of a systematic tool learning evaluation in prior works, we experiment with 18 representative tools and show the potential of current foundation models in skillfully utilizing tools. Finally, we discuss several open problems that require further investigation for tool learning. Overall, we hope this paper could inspire future research in integrating tools with foundation models.

large language model, machine learning, programming language, (25 more...)

arXiv.org Artificial Intelligence

Jun-15-2023

arXiv.org PDF

Add feedback

Country:
- Oceania
  - New Zealand > North Island
    - Auckland Region > Auckland (0.04)
  - Australia
    - Victoria > Melbourne (0.04)
    - New South Wales > Sydney (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - Colorado (0.04)
    - Wyoming (0.04)
    - South Dakota (0.04)
    - Kansas (0.04)
    - New Mexico (0.04)
    - Nebraska (0.04)
    - Maryland > Baltimore (0.04)
    - Rocky Mountains (0.04)
    - Oklahoma (0.04)
    - District of Columbia > Washington (0.04)
    - Montana (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Illinois > Champaign County
      - Urbana (0.04)
    - Texas > Yoakum County
      - Plains (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Massachusetts > Middlesex County
      - Natick (0.04)
    - Washington > King County
      - Seattle (0.04)
    - California
      - Los Angeles County > Long Beach (0.04)
      - San Francisco County > San Francisco (0.04)
    - New York > New York County
      - New York City (0.04)
  - Canada
    - Rocky Mountains (0.04)
    - Quebec > Montreal (0.04)
    - Ontario > Middlesex County
      - London (0.04)
    - British Columbia > Metro Vancouver Regional District
      - Vancouver (0.04)
    - Alberta > Census Division No. 15
      - Improvement District No. 9 > Banff (0.13)
- Europe
  - France (0.04)
  - Austria (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Spain > Galicia
    - Madrid (0.04)
  - Czechia > Olomouc Region
    - Olomouc (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Switzerland
    - St. Gallen > St. Gallen (0.04)
    - Appenzell Innerrhoden > Appenzell (0.04)
    - Glarus > Glarus (0.04)
  - Portugal > Lisbon
    - Lisbon (0.04)
  - Russia
    - Southern Federal District (0.04)
    - Northwestern Federal District > Leningrad Oblast
      - Saint Petersburg (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - United Kingdom
    - Scotland > City of Edinburgh
      - Edinburgh (0.04)
    - England
      - Greater London > London (0.04)
      - Cambridgeshire > Cambridge (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Nepal (0.04)
  - Pakistan (0.04)
  - Russia (0.04)
  - Middle East > Jordan (0.04)
  - India (0.04)
  - South Korea > Seoul
    - Seoul (0.04)
  - China
    - Beijing > Beijing (0.04)
    - Shanghai > Shanghai (0.04)
    - Jiangsu Province > Nanjing (0.04)
    - Hong Kong (0.04)
- Africa > Ethiopia
  - Addis Ababa > Addis Ababa (0.04)

Genre:
- Research Report (1.00)
- Overview (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)
- Energy (1.00)
- Transportation (1.00)
- Consumer Products & Services (0.92)
- Banking & Finance (0.92)
- Education > Educational Setting (0.92)
- Media > Film (0.67)
- Leisure & Entertainment > Games (0.67)
- Government > Regional Government
  - North America Government > United States Government (0.67)
- Health & Medicine > Therapeutic Area
  - Neurology (0.92)

Technology:
- Information Technology
  - Software > Programming Languages (1.00)
  - Security & Privacy (1.00)
  - Information Management > Search (1.00)
  - Human Computer Interaction > Interfaces (1.00)
  - Databases (1.00)
  - Data Science (1.00)
  - Communications
    - Web (1.00)
    - Social Media (1.00)
  - Artificial Intelligence
    - Representation & Reasoning > Agents (1.00)
    - Cognitive Science > Problem Solving (1.00)
    - Robots > Autonomous Vehicles (0.92)
    - Natural Language
      - Large Language Model (1.00)
      - Chatbot (1.00)
      - Information Retrieval (0.68)
    - Machine Learning
      - Reinforcement Learning (1.00)
      - Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found