TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
Zhang, Xiaokang, Zhang, Jing, Ma, Zeyao, Li, Yang, Zhang, Bohan, Li, Guanlin, Yao, Zijun, Xu, Kangli, Zhou, Jinchang, Zhang-Li, Daniel, Yu, Jifan, Zhao, Shu, Li, Juanzi, Tang, Jie
–arXiv.org Artificial Intelligence
We introduce TableLLM, a robust large language model (LLM) with 13 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to real-world office scenarios. We propose a distant supervision method for training, which comprises a reasoning process extension strategy, aiding in training LLMs to understand reasoning patterns more effectively as well as a cross-way validation strategy, ensuring the quality of the automatically generated data. To evaluate the performance of TableLLM, we have crafted a benchmark tailored to address both document and spreadsheet formats as well as constructed a well-organized evaluation pipeline capable of handling both scenarios. Thorough evaluations underscore the advantages of TableLLM when compared to various existing general-purpose and tabular data-focused LLMs.
arXiv.org Artificial Intelligence
Apr-1-2024
- Country:
- Asia
- Mongolia (0.04)
- Indonesia > Bali (0.04)
- Macao (0.04)
- North Korea (0.04)
- Japan (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- China
- South Korea (0.04)
- Singapore (0.04)
- India (0.15)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- France (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom (0.04)
- Belgium > Brussels-Capital Region
- North America > United States
- New York > New York County > New York City (0.04)
- Oceania > Guam (0.04)
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- Asia
- Genre:
- Instructional Material (0.45)
- Overview (0.67)
- Research Report (0.50)
- Industry:
- Government (0.68)
- Law Enforcement & Public Safety (0.46)
- Technology: