Rethinking Table Instruction Tuning
–arXiv.org Artificial Intelligence
Recent advances in table understanding have focused on instruction-tuning large language models (LLMs) for table-related tasks. However, existing research has overlooked the impact of hyperparameter choices and lacks a comprehensive evaluation of the out-of-domain table understanding ability and the general capabilities of these table LLMs. In this paper, we evaluate these abilities in existing table LLMs, and reveal significant declines in both out-of-domain table understanding and general capabilities compared to their base models. Through systematic analysis, we show that hyperparameters, such as learning rate, can significantly influence both table-specific and general capabilities. Contrary to the existing table instruction-tuning works, we demonstrate that smaller learning rates and fewer training instances can enhance table understanding while preserving general capabilities. Based on our findings, we introduce TAMA, a TAble LLM instruction-tuned from LLaMA 3.1 8B Instruct, which achieves performance on par with, or surpassing GPT-3.5 and GPT-4 on table tasks, while maintaining strong out-of-domain generalization and general capabilities. Our findings highlight the potential for reduced data annotation costs and more efficient model development through careful hyperparameter selection.
arXiv.org Artificial Intelligence
Jan-24-2025
- Country:
- North America
- Dominican Republic (0.04)
- United States
- South Carolina (0.04)
- Texas (0.04)
- Michigan (0.04)
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Mexico > Mexico City
- Mexico City (0.04)
- Europe
- United Kingdom (0.04)
- Netherlands (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Asia
- Thailand > Bangkok
- Bangkok (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- China > Beijing
- Beijing (0.04)
- Thailand > Bangkok
- North America
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Leisure & Entertainment > Sports (0.67)
- Government > Military (0.46)
- Technology: