SkyMath: Technical Report
Yang, Liu, Yang, Haihua, Cheng, Wenjun, Lin, Lei, Li, Chenxia, Chen, Yifu, Liu, Lunan, Pan, Jianfei, Wei, Tianwen, Li, Biye, Zhao, Liang, Wang, Lijie, Zhu, Bo, Li, Guoliang, Wu, Xuejie, Luo, Xilin, Hu, Rui
–arXiv.org Artificial Intelligence
Large language models (LLMs) have shown great potential on a wide variety of natural language processing (NLP) tasks, including mathematical reasoning. By applying self-compare fine-tuning, we substantially improve the mathematical reasoning abilities of Skywork-13B-Base. On GSM8K, SkyMath outperforms all known open-source models of similar size, establishing a new state of the art (SOTA). On the MATH dataset and the out-of-domain dataset CMath, SkyMath also achieves high accuracy, showing strong generalization across a variety of math problems. Moreover, compared to traditional AI methods, LLMs hold unparalleled advantages in these areas.
Oct-26-2023
- Country:
- North America > Canada (0.14)
- Genre:
- Research Report (0.50)
- Industry:
- Education (0.94)
- Technology: