chinese chess
Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning
Chen, Yuhao, Liu, Shuochen, Lyu, Yuanjie, Zhang, Chao, Shi, Jiayao, Xu, Tong
Game playing has long served as a fundamental benchmark for evaluating Artificial General Intelligence. While Large Language Models (LLMs) have demonstrated impressive capabilities in general reasoning, their effectiveness in spatial strategic reasoning, which is critical for complex and fully observable board games, remains insufficiently explored. In this work, we adopt Chinese Chess (Xiangqi) as a challenging and rich testbed due to its intricate rules and spatial complexity. To advance LLMs' strategic competence in such environments, we propose a training framework tailored to Xiangqi, built upon a large-scale dataset of five million board-move pairs enhanced with expert annotations and engine evaluations. Building on this foundation, we introduce Xiangqi-R1, a 7B-parameter model trained in multi-stage manner. Our Experimental results indicate that, despite their size and power, general-purpose LLMs struggle to achieve satisfactory performance in these tasks. Compared to general-purpose LLMs, Xiangqi-R1 greatly advances with an 18% rise in move legality and a 22% boost in analysis accuracy. Our results point to a promising path for creating general strategic intelligence in complex areas.
Mastering Chinese Chess AI (Xiangqi) Without Search
Chen, Yu, Lin, Juntong, Shu, Zhichao
We have developed a high-performance Chinese Chess AI that operates without reliance on search algorithms. This AI has demonstrated the capability to compete at a level commensurate with the top 0.1\% of human players. By eliminating the search process typically associated with such systems, this AI achieves a Queries Per Second (QPS) rate that exceeds those of systems based on the Monte Carlo Tree Search (MCTS) algorithm by over a thousandfold and surpasses those based on the AlphaBeta pruning algorithm by more than a hundredfold. The AI training system consists of two parts: supervised learning and reinforcement learning. Supervised learning provides an initial human-like Chinese chess AI, while reinforcement learning, based on supervised learning, elevates the strength of the entire AI to a new level. Based on this training system, we carried out enough ablation experiments and discovered that 1. The same parameter amount of Transformer architecture has a higher performance than CNN on Chinese chess; 2. Possible moves of both sides as features can greatly improve the training process; 3. Selective opponent pool, compared to pure self-play training, results in a faster improvement curve and a higher strength limit. 4. Value Estimation with Cutoff(VECT) improves the original PPO algorithm training process and we will give the explanation.
- Asia > Middle East > Jordan (0.04)
- Asia > China (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)
AI-powered Chinese chess robot triumphs over grandmasters
SenseRobot, a physical artificial intelligence-powered robot, made Chinese chess history recently when it beat two professional human rivals, during a livestreamed event to an audience of 850,000. The SenseRobot AI Xiangqi Championship, which was held in Shanghai, was the first Chinese chess competition that has featured an AI-powered robot that plays Chinese chess face to face with human grandmasters. Co-hosted by leading artificial intelligence software company SenseTime, developer of SenseRobot, and Shanghai Chess Academy, the championship had the robot play against Xie Jing, a world champion, and Gu Bowen, a national youth champion. SenseRobot beat Gu at a level-16 game, while Xie failed in his challenge against the robot in a game at level 26, the most difficult level of the game. "Unlike traditional AI Chinese chess software, I was most impressed with SenseRobot's agility and steady operation, as well as its ability to calmly play the game, just like a real player," said Xie, who is an apprentice of legendary grandmaster Hu Ronghua and currently serves as a coach and player for the Shanghai Chinese Chess Team.
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Games > Chess (1.00)
On the complexity of Dark Chinese Chess
This paper provides a complexity analysis for the game of dark Chinese chess (a.k.a. "JieQi"), a variation of Chinese chess. Dark Chinese chess combines some of the most complicated aspects of board and card games, such as long-term strategy or planning, large state space, stochastic, and imperfect-information, which make it closer to the real world decision-making problem and pose great challenges to game AI. Here we design a self-play program to calculate the game tree complexity and average information set size of the game, and propose an algorithm to calculate the number of information sets.
- Asia > China > Hubei Province > Wuhan (0.05)
- North America > United States > Texas (0.04)
- Asia > Vietnam (0.04)