Combinational Q-Learning for Dou Di Zhu