Improving thermal state preparation of Sachdev-Ye-Kitaev model with reinforcement learning on quantum hardware