Rewarding Graph Reasoning Process makes LLMs more Generalized Reasoners