Brittleness and Promise: Knowledge Graph Based Reward Modeling for Diagnostic Reasoning