Evaluating the Safety and Skill Reasoning of Large Reasoning Models Under Compute Constraints