Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models