Reasoning With a Star: A Heliophysics Dataset and Benchmark for Agentic Scientific Reasoning