STEPS: A Benchmark for Order Reasoning in Sequential Tasks