PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning