Towards Interpretable and Inference-Optimal COT Reasoning with Sparse Autoencoder-Guided Generation