START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation