Achieving Sparse Activation in Small Language Models