Characterizing the Behavior of Training Mamba-based State Space Models on GPUs