Sample-based Dynamic Hierarchical Transformer with Layer and Head Flexibility via Contextual Bandit