Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection