Designing Time Series Experiments in A/B Testing with Transformer Reinforcement Learning