Online Statistical Inference of Constant Sample-averaged Q-Learning