Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization