Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation