Reinforcement Learning-driven Information Seeking: A Quantum Probabilistic Approach