Reinforcement Learning Based Goodput Maximization with Quantized Feedback in URLLC