On Enhancing Network Throughput using Reinforcement Learning in Sliced Testbeds