Learning Optimal Admission Control in Partially Observable Queueing Networks