Inverse design of the transmission matrix in a random system using Reinforcement Learning