Adversarial recovery of agent rewards from latent spaces of the limit order book