Model-free Reinforcement Learning of Semantic Communication by Stochastic Policy Gradient
