Model-free Reinforcement Learning of Semantic Communication by Stochastic Policy Gradient

Beck, Edgar, Bockelmann, Carsten, Dekorsy, Armin

May-5-2023–arXiv.org Artificial Intelligence

Motivated by the recent success of Machine Learning tools in wireless communications, the idea of semantic communication by Weaver from 1949 has gained attention. It breaks with Shannon's classic design paradigm by aiming to transmit the meaning, i.e., semantics, of a message instead of its exact version, allowing for information rate savings. In this work, we apply the Stochastic Policy Gradient (SPG) to design a semantic communication system by reinforcement learning, not requiring a known or differentiable channel model - a crucial step towards deployment in practice. Further, we motivate the use of SPG for both classic and semantic communication from the maximization of the mutual information between received and target variables. Numerical results show that our approach achieves comparable performance to a model-aware approach based on the reparametrization trick, albeit with a decreased convergence rate.

communication, machine learning, reinforcement learning, (13 more...)

arXiv.org Artificial Intelligence

May-5-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Illinois (0.04)
- Europe > Germany
  - Bremen > Bremen (0.28)

Genre:
- Research Report > New Finding (0.48)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.70)
  - Neural Networks > Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found