A Extending to Multi-Round Communications
The formulation in Section 3 can be extended to multiple rounds of communication per time step. We synthesize these programs independently. Our synthesis algorithm has four main hyper-parameters, which we chose using cross-validation.

Figure 7: Comparing the program policy with an RL policy that treats communications as actions.