A Extending to Multi Round Communications

Neural Information Processing Systems 

The formulation in Section 3 can be extended to multiple rounds of communications per time step. We synthesize these programs independently. There are four main hyper-parameters in our synthesis algorithm. We used cross validation to choose these parameters. Figure 7: Comparing program policy with RL policy that treats communications as actions.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found