Appendix: Symbolic Distillation for Learned TCP Congestion Control
S P Sharan

Neural Information Processing Systems 

We now specify how we build the DRL behavior dataset and process it into a symbolic-regression-friendly format. The fitness metric driving our evolution is simply the MSE between the predicted action and the "expert" action (the teacher model's action); it serves as an indicator of how well the population of genetic programs performs. We follow five different evolution schemes, one of which is picked stochastically. A mutant variant produced from two source programs carries forth genetic material from both of them.
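As a minimal sketch of the fitness computation and the stochastic choice of evolution scheme described above, the following Python snippet may help. It assumes each candidate program is a plain callable mapping a state vector to a scalar action. The names `mse_fitness`, `pick_scheme`, the placeholder scheme labels, and the uniform selection weights are hypothetical illustrations, not taken from our implementation.

```python
import random
from typing import Callable, List, Sequence

State = Sequence[float]
Program = Callable[[State], float]  # assumed form of a candidate genetic program


def mse_fitness(program: Program,
                states: Sequence[State],
                expert_actions: Sequence[float]) -> float:
    """Mean squared error between the program's actions and the teacher's actions."""
    if len(states) != len(expert_actions) or len(states) == 0:
        raise ValueError("states and expert_actions must be non-empty and equal in length")
    squared_errors = [(program(s) - a) ** 2 for s, a in zip(states, expert_actions)]
    return sum(squared_errors) / len(squared_errors)


def pick_scheme(schemes: List[str],
                weights: Sequence[float],
                rng: random.Random) -> str:
    """Stochastically pick one evolution scheme according to the given weights."""
    return rng.choices(schemes, weights=weights, k=1)[0]


# Toy behavior dataset: (state, expert action) pairs; values are illustrative only.
states = [(1.0, 2.0), (0.0, 1.0), (3.0, 1.0)]
expert_actions = [3.0, 1.0, 4.0]

# Two hypothetical candidate programs from the population.
candidate_a = lambda s: s[0] + s[1]  # reproduces the expert on this toy data
candidate_b = lambda s: s[0]         # ignores the second input

population_fitness = [mse_fitness(p, states, expert_actions)
                      for p in (candidate_a, candidate_b)]

# Placeholder labels for the five schemes; uniform weights are an assumption.
schemes = ["scheme_1", "scheme_2", "scheme_3", "scheme_4", "scheme_5"]
chosen = pick_scheme(schemes, [1.0] * 5, random.Random(0))
```

A lower MSE means the program imitates the teacher more closely, so in this toy population `candidate_a` would be preferred over `candidate_b`.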