Improving exploration in policy gradient search: Application to symbolic optimization

Open in new window