Truly Deterministic Policy Optimization, Matthew West