Learning Human-Like RLAgents through Trajectory Optimization with Action Quantization

Open in new window