Novel Policy Seeking with Constrained Optimization