Optimization for Robustness Evaluation beyond $\ell_p$ Metrics