Nonconvex sparse regularization for deep neural networks and its optimality