We evaluated adversarial robustness with CAA, an ensemble of gradient and search attacks which was recently demonstrated as the most effective attack against a tabular model.
Before deploying any newly developed policy, it is important to assess its impact. In many high-stakes domains, it is risky or unethical to implement such policies directly for online evaluation.
In many search applications related to passage retrieval, text entailment, and sub-graph search, the query and each'document' is a set of elements, with a document