
Collaborating Authors

 Hou, Bairu


TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization

arXiv.org Artificial Intelligence

Robustness evaluation against adversarial examples has become increasingly important for assessing the trustworthiness of prevailing deep models in natural language processing (NLP). However, in contrast to the computer vision domain, where first-order projected gradient descent (PGD) serves as the benchmark approach for generating adversarial examples for robustness evaluation, NLP lacks a principled first-order gradient-based robustness evaluation framework. The optimization challenges lie in 1) the discrete nature of textual inputs together with the strong coupling between the perturbation location and the actual content, and 2) the additional constraint that the perturbed text should be fluent and achieve a low perplexity under a language model. These challenges make the development of PGD-like NLP attacks difficult. To bridge the gap, we propose TextGrad, a new attack generator using gradient-driven optimization, supporting high-accuracy and high-quality assessment of adversarial robustness in NLP. Specifically, we address these challenges in a unified optimization framework: we develop an effective convex relaxation method to co-optimize the continuously relaxed site-selection and perturbation variables, and leverage an effective sampling method to establish an accurate mapping from the continuous optimization variables to the discrete textual perturbations. Moreover, as a first-order attack generation method, TextGrad can be baked into adversarial training to further improve the robustness of NLP models. Extensive experiments demonstrate the effectiveness of TextGrad not only in attack generation for robustness evaluation but also in adversarial defense.
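
The co-optimization and sampling steps described in the abstract can be sketched with first-order updates on relaxed variables. The following is a minimal illustrative example, not the authors' TextGrad implementation: the toy linear victim, the sigmoid/softmax relaxation of the site-selection and candidate-selection variables, the sparsity penalty, and the Bernoulli/multinomial sampling step are all assumptions made for illustration, and the fluency/perplexity constraint is omitted.

import torch
import torch.nn.functional as F

torch.manual_seed(0)
seq_len, n_cands, embed_dim, n_classes = 8, 5, 16, 2

# Toy differentiable victim: mean-pooled embeddings -> linear classifier.
classifier = torch.nn.Linear(embed_dim, n_classes)
def victim(embeds):                          # embeds: (seq_len, embed_dim)
    return classifier(embeds.mean(dim=0))

# Original token embeddings and, per site, embeddings of substitution candidates
# (e.g., synonyms); in practice these come from the victim's embedding table.
orig_embeds = torch.randn(seq_len, embed_dim)
candidate_embeds = torch.randn(seq_len, n_cands, embed_dim)
true_label = torch.tensor(0)

# Continuously relaxed variables: z = site-selection logits,
# u = per-site candidate-selection logits; both updated with gradients.
z = torch.zeros(seq_len, requires_grad=True)
u = torch.zeros(seq_len, n_cands, requires_grad=True)
opt = torch.optim.Adam([z, u], lr=0.1)

for step in range(100):
    site_prob = torch.sigmoid(z)             # P(perturb site i)
    cand_prob = F.softmax(u, dim=-1)         # P(candidate j | site i)
    # Convex combination: expected embedding under the relaxed variables.
    sub_embeds = (cand_prob.unsqueeze(-1) * candidate_embeds).sum(dim=1)
    mixed = (1 - site_prob).unsqueeze(-1) * orig_embeds + site_prob.unsqueeze(-1) * sub_embeds
    logits = victim(mixed)
    # Ascend the classification loss (untargeted attack) with a sparsity
    # penalty that keeps the number of perturbed sites small.
    loss = -F.cross_entropy(logits.unsqueeze(0), true_label.unsqueeze(0)) + 0.05 * site_prob.sum()
    opt.zero_grad()
    loss.backward()
    opt.step()

# Map the relaxed solution back to a discrete perturbation by sampling.
with torch.no_grad():
    perturb_site = torch.bernoulli(torch.sigmoid(z)).bool()
    chosen = torch.multinomial(F.softmax(u, dim=-1), 1).squeeze(-1)
    adv_embeds = orig_embeds.clone()
    adv_embeds[perturb_site] = candidate_embeds[perturb_site, chosen[perturb_site]]
    print("perturbed sites:", perturb_site.nonzero().flatten().tolist())
    print("adversarial prediction:", victim(adv_embeds).argmax().item())

The convex combination of original and candidate embeddings is what makes the attack loss differentiable with respect to both the site-selection and the perturbation variables, which is the property the unified framework relies on.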


Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations

arXiv.org Artificial Intelligence

Adversarial attacks aim to fool deep neural networks with adversarial examples. In the field of natural language processing, various textual adversarial attack models have been proposed, which differ in their level of access to the victim model. Among them, attack models that only require the output of the victim model are better suited to real-world attack scenarios. However, to achieve high attack performance, these models usually need to query the victim model an excessive number of times, which is neither efficient nor viable in practice. To tackle this problem, we propose a reinforcement-learning-based attack model, which can learn from attack history and launch attacks more efficiently. In experiments, we evaluate our model by attacking several state-of-the-art models on benchmark datasets for multiple tasks, including sentiment analysis, text classification, and natural language inference. Experimental results demonstrate that our model consistently achieves both better attack performance and higher efficiency than recently proposed baseline methods. We also find that our attack model yields larger robustness improvements for the victim model when used in adversarial training. All the code and data of this paper will be made public.
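
A query-based attack that learns from attack history, as the abstract describes, can be sketched as a simple policy-gradient loop in which the agent sees only the victim model's output confidence. The toy victim, the bandit-style policy over perturbation positions, and the reward defined as the drop in victim confidence are illustrative assumptions, not the paper's actual model or reward design.

import numpy as np

rng = np.random.default_rng(0)
seq_len = 10

# Black-box victim: only a confidence score for the original label is visible.
# Toy scorer in which a few "important" positions dominate the prediction.
important = {2, 5, 7}
def victim_confidence(perturbed_positions):
    hits = len(important & set(perturbed_positions))
    return max(0.0, 0.95 - 0.3 * hits)

# Policy over which position to perturb next (softmax over learned logits).
logits = np.zeros(seq_len)
lr, budget = 0.5, 3                      # query budget per attack episode

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

for episode in range(200):
    perturbed, rewards = [], []
    conf = victim_confidence(perturbed)
    probs = softmax(logits)              # policy is fixed within an episode
    for _ in range(budget):              # each step costs one victim query
        pos = int(rng.choice(seq_len, p=probs))
        perturbed.append(pos)
        new_conf = victim_confidence(perturbed)
        rewards.append(conf - new_conf)  # reward = drop in victim confidence
        conf = new_conf
    # REINFORCE update from this episode's attack history.
    ret = sum(rewards)
    for pos in perturbed:
        grad = -probs
        grad[pos] += 1.0                 # gradient of log pi(pos) w.r.t. logits
        logits = logits + lr * ret * grad

print("positions the policy learned to attack first:", np.argsort(-logits)[:3].tolist())

Because the policy is trained across episodes, later attacks concentrate their limited query budget on the positions that historically caused the largest confidence drops, which is the sense in which learning from attack history reduces the number of victim queries.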


OpenAttack: An Open-source Textual Adversarial Attack Toolkit

arXiv.org Artificial Intelligence

Deep neural networks (DNNs) have been shown to be susceptible to adversarial attacks (Szegedy et al., 2014; Goodfellow et al., 2015). The attacker uses adversarial examples, which are maliciously crafted by imposing small perturbations on the original input, to fool the victim model. With the wide application of DNNs to practical systems, accompanied by growing concern about their security, research on adversarial attacking has become increasingly important. OpenAttack has a systematic modular design, which disassembles many different attack models, extracts the common components, and wisely recombines them. More importantly, it has the following significant features:
- Full coverage of attack model types. OpenAttack currently includes 12 typical attack models, which cover all types of accessibility to the victim model and all perturbation levels.
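
The modular design described above, which disassembles attack models into shared components and recombines them, can be illustrated with a generic sketch. The component names (WordSubstitution, GreedySearch, Attacker), the greedy search strategy, and the toy victim are hypothetical and do not reflect OpenAttack's actual API.

from dataclasses import dataclass
from typing import Callable, Dict, List

Victim = Callable[[List[str]], float]        # tokens -> P(original label)

@dataclass
class WordSubstitution:
    """Transformation component: propose token-level substitutions."""
    synonyms: Dict[str, List[str]]

    def candidates(self, tokens: List[str], i: int) -> List[List[str]]:
        return [tokens[:i] + [s] + tokens[i + 1:]
                for s in self.synonyms.get(tokens[i], [])]

@dataclass
class GreedySearch:
    """Search component: greedily apply the most damaging substitution."""
    def run(self, tokens, victim, transform, is_success):
        current = list(tokens)
        for i in range(len(current)):
            if is_success(victim(current)):
                break                        # stop as soon as the attack succeeds
            cands = transform.candidates(current, i)
            if cands:
                current = min(cands, key=victim)  # lowest original-label probability
        return current

@dataclass
class Attacker:
    """Recombination: plug shared components together into a complete attack model."""
    transform: WordSubstitution
    search: GreedySearch
    is_success: Callable[[float], bool]

    def attack(self, tokens: List[str], victim: Victim) -> List[str]:
        return self.search.run(tokens, victim, self.transform, self.is_success)

# Usage with a toy victim whose confidence drops for every occurrence of "bad".
victim = lambda toks: 0.9 - 0.3 * toks.count("bad")
attacker = Attacker(
    transform=WordSubstitution({"good": ["bad"], "great": ["bad"]}),
    search=GreedySearch(),
    is_success=lambda p: p < 0.5,
)
print(attacker.attack(["the", "movie", "was", "good", "and", "great"], victim))

Swapping in a different transformation or search component yields a new attacker without touching the rest of the pipeline, which is the benefit the modular decomposition is meant to provide.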