Automatic Rule Induction for Interpretable Semi-Supervised Learning

Reid Pryzant, Ziyi Yang, Yichong Xu, Chenguang Zhu, Michael Zeng

Oct-14-2022–arXiv.org Artificial Intelligence

Semi-supervised learning has shown promise in allowing NLP models to generalize from small amounts of labeled data. Meanwhile, pretrained transformer models act as black-box correlation engines that are difficult to explain and sometimes behave unreliably. In this paper, we propose tackling both of these challenges via Automatic Rule Induction (ARI), a simple and general-purpose framework for the automatic discovery and integration of symbolic rules into pretrained transformer models. First, we extract weak symbolic rules from low-capacity machine learning models trained on small amounts of labeled data. Next, we use an attention mechanism to integrate these rules into high-capacity pretrained transformer models. Last, the rule-augmented system becomes part of a self-training framework to boost supervision signal on unlabeled data. These steps can be layered beneath a variety of existing weak supervision and semi-supervised NLP algorithms in order to improve performance and interpretability. Experiments across nine sequence classification and relation extraction tasks suggest that ARI can improve state-of-the-art methods with no manual effort and minimal computational overhead.
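The three steps named in the abstract (rule extraction, attention-based integration, self-training) can be illustrated with a toy sketch. The abstract does not specify how any step is implemented, so everything below is an assumption made for illustration: rules are taken to be "token present implies label" patterns filtered by support and precision, the attention mechanism is replaced by a plain softmax over per-source scores, and self-training is reduced to confidence-thresholded pseudo-labeling. The names `induce_rules`, `apply_rules`, `combine`, and `pseudo_label` are hypothetical and do not come from the paper.

```python
import math
from collections import Counter, defaultdict

ABSTAIN = None  # returned when no label is confident enough


def induce_rules(texts, labels, min_support=2, min_precision=0.9):
    """Assumed rule form: 'if token appears, predict label'.
    Keep tokens that are frequent and precise enough on the labeled set."""
    counts = defaultdict(Counter)
    for text, label in zip(texts, labels):
        for token in set(text.lower().split()):
            counts[token][label] += 1
    rules = {}
    for token, label_counts in counts.items():
        label, hits = label_counts.most_common(1)[0]
        support = sum(label_counts.values())
        if support >= min_support and hits / support >= min_precision:
            rules[token] = label
    return rules


def apply_rules(rules, text):
    """Return the rules that fire on this text as {token: label}."""
    tokens = set(text.lower().split())
    return {tok: lab for tok, lab in rules.items() if tok in tokens}


def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def combine(model_probs, fired, rule_scores, model_score=0.0):
    """Softmax-weighted mixture of the model's distribution and the one-hot
    votes of fired rules (a stand-in for a learned attention layer).
    The weights double as a per-prediction attribution."""
    sources = [("model", model_probs)]
    scores = [model_score]
    for tok, lab in fired.items():
        one_hot = {c: 1.0 if c == lab else 0.0 for c in model_probs}
        sources.append((tok, one_hot))
        scores.append(rule_scores.get(tok, 0.0))
    weights = softmax(scores)
    mixed = {c: sum(w * dist[c] for w, (_, dist) in zip(weights, sources))
             for c in model_probs}
    attribution = {name: w for w, (name, _) in zip(weights, sources)}
    return mixed, attribution


def pseudo_label(mixed, threshold=0.65):
    """Self-training step: keep the prediction only if it is confident."""
    label = max(mixed, key=mixed.get)
    return label if mixed[label] >= threshold else ABSTAIN


# Toy labeled set (made up for this sketch).
texts = ["great movie loved it", "great acting", "terrible plot",
         "terrible and boring", "fine movie"]
labels = ["pos", "pos", "neg", "neg", "pos"]
rules = induce_rules(texts, labels)

# An unlabeled example, with placeholder probabilities standing in for a
# pretrained transformer's output.
fired = apply_rules(rules, "a terrible film")
mixed, attribution = combine({"pos": 0.6, "neg": 0.4}, fired, rule_scores={})
print(mixed, attribution, pseudo_label(mixed))
```

In this sketch the mixture weights show which rule, if any, influenced a prediction, which is one simple way the interpretability claim could be made concrete. In a real system the per-source scores would presumably be learned rather than fixed at zero.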



