Learning Deterministic Weighted Automata with Queries and Counterexamples

Gail Weiss, Yoav Goldberg, Eran Yahav

Oct-30-2019–arXiv.org Machine Learning

We present an algorithm for extracting a probabilistic deterministic finite automaton (PDFA) from a given black-box language model, such as a recurrent neural network (RNN). The algorithm is a variant of the exact-learning algorithm L*, adapted to a probabilistic setting with noise. The key insight is to use conditional probabilities as observations and to introduce a local tolerance when comparing them. When applied to RNNs, our algorithm often achieves better word error rate (WER) and normalised discounted cumulative gain (NDCG) than spectral extraction of weighted finite automata (WFA) from the same networks. PDFAs are substantially more expressive than n-grams and, unlike spectrally extracted WFAs, are guaranteed to be stochastic and deterministic.
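
To make the idea of comparing conditional probabilities under a local tolerance concrete, here is a minimal sketch. It is an illustration under stated assumptions, not the paper's implementation: the names `t_equal`, `group_prefixes`, and `toy_next_token_distribution` are hypothetical, the tolerance is assumed to be a per-symbol absolute bound, and the greedy grouping is a simplification (tolerance-based equality is not transitive, so a full algorithm needs a more careful way of merging prefixes into states).

```python
from typing import Callable, Dict, List, Sequence

# A next-token distribution: symbol -> conditional probability given a prefix.
Distribution = Dict[str, float]


def t_equal(p: Distribution, q: Distribution, tolerance: float) -> bool:
    """Return True if p and q differ by at most `tolerance` on every symbol.

    Assumption: the tolerance is applied per symbol as an absolute bound.
    """
    symbols = set(p) | set(q)
    return all(abs(p.get(s, 0.0) - q.get(s, 0.0)) <= tolerance for s in symbols)


def toy_next_token_distribution(prefix: str) -> Distribution:
    """Hypothetical black-box model, standing in for an RNN.

    The distribution depends on the parity of the number of 'a's in the
    prefix, plus a small length-dependent perturbation that plays the role
    of noise. '$' denotes end of sequence.
    """
    noise = 0.01 * (len(prefix) % 3)
    if prefix.count("a") % 2 == 0:
        return {"a": 0.6 + noise, "b": 0.3 - noise, "$": 0.1}
    return {"a": 0.2 + noise, "b": 0.5 - noise, "$": 0.3}


def group_prefixes(
    prefixes: Sequence[str],
    model: Callable[[str], Distribution],
    tolerance: float,
) -> List[List[str]]:
    """Greedily group prefixes whose next-token distributions are t-equal.

    Each prefix joins the first group whose representative (its first
    member) is within tolerance; otherwise it starts a new group.
    Simplification: a real extraction procedure must also handle the
    non-transitivity of tolerance-based equality.
    """
    groups: List[List[str]] = []
    for prefix in prefixes:
        dist = model(prefix)
        for group in groups:
            if t_equal(dist, model(group[0]), tolerance):
                group.append(prefix)
                break
        else:
            groups.append([prefix])
    return groups


if __name__ == "__main__":
    candidate_states = group_prefixes(
        ["", "a", "b", "aa", "ab"], toy_next_token_distribution, tolerance=0.05
    )
    print(candidate_states)
```

With a tolerance of 0.05, the toy model's small perturbations are absorbed and the prefixes fall into two groups, one per parity of the number of `a`s. With a tolerance of 0 the noise would split them into more groups, which is the motivation for comparing with a tolerance rather than exactly.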

