Deep Reinforcement Learning in HOL4

Oct-25-2019–arXiv.org Artificial Intelligence

The paper describes an implementation of deep reinforcement learning through self-supervised learning within the proof assistant HOL4. A close interaction between the machine learning modules and the HOL4 library is achieved by the choice of tree neural networks (TNNs) as machine learning models and the internal use of HOL4 terms to represent tree structures of TNNs. Recursive improvement is possible when a given task is expressed as a search problem. In this case, a Monte Carlo Tree Search (MCTS) algorithm guided by a TNN can be used to explore the search space and produce better examples for training the next TNN. As an illustration, tasks over propositional and arithmetical terms, representative of fundamental theorem proving techniques, are specified and learned: truth estimation, end-to-end computation, term rewriting and term synthesis.

algorithm, deep reinforcement learning, tnn, (13 more...)

arXiv.org Artificial Intelligence

Oct-25-2019

arXiv.org PDF

Add feedback

Country:
- Africa > Botswana (0.04)
- South America > Brazil
  - Rio Grande do Norte > Natal (0.04)
- North America
  - United States
    - New York > New York County
      - New York City (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - California
      - San Francisco County > San Francisco (0.14)
      - Los Angeles County > Long Beach (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Czechia > Prague (0.04)
  - Austria (0.04)
  - Switzerland > Fribourg
    - Fribourg (0.04)
  - Slovenia > Drava
    - Municipality of Benedikt > Benedikt (0.04)
- Asia > Japan
  - Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)

Genre:
- Research Report (0.50)
- Overview (0.46)
- Instructional Material > Course Syllabus & Notes (0.34)

Industry:
- Leisure & Entertainment > Games (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Search (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found