Deep Reinforcement Learning in HOL4
–arXiv.org Artificial Intelligence
The paper describes an implementation of deep reinforcement learning through self-supervised learning within the proof assistant HOL4. A close interaction between the machine learning modules and the HOL4 library is achieved by the choice of tree neural networks (TNNs) as machine learning models and the internal use of HOL4 terms to represent tree structures of TNNs. Recursive improvement is possible when a given task is expressed as a search problem. In this case, a Monte Carlo Tree Search (MCTS) algorithm guided by a TNN can be used to explore the search space and produce better examples for training the next TNN. As an illustration, tasks over propositional and arithmetical terms, representative of fundamental theorem proving techniques, are specified and learned: truth estimation, end-to-end computation, term rewriting and term synthesis.
arXiv.org Artificial Intelligence
Oct-25-2019
- Country:
- Africa > Botswana (0.04)
- South America > Brazil
- Rio Grande do Norte > Natal (0.04)
- North America
- United States
- New York > New York County
- New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Los Angeles County > Long Beach (0.04)
- New York > New York County
- Canada > Quebec
- Montreal (0.04)
- United States
- Europe
- Czechia > Prague (0.04)
- Austria (0.04)
- Switzerland > Fribourg
- Fribourg (0.04)
- Slovenia > Drava
- Municipality of Benedikt > Benedikt (0.04)
- Asia > Japan
- Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)
- Genre:
- Research Report (0.50)
- Overview (0.46)
- Instructional Material > Course Syllabus & Notes (0.34)
- Industry:
- Leisure & Entertainment > Games (0.68)
- Technology: