Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization

Beloborodov, Dmitrii, Ulanov, A. E., Foerster, Jakob N., Whiteson, Shimon, Lvovsky, A. I.

Feb-14-2020–arXiv.org Artificial Intelligence

Quantum hardware and quantum-inspired algorithms are becoming increasingly popular for combinatorial optimization. However, these algorithms may require careful hyperparameter tuning for each problem instance. We use a reinforcement learning agent in conjunction with a quantum-inspired algorithm to solve the Ising energy minimization problem, which is equivalent to the Maximum Cut problem. The agent controls the algorithm by tuning one of its parameters with the goal of improving recently seen solutions. We propose a new Rescaled Ranked Reward (R3) method that enables a stable single-player version of self-play training and helps the agent escape local optima. Training on any problem instance can be accelerated by applying transfer learning from an agent trained on randomly generated problems. Our approach allows sampling high-quality solutions to the Ising problem with high probability and outperforms both baseline heuristics and a black-box hyperparameter optimization approach.
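
The equivalence between Ising energy minimization and Maximum Cut mentioned in the abstract can be checked on a toy graph. The sketch below is illustrative only and is not taken from the paper: it assumes the sign convention E(s) = sum over edges of w_ij * s_i * s_j with spins in {-1, +1} (the paper's convention may differ), and the graph, the function names, and the brute-force search are hypothetical stand-ins for the quantum-inspired solver. Under that convention, cut(s) = (W - E(s)) / 2, where W is the total edge weight, so the lowest-energy spin assignment is also a maximum cut.

```python
from itertools import product


def ising_energy(weights, spins):
    """Assumed convention: E(s) = sum_{i<j} w_ij * s_i * s_j, s_i in {-1, +1}."""
    return sum(w * spins[i] * spins[j] for (i, j), w in weights.items())


def cut_value(weights, spins):
    """Total weight of edges whose endpoints carry opposite spins."""
    return sum(w for (i, j), w in weights.items() if spins[i] != spins[j])


def brute_force_min_energy(weights, n):
    """Exhaustive search over all 2**n spin assignments (toy sizes only)."""
    return min(product((-1, 1), repeat=n), key=lambda s: ising_energy(weights, s))


# Hypothetical toy instance: a 4-cycle with unit edge weights.
weights = {(0, 1): 1.0, (1, 2): 1.0, (2, 3): 1.0, (0, 3): 1.0}
n = 4
total_weight = sum(weights.values())

best = brute_force_min_energy(weights, n)
print("spins:", best)
print("energy:", ising_energy(weights, best))
print("cut:", cut_value(weights, best))
```

Exhaustive search is used here only because the instance is tiny; it scales as 2^n and says nothing about how the paper's algorithm or its agent-controlled parameter behaves.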

agent, optimization, reinforcement learning enhanced quantum-inspired algorithm

Country:
- North America > United States
  - California > Santa Clara County > Palo Alto (0.04)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.14)
  - Russia > Central Federal District
    - Moscow Oblast > Moscow (0.04)
- Asia
  - Russia (0.04)
  - Japan > Honshū
    - Kantō > Tochigi Prefecture > Utsunomiya (0.04)

Genre:
- Research Report (0.50)
- Workflow (0.46)

Industry:
- Transportation (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (1.00)
  - Machine Learning > Reinforcement Learning (1.00)
