Application of Deep Q Learning with Simulation Results for Elevator Optimization

Cao, Zheng, Guo, Raymond, Tuguinay, Caesar M., Pock, Mark, Gao, Jiayi, Wang, Ziyu

Dec-23-2022–arXiv.org Artificial Intelligence

This paper presents a methodology for combining programming and mathematics to optimize elevator wait times. Based on simulated user data generated according to the canonical three-peak model of elevator traffic, we first develop a naive model from an intuitive understanding of the logic behind elevators. We take into consideration a general array of features including capacity, acceleration, and maximum wait time thresholds to adequately model realistic circumstances. Using the same evaluation framework, we proceed to develop a Deep Q Learning model in an attempt to match the hard-coded naive approach for elevator control. Throughout the majority of the paper, we work under a Markov Decision Process (MDP) schema, but later explore how the assumption fails to characterize the highly stochastic overall Elevator Group Control System (EGCS).

elevator, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

Dec-23-2022

arXiv.org PDF

Add feedback

Country:
- North America > United States > Washington > King County > Seattle (0.04)

Genre:
- Research Report (0.65)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.35)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found