Optimizing Traffic Lights with Multi-agent Deep Reinforcement Learning and V2X communication

Feb-23-2020–arXiv.org Artificial Intelligence

We consider a system to optimize duration of traffic signals using multi-agent deep reinforcement learning and Vehicle-to-Everything (V2X) communication. This system aims at analyzing independent and shared rewards for multi-agents to control duration of traffic lights. A learning agent traffic light gets information along its lanes within a circular V2X coverage. The duration cycles of traffic light are modeled as Markov decision Processes. We investigate four variations of reward functions. The first two are unshared-rewards: based on waiting number, and waiting time of vehicles between two cycles of traffic light. The third and fourth functions are: shared-rewards based on waiting cars, and waiting time for all agents. Each agent has a memory for optimization through target network and prioritized experience replay. We evaluate multi-agents through the Simulation of Urban MObility (SUMO) simulator. The results prove effectiveness of the proposed system to optimize traffic signals and reduce average waiting cars to 41.5 % as compared to the traditional periodic traffic control system.

agent, reinforcement, vehicle, (10 more...)

arXiv.org Artificial Intelligence

Feb-23-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Massachusetts > Middlesex County > Cambridge (0.04)
- Europe
  - United Kingdom > England
    - Greater London > London (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.05)
- Asia
  - India > Chandigarh (0.04)
  - China > Heilongjiang Province
    - Harbin (0.04)

Genre:
- Research Report (0.69)

Industry:
- Transportation
  - Infrastructure & Services (1.00)
  - Ground > Road (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found