Goto

Collaborating Authors

 attendlight




AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control

Neural Information Processing Systems

We propose AttendLight, an end-to-end Reinforcement Learning (RL) algorithm for the problem of traffic signal control. Previous approaches for this problem have the shortcoming that they require training for each new intersection with a different structure or traffic flow distribution. AttendLight solves this issue by training a single, universal model for intersections with any number of roads, lanes, phases (possible signals), and traffic flow. To this end, we propose a deep RL model which incorporates two attention models. The first attention model is introduced to handle different numbers of roads-lanes; and the second attention model is intended for enabling decision-making with any number of phases in an intersection. As a result, our proposed model works for any intersection configuration, as long as a similar configuration is represented in the training set. Experiments were conducted with both synthetic and real-world standard benchmark datasets.





Review for NeurIPS paper: AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control

Neural Information Processing Systems

The primary motivation for the work is not well supported. Certainly, cities do manage thousands of intersections. While unquantified, it is not clear that the cost of training individually would surpass that of the degradation seen in the multi-env setting. These two statements seem to be conflicting. In section 5.1, the single-env results, it is not clear that FRAP is only applicable in 37 of the 112 cases.


Review for NeurIPS paper: AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control

Neural Information Processing Systems

There was a consensus among reviewers that the paper should be accepted. The paper provides a novel application of attention-based networks to traffic light control. The universality of the architecture allows out-of-the box application of a trained policy to new intersections without the need for adaptation. Overall, this paper seems to have the potential to have a strong impact.


AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control

Neural Information Processing Systems

We propose AttendLight, an end-to-end Reinforcement Learning (RL) algorithm for the problem of traffic signal control. Previous approaches for this problem have the shortcoming that they require training for each new intersection with a different structure or traffic flow distribution. AttendLight solves this issue by training a single, universal model for intersections with any number of roads, lanes, phases (possible signals), and traffic flow. To this end, we propose a deep RL model which incorporates two attention models. The first attention model is introduced to handle different numbers of roads-lanes; and the second attention model is intended for enabling decision-making with any number of phases in an intersection.


AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control

Oroojlooy, Afshin, Nazari, Mohammadreza, Hajinezhad, Davood, Silva, Jorge

arXiv.org Artificial Intelligence

We propose AttendLight, an end-to-end Reinforcement Learning (RL) algorithm for the problem of traffic signal control. Previous approaches for this problem have the shortcoming that they require training for each new intersection with a different structure or traffic flow distribution. AttendLight solves this issue by training a single, universal model for intersections with any number of roads, lanes, phases (possible signals), and traffic flow. To this end, we propose a deep RL model which incorporates two attention models. The first attention model is introduced to handle different numbers of roads-lanes; and the second attention model is intended for enabling decision-making with any number of phases in an intersection. As a result, our proposed model works for any intersection configuration, as long as a similar configuration is represented in the training set. Experiments were conducted with both synthetic and real-world standard benchmark data-sets. The results we show cover intersections with three or four approaching roads; one-directional/bi-directional roads with one, two, and three lanes; different number of phases; and different traffic flows. We consider two regimes: (i) single-environment training, single-deployment, and (ii) multi-environment training, multi-deployment. AttendLight outperforms both classical and other RL-based approaches on all cases in both regimes.