TY - GEN
T1 - Traffic signal control using reinforcement learning and the max-plus algorithm as a coordinating strategy
AU - Medina, Juan C.
AU - Benekohal, Rahim F.
PY - 2012
Y1 - 2012
N2 - This paper explores the performance of decentralized reinforcement learning agents with communication capabilities for the operation of traffic signals in an oversaturated network. An explicit coordinating mechanism is implemented as part of the reward structure of the agent using the max-plus algorithm, aiming at improving the network-wide performance. Results from a simulated network with realistic features showed that Q-learning agents could process a greater number of vehicles than optimized signal timings from state-of-practice simulation software TRANSYT7F, even under varying oversaturation conditions. The effect of adding the max-plus algorithm was limited, but towards improved performance in terms of both total throughput and reduced number of stops per vehicle. Ongoing research evaluates potential conditions where the coordination should be emphasized to further enhance results, as well as alternative implementations of the max-plus algorithm.
AB - This paper explores the performance of decentralized reinforcement learning agents with communication capabilities for the operation of traffic signals in an oversaturated network. An explicit coordinating mechanism is implemented as part of the reward structure of the agent using the max-plus algorithm, aiming at improving the network-wide performance. Results from a simulated network with realistic features showed that Q-learning agents could process a greater number of vehicles than optimized signal timings from state-of-practice simulation software TRANSYT7F, even under varying oversaturation conditions. The effect of adding the max-plus algorithm was limited, but towards improved performance in terms of both total throughput and reduced number of stops per vehicle. Ongoing research evaluates potential conditions where the coordination should be emphasized to further enhance results, as well as alternative implementations of the max-plus algorithm.
UR - http://www.scopus.com/inward/record.url?scp=84871219349&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84871219349&partnerID=8YFLogxK
U2 - 10.1109/ITSC.2012.6338911
DO - 10.1109/ITSC.2012.6338911
M3 - Conference contribution
AN - SCOPUS:84871219349
SN - 9781467330640
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 596
EP - 601
BT - 2012 15th International IEEE Conference on Intelligent Transportation Systems, ITSC 2012
T2 - 2012 15th International IEEE Conference on Intelligent Transportation Systems, ITSC 2012
Y2 - 16 September 2012 through 19 September 2012
ER -