CFP last date
20 May 2024
Reseach Article

Improved Multi-Agent Reinforcement Learning for Minimizing Traffic Waiting Time

by Vijay Kumar, B. Kaushik, H. Banka
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 81 - Number 9
Year of Publication: 2013
Authors: Vijay Kumar, B. Kaushik, H. Banka
10.5120/14043-2205

Vijay Kumar, B. Kaushik, H. Banka . Improved Multi-Agent Reinforcement Learning for Minimizing Traffic Waiting Time. International Journal of Computer Applications. 81, 9 ( November 2013), 30-34. DOI=10.5120/14043-2205

@article{ 10.5120/14043-2205,
author = { Vijay Kumar, B. Kaushik, H. Banka },
title = { Improved Multi-Agent Reinforcement Learning for Minimizing Traffic Waiting Time },
journal = { International Journal of Computer Applications },
issue_date = { November 2013 },
volume = { 81 },
number = { 9 },
month = { November },
year = { 2013 },
issn = { 0975-8887 },
pages = { 30-34 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume81/number9/14043-2205/ },
doi = { 10.5120/14043-2205 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:55:39.845965+05:30
%A Vijay Kumar
%A B. Kaushik
%A H. Banka
%T Improved Multi-Agent Reinforcement Learning for Minimizing Traffic Waiting Time
%J International Journal of Computer Applications
%@ 0975-8887
%V 81
%N 9
%P 30-34
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper depict using multi-agent reinforcement learning (MARL) algorithm for learning traffic pattern to minimize the traveling time or maximizing safety and optimizing traffic pattern (OTP). This model provides a description and solution to optimize traffic pattern that use multi-agent based reinforcement learning algorithms. MARL uses multi agent structure where vehicles and traffic signals are working as agents. In this model traffic area divide in different-different traffic ZONE. Each zone have own distributed agent and these agent will pass the information one zone to other threw the network. The Optimization objectives include the number of vehicle stops, the average waiting time and maximum queue length of the next (node) intersection. In addition, This research also introduce the priority control of buses and emergent vehicles into this model. Expected outcome of the algorithm is comparable to the performance of Q-Learning and Temporal difference learning. The results show significant reduction in waiting time comparable to those algorithms for the work more efficiently than other traffic system.

References
  1. Bowling,M. : Convergence and no-regret in multiagent learning. In: L. K. Saul, Y. Weiss, L. Bottou (eds. ) Advances in Neural Information Processing Systems 17, pp. 209–216. MIT Press (2005).
  2. Bus¸oniu, L. , De Schutter, B. , Babu?ska, R. : Multiagent reinforcement learning with adaptive state focus. In: Proceedings 17th Belgian-Dutch Conference on Artificial Intelligence (BNAIC-05), pp. 35–42. Brussels, Belgium (2005).
  3. Chalkiadakis, G. : Multiagent reinforcement learning: Stochastic games with multiple learning players. Tech. rep. , Dept. of Computer Science, University of Toronto, Canada (2003).
  4. Guestrin, C. , Lagoudakis, M. G. , Parr, R. : Coordinated reinforcement learning. In: Proceedings 19th International Conference on Machine Learning (ICML-02), pp. 227–234. Sydney, Australia (2002)
  5. Hu, J. , Wellman, M. P. : Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research 4, 1039–1069 (2003)
  6. M. Wiering, et al (2004). Intelligent Traffic Light Control. Technical Report UU-CS-2004-029, University Utrecht.
  7. M. Wiering (2000). Multi-Agent Reinforcement Learning for Traffic Light Control. Machine Learning: Proceedings of the 17th International Conference (ICML' 2000), 1151-1158.
  8. Mitchell, T. M. (1995) the Book of Machine Learning: McGraw-HILL INTERNATIONAL EDITIONS.
  9. Nunes L. , and Oliveira, E. C. Learning from multiple sources. In Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multi Agent Systems, AAMAS (New York, USA, July 2004), vol. 3, New York, IEEE Computer Society, pp. 1106–1113.
  10. Oliveira, D. , Bazzan, A. L. C. , and Lesser, V. using cooperative mediation to coordinate traffic lights: a case study. In Proceedings of the 4th International Joint Conference on Autonomous Agents and Multi Agent Systems (AAMAS) (July 2005), New York, IEEE Computer Society, pp. 463–470.
  11. Price, B. , Boutilier, C. : Accelerating reinforcement learning through implicit imitation Journal of Artificial Intelligence Research 19, 569–629 (2003).
  12. Tan, M. : Multi-agent reinforcement learning: Independent vs. cooperative agents. In: Proceedings 10th International Conference on Machine Learning (ICML-93), pp. 330– 337. Amherst, US (1993).
Index Terms

Computer Science
Information Sciences

Keywords

Agent Based System Intelligent Traffic Signal Control Multi Objective Scheme Optimization Objectives RL Multi-Agent System (MAS).