
Transfer Learning Approach for Fast Convergence of Deep Q Networks in Game Pong

International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2018
Authors:
Baomin Shao, Xue Jiang, Qiuling Li
DOI: 10.5120/ijca2018917925

Baomin Shao, Xue Jiang and Qiuling Li. Transfer Learning Approach for Fast Convergence of Deep Q Networks in Game Pong. International Journal of Computer Applications 181(21):11-14, October 2018.

BibTeX

@article{10.5120/ijca2018917925,
	author = {Baomin Shao and Xue Jiang and Qiuling Li},
	title = {Transfer Learning Approach for Fast Convergence of Deep Q Networks in Game Pong},
	journal = {International Journal of Computer Applications},
	issue_date = {October 2018},
	volume = {181},
	number = {21},
	month = {Oct},
	year = {2018},
	issn = {0975-8887},
	pages = {11-14},
	numpages = {4},
	url = {http://www.ijcaonline.org/archives/volume181/number21/30008-2018917925},
	doi = {10.5120/ijca2018917925},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}
}

Abstract

By simulating psychological and neurological learning processes, deep reinforcement learning has come to play an important role in the development and application of artificial intelligence, aided by the powerful feature-representation capability of deep neural networks. The deep Q network (DQN), which improves on traditional RL methods by moving beyond value-function approximation and policy search built on shallow structures, combines hierarchical feature extraction with accurate Q-value approximation in high-dimensional sensing environments.
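As a rough sketch of such a network (an assumption based on the widely used Atari DQN design, not an architecture taken from this paper), convolutional layers perform the hierarchical feature extraction while a small fully connected head produces one Q-value per action, trained against the standard one-step Bellman target:

import torch
import torch.nn as nn

class DQN(nn.Module):
    """Convolutional Q-network: stacked game frames in, one Q-value per action out."""
    def __init__(self, n_actions, in_channels=4):
        super().__init__()
        self.features = nn.Sequential(            # hierarchical feature extraction
            nn.Conv2d(in_channels, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
        )
        self.head = nn.Sequential(                # Q-value approximation
            nn.Flatten(),
            nn.Linear(64 * 7 * 7, 512), nn.ReLU(),  # 7x7 assumes 84x84 input frames
            nn.Linear(512, n_actions),
        )

    def forward(self, x):
        return self.head(self.features(x))

def q_loss(online, target, batch, gamma=0.99):
    """Standard DQN loss: fit Q(s, a) toward r + gamma * max_a' Q_target(s', a')."""
    s, a, r, s2, done = batch
    q = online(s).gather(1, a.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        q_next = target(s2).max(dim=1).values
        y = r + gamma * (1.0 - done) * q_next   # bootstrap only for non-terminal s'
    return nn.functional.smooth_l1_loss(q, y)

It is exactly this Q-loss whose slow convergence, as reported below, motivates the transfer learning approach.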

In this paper, DQN is applied to playing the game Pong. It was found, however, that even after adjusting hyperparameters (network architecture, exploration rate, learning rate), the Q-values did not converge easily, and this lack of convergence of the Q-loss may be the factor limiting game-playing performance. A transfer learning approach is therefore adopted for fast convergence of DQN in Pong, with several image-evaluation measures used as rewards for training. Experiments show that this approach yields fast convergence of DQN training and that the resulting network plays Pong well.
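One common way to realise such a transfer (a minimal sketch under assumed details; the paper's exact scheme may differ) is to initialise the Pong network from weights trained on a source task and fine-tune only the Q-value head, which sharply reduces the number of parameters the Q-loss must fit. Reusing the DQN class sketched above, and assuming the checkpoint path and saved state_dict are hypothetical:

import torch

# Hypothetical checkpoint: a state_dict saved from a DQN trained on a source task.
pretrained = torch.load("dqn_source_task.pt")

net = DQN(n_actions=6)                           # the ALE version of Pong has 6 actions
net.load_state_dict(pretrained, strict=False)    # copy matching layers, skip the rest

# Freeze the transferred convolutional features; train only the Q-value head,
# so training adapts far fewer parameters and the Q-loss converges faster.
for p in net.features.parameters():
    p.requires_grad = False
optimizer = torch.optim.Adam(
    (p for p in net.parameters() if p.requires_grad), lr=1e-4)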


Keywords

DQN, Transfer Learning, Game Pong, Image Evaluation