Research Article

Transfer Learning Approach for Fast Convergence of Deep Q Networks in Game Pong

by Baomin Shao, Xue Jiang, Qiuling Li
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 181 - Number 21
Year of Publication: 2018
Authors: Baomin Shao, Xue Jiang, Qiuling Li
DOI: 10.5120/ijca2018917925

Baomin Shao, Xue Jiang, Qiuling Li. Transfer Learning Approach for Fast Convergence of Deep Q Networks in Game Pong. International Journal of Computer Applications 181, 21 (Oct 2018), 11-14. DOI=10.5120/ijca2018917925

@article{ 10.5120/ijca2018917925,
author = { Baomin Shao, Xue Jiang, Qiuling Li },
title = { Transfer Learning Approach for Fast Convergence of Deep Q Networks in Game Pong },
journal = { International Journal of Computer Applications },
issue_date = { Oct 2018 },
volume = { 181 },
number = { 21 },
month = { Oct },
year = { 2018 },
issn = { 0975-8887 },
pages = { 11-14 },
numpages = {4},
url = { https://ijcaonline.org/archives/volume181/number21/30008-2018917925/ },
doi = { 10.5120/ijca2018917925 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%A Baomin Shao
%A Xue Jiang
%A Qiuling Li
%T Transfer Learning Approach for Fast Convergence of Deep Q Networks in Game Pong
%J International Journal of Computer Applications
%@ 0975-8887
%V 181
%N 21
%P 11-14
%D 2018
%I Foundation of Computer Science (FCS), NY, USA
Abstract

By simulating psychological and neurological systems, deep reinforcement learning has played an important role in the development and application of artificial intelligence, aided by the powerful feature-representation capability of deep neural networks. The deep Q network (DQN), which improves on traditional RL methods by moving beyond value-function approximation and policy search based on shallow structures, offers hierarchical feature extraction and accurate Q-value approximation in high-dimensional sensing environments. In this paper, DQN was applied to playing the game Pong; however, it was found that even after adjusting hyperparameters (network architecture, exploration, learning rate), the Q-values did not converge easily. The lack of convergence of the Q-loss may be the limiting factor for better game-playing results. A transfer learning approach was therefore adopted for fast convergence of DQN on Pong, and several measurement standards were used as rewards to train the network. Experiments showed that this approach achieves fast convergence of DQN training and that the resulting network performs well on Pong.
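The core of the approach the abstract describes is the standard DQN temporal-difference update combined with a transfer-learning warm start: the network is initialized from weights learned on a related task instead of random values. A minimal sketch of those two ingredients is shown below, using a tiny linear Q-function in NumPy as a stand-in for the paper's convolutional network; all names, dimensions, and hyperparameters here are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Linear Q-function over a 4-dim state with 2 actions (a hypothetical
# stand-in for the convolutional DQN used on Pong frames).
def init_weights(pretrained=None):
    # Transfer learning: warm-start from pretrained weights when available,
    # otherwise fall back to a random initialization.
    return pretrained.copy() if pretrained is not None else rng.normal(0.0, 0.1, (4, 2))

def q_values(w, s):
    # One Q-value per action, shape (2,).
    return s @ w

def dqn_update(w, w_target, s, a, r, s_next, gamma=0.99, lr=0.01):
    # Standard DQN TD target, computed with a frozen target network.
    target = r + gamma * np.max(q_values(w_target, s_next))
    td_error = target - q_values(w, s)[a]
    w = w.copy()
    # Gradient step on the squared TD error for the taken action.
    w[:, a] += lr * td_error * s
    return w, td_error

# Warm start from "pretrained" weights (e.g. from a related task),
# then perform one TD update on a sampled transition.
pretrained = rng.normal(0.0, 0.1, (4, 2))
w = init_weights(pretrained=pretrained)
w_target = w.copy()
s, a, r, s_next = rng.normal(size=4), 0, 1.0, rng.normal(size=4)
w, td = dqn_update(w, w_target, s, a, r, s_next)
```

In a full training loop the same update would run over minibatches from a replay buffer, with the target network refreshed periodically; the warm start only changes the initialization, which is what accelerates convergence in the paper's setting.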

References
  1. M. G. Bellemare, Y. Naddaf, J. Veness and M. Bowling, The Arcade learning environment: an evaluation platform for general agents, Journal of Artificial Intelligence Research, 47, pp. 253–279, 2013.
  2. D. Zhao and Y. Zhu, MEC-a near-optimal online reinforcement learning algorithm for continuous deterministic systems, IEEE Trans. Neural Netw. Learn. Sys., vol. 26, no. 2, pp. 346–356, Feb. 2015.
  3. B. Piot, M. Geist, and O. Pietquin, Bridging the gap between imitation learning and inverse reinforcement learning, IEEE Trans. Neural Netw. Learn. Syst., vol. 28, no. 8, pp. 1814–1826, Aug. 2017.
  4. J. Li, H. Modares, T. Chai, F. L. Lewis, and L. Xie, Off-policy reinforcement learning for synchronization in multiagent graphical games, IEEE Trans. Neural Netw. Learn. Syst., vol. 28, no. 10, pp. 2434–2445, Oct. 2017.
  5. V. Mnih et al., Human-level control through deep reinforcement learning, Nature, vol. 518, pp. 529–533, 2015.
  6. F. Abtahi, Z. Zhu, and A. M. Burry, A deep reinforcement learning approach to character segmentation of license plate images, in Proc. IAPR Int. Conf. Mach. Vis. Appl., Jul. 2015, pp. 539–542.
  7. Y. Deng, F. Bao, Y. Kong, Z. Ren, and Q. Dai, Deep Direct Reinforcement Learning for Financial Signal Representation and Trading. IEEE Trans. Neural Netw. Learn. Syst., vol. 28, no. 3, pp. 653–664, Mar. 2017.
  8. K. Narasimhan, T. Kulkarni, and R. Barzilay, Language understanding for text-based games using deep reinforcement learning, in Proc. Conf. Empir. Methods Natural Lang. Process., Sep. 2015, pp. 1–11.
  9. H. Y. Ong, K. Chavez, and A. Hong. (2015). Distributed deep Q learning. [Online]. Available: https://arxiv.org/abs/1508.04186
  10. M. E. Taylor, G. Kuhlmann, and P. Stone, Accelerating search with transferred heuristics, in Proc. ICAPS Workshop AI Planning Learn., 2007.
  11. M. Riedmiller, Neural fitted Q iteration—First experiences with a data efficient neural reinforcement learning method, in Proc. Eur. Conf. Mach. Learn., Oct. 2005, pp. 317–328.
  12. A. Fachantidis, I. Partalas, G. Tsoumakas, and I. Vlahavas, Transferring models in hybrid reinforcement learning agents, in Proc. IFIP Adv. Inf. Commun. Technol., Sep. 2011, pp. 162–171.
  13. A. Lazaric, M. Restelli, and A. Bonarini, Transfer of samples in batch reinforcement learning, in Proc. 25th Int. Conf. Mach. Learn., Jul. 2008, pp. 544–551.
Index Terms

Computer Science
Information Sciences

Keywords

DQN, Transfer Learning, Game Pong, Image Evaluation