An Improved Q-learning Algorithm for Path-Planning of a Mobile Robot

Pradipta K Das; S. C. Mandhata; H. S. Behera; S. N. Patro

Call for Paper

September Edition

IJCA solicits high quality original research papers for the upcoming September edition of the journal. The last date of research paper submission is 20 August 2025

Submit your paper

Know more

The week's pick

Real-time Synchronization Mechanisms Between Batch-oriented Legacy Systems and Modern Interfaces in the Retirement Domain

Balamurugan Krishnaswamy Gnanasekaran

Random Articles

Article:PID Control of Heat Exchanger System

October

2010

Shared Cryptography with Embedded Session Key for Secret Audio

July

2011

A Holistic Approach to Autonomic Self-Healing Distributed Computing System

August

2013

Study and Analysis of Scientific Scopes and Issues towards Developing an Efficient LECIM

July

2013

Reseach Article

An Improved Q-learning Algorithm for Path-Planning of a Mobile Robot

by Pradipta K Das, S. C. Mandhata, H. S. Behera, S. N. Patro

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 51 - Number 9

Year of Publication: 2012

Authors: Pradipta K Das, S. C. Mandhata, H. S. Behera, S. N. Patro

10.5120/8073-1468

Pradipta K Das, S. C. Mandhata, H. S. Behera, S. N. Patro . An Improved Q-learning Algorithm for Path-Planning of a Mobile Robot. International Journal of Computer Applications. 51, 9 ( August 2012), 40-46. DOI=10.5120/8073-1468

@article{ 10.5120/8073-1468,

author = { Pradipta K Das, S. C. Mandhata, H. S. Behera, S. N. Patro },

title = { An Improved Q-learning Algorithm for Path-Planning of a Mobile Robot },

journal = { International Journal of Computer Applications },

issue_date = { August 2012 },

volume = { 51 },

number = { 9 },

month = { August },

year = { 2012 },

issn = { 0975-8887 },

pages = { 40-46 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume51/number9/8073-1468/ },

doi = { 10.5120/8073-1468 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:49:59.308458+05:30

%A Pradipta K Das

%A S. C. Mandhata

%A H. S. Behera

%A S. N. Patro

%T An Improved Q-learning Algorithm for Path-Planning of a Mobile Robot

%J International Journal of Computer Applications

%@ 0975-8887

%V 51

%N 9

%P 40-46

%D 2012

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Classical Q-learning requires huge computations to attain convergence and a large storage to save the Q-values for all possible actions in a given state. This paper proposes an alternative approach to Q-learning to reduce the convergence time without using the optimal path from a random starting state of a final goal state, when the Q-table is used for path planning of a mobile robot. Further, the proposed algorithm stores the Q-value for the best possible action at a state, and thus save significant storage. Experiments reveal that the acquired Q-table obtained by the proposed algorithm helps in saving turning angles of the robot in the planning stage. Reduction in turning angles is economic from the point of view of energy consumption by the robot. Thus the proposed algorithm has several merits with respect to classical Q-learning. The proposed algorithm is constructed based on four fundamental properties derived here and the validation of the algorithm is studied with Khepera-II robot.

References

Dean, T. , Basye, K. and Shewchuk, J. "Reinforcement learning for planning and Control". In: Minton, S. (ed. ) Machine Learning Methods for Planning and Scheduling. Morgan Kaufmann 1993.
Bellman, R. E. , "Dynamic programming ", Princeton, NJ: Princeton University Press, p. 957.
Watkins, C. and Dayan, P. , "Q-learning," Machine Learning, Vol. 8, pp. 279- 292, 1992
Konar, A. , Computational Intelligence: Principles, Techniques and Applications. Springer-Verlag, 2005
Busoniu, L. , Babushka, R. , Schutter, B. De. , Ernst, D. , Reinforcement Learning and Dynamic Programming Using Function Approximators, CRC Press, Taylor & Francis group, Boca Raton, FL, 2010.
Chakraborty, J. , Konar A. , Jain, L. C. , and Chakraborty, U. , "Cooperative Multi-Robot Path Planning Using Differential Evolution" Journal of Intelligent & Fuzzy Systems, Vol. 20, Pp. 13-27, 2009.
Gerke, M. , and Hoyer, H. , Planning of Optimal paths for autonomous agents moving in inhomogeneous environments, in: Proceedings of the 8th Int. Conf. on Advanced Robotics, July 1997, pp. 347-352.
Xiao, J. , Michalewicz, Z. , Zhang, L. , and Trojanowski, K. , Adaptive Evolutionary Planner/ Navigator for Mobile robots, IEEE Transactions on evolutionary Computation 1 (1), April 1997.
Bien, Z. , and Lee, J. , A Minimum–Time trajectory planning Method for Two Robots, IEEE Trans on Robotics and Automation 8(3). PP. 443-450, JUNE 1992.
Moll, M. , and Kavraki, L. E. , Path Planning for minimal Energy Curves of Constant Length, in: Proceedings of the 2004 IEEE Int. Conf. on Robotics and Automation, pp. 2826-2831, April 2004.
Regele, R. , and Levi, P. , Cooperative Multi-Robot Path Planning by Heuristic Priority Adjustment, in: Proceedings of the IEEE/RSJ Int Conf on Intelligent Robots and Systems, 2006.
Yuan-Pao Hsu, Wei-Cheng Jiang, Hsin-Yi Lin, A CMAC-Q-Learning Based Dyna Agent, in: SICE Annual Conference, 2008, pp. 2946 – 2950, The University Electro-Communications, Tokyo, Japan.
Yi Zhou and Meng Joo Er, A Novel Q-Learning Approach with Continuous States and Actions, in: 16th IEEE International Conference on Control Applications Part of IEEE Multi-conference on Systems and Control, Singapore, 1-3 October 2007
Kyungeun Cho, Yunsick Sung, Kyhyun Um, A Production Technique for a Q-table with an Influence Map for Speeding up Q-learning, in : International Conference on Intelligent Pervasive Computing, 2007.
Deepshikha Pandey, Punit Pandey, Approximate Q-Learning: An Introduction, in : Second International Conference on Machine Learning and Computing, 2010.
S. S. Masoumzadeh and G. Taghizadeh, K. Meshgi and S. Shiry, Deep Blue: A Fuzzy Q-Learning Enhanced Active Queue Management Scheme, in : International Conference on Adaptive and Intelligent Systems, 2009.
Indrani Goswami (Chakraborty), Pradipta Kumar Das, Amit Konar, R. Janarthanan. "Extended Q-learning Algorithm for Path-Planning of a Mobile Robot", in: Eighth International Conference on Simulated Evolution And Learning (SEAL-2010), Indian Institute of Technology Kanpur, India, December 2010.

Index Terms

Computer Science

Information Sciences

Keywords

Q-learning Reinforcement learning Motion planning Mobile Robot Energy