BAT Q-LEARNING ALGORITHM

Fa | Ar | En

BAT Q-LEARNING ALGORITHM


نویسنده	abed-alguni bilal h.
منبع	jordanian journal of computers and information technology - 2017 - دوره : 3 - شماره : 1 - صفحه:51 -70
چکیده	Cooperative q-learning approach allows multiple learners to learn independently and then share their q-values among each other using a q-value sharing strategy. a main problem with this approach is that the solutions of the learners may not converge to optimality, because the optimal q-values may not be found. another problem is that some cooperative algorithms perform very well with single-task problems, but quite poorly with multi-task problems. this paper proposes a new cooperative q-learning algorithm called the bat q-learning algorithm (bq-learning) that implements a q-value sharing strategy based on the bat algorithm. the bat algorithm is a powerful optimization algorithm that increases the possibility of finding the optimal q-values by balancing between the exploration and exploitation of actions by tuning the parameters of the algorithm. the bq-learning algorithm was tested using two problems: the shortest path problem (single-task problem) and the taxi problem (multi-task problem). the experimental results suggest that bq-learning performs better than single-agent q-learning and some well-known cooperative q-learning algorithms.
کلیدواژه	Q-learning ,Bat algorithm ,Optimization ,Cooperative reinforcement learning.
آدرس	yarmouk university, computer science department, Jordan
پست الکترونیکی	e-mail: bilal.h@yu.edu.jo



Authors