In applying reinforcement learning to continuous-space problems, discretizing or redefining the learning space is a promising approach, and several methods and algorithms have been proposed to equip learning agents for it. In our previous study, we introduced an FCCM clustering technique into Q-learning (QL-FCCM), together with its transfer learning, for Markov processes. Because that method could not cope with more complicated environments such as non-Markov processes, in this study we propose a method in which the agent updates its Q-table by blending the Q-learning and QL-FCCM updates, adjusting the trade-off ratio between them according to the damping ratio. We conducted numerical experiments on the single-pendulum standing problem, and our model yielded a smooth learning process.
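The abstract's core idea, blending a plain Q-learning target with a cluster-informed target under a trade-off ratio, can be sketched as follows. This is a minimal illustration, not the paper's algorithm: the fuzzy membership matrix `U`, the similarity-based smoothing, and the constant trade-off weight `w` (which the paper instead derives from a damping ratio) are all assumptions made here for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_states, n_actions, n_clusters = 6, 2, 3

Q = np.zeros((n_states, n_actions))
# Hypothetical fuzzy cluster memberships U[s, c] of each state in each
# cluster (rows sum to 1), standing in for the FCCM clustering result.
U = rng.dirichlet(np.ones(n_clusters), size=n_states)

def blended_update(Q, U, s, a, r, s_next, alpha=0.1, gamma=0.9, w=0.5):
    """One blended Q-table update: (1 - w) * plain Q-learning target
    plus w * a cluster-smoothed target. `w` is a fixed stand-in for
    the damping-ratio-based trade-off described in the abstract."""
    # standard Q-learning target
    plain = r + gamma * Q[s_next].max()
    # cluster-smoothed next-state value: membership-weighted average
    # of the best Q-values over all states (illustrative only)
    sim = U @ U[s_next]          # similarity of every state to s_next
    sim /= sim.sum()
    smoothed = r + gamma * sim @ Q.max(axis=1)
    target = (1 - w) * plain + w * smoothed
    Q[s, a] += alpha * (target - Q[s, a])
    return Q
```

With an all-zero Q-table, a reward of 1.0 moves only the visited entry by `alpha * 1.0`, regardless of `w`, since both targets coincide; the blending only matters once the cluster-smoothed values diverge from the per-state maxima.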
Citation:
Notsu, A., Ueno, T., Hattori, Y., Ubukata, S., & Honda, K. (2015). FCM-type co-clustering transfer reinforcement learning for non-Markov processes. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 9376, pp. 214–225). Springer Verlag. https://doi.org/10.1007/978-3-319-25135-6_21