We have proposed multi-agent reinforcement learning with repeated consultation (MARLRC) as a multi-agent reinforcement learning that agents can select the concerted action. In MARLRC, agents select a virtual action and share it with other agents several times in one robot decision-making. In this study, we focused on the problem that MARLRC does not take into account the environment that time constraints may occur in the decision-making. As an approach to solve this problem, we considered to introduce to determine the amount of time that can be used in decision-making and decision-making in predetermined time. We introduced the method to decision-making in time predetermined by MARLRC in this study.
CITATION STYLE
Takada, Y., & Kurashige, K. (2016). Action learning to single robot using MARL with repeated consultation: Realization of repeated consultation interruption for the adaptation to environmental change. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9835 LNCS, pp. 371–382). Springer Verlag. https://doi.org/10.1007/978-3-319-43518-3_36
Mendeley helps you to discover research relevant for your work.