Reinforcement learning agents are unable to respond effectively when faced with novel, out-of-distribution events until they have undergone a significant period of additional training. For lifelong learning agents, which cannot be simply taken offline during this period, sub-optimal actions may be taken that can result in unacceptable outcomes. This paper presents the Autonomous Emergency Management System (A-EMS) - an online, data-driven, emergency-response method that aims to provide autonomous agents the ability to react to unexpected situations that are very different from those it has been trained or designed to address. The proposed approach devises a customized response to the unforeseen situation sequentially, by selecting actions that minimize the rate of increase of the reconstruction error from a variational auto-encoder. This optimization is achieved online in a data-efficient manner (on the order of 30 to 80 data-points) using a modified Bayesian optimization procedure. The potential of A-EMS is demonstrated through emergency situations devised in a simulated 3D car-driving application.
CITATION STYLE
Maguire, G., Ketz, N., Pilly, P. K., & Mouret, J. B. (2022). A-EMS: An Adaptive Emergency Management System for Autonomous Agents in Unforeseen Situations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13546 LNAI, pp. 266–281). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-15908-4_21
Mendeley helps you to discover research relevant for your work.