Reducing the Learning Time of Reinforcement Learning for the Supervisory Control of Discrete Event Systems

4Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Reinforcement learning (RL) can obtain the supervisory controller for discrete-event systems modeled by finite automata and temporal logic. The published methods often have two limitations. First, a large number of training data are required to learn the RL controller. Second, the RL algorithms do not consider uncontrollable events, which are essential for supervisory control theory (SCT). To address the limitations, we first apply SCT to find the supervisors for the specifications modeled by automata. These supervisors remove illegal training data violating these specifications and hence reduce the exploration space of the RL algorithm. For the remaining specifications modeled by temporal logic, the RL algorithm is applied to search for the optimal control decision within the confined exploration space. Uncontrollable events are considered by the RL algorithm as uncertainties in the plant model. The proposed method can obtain a nonblocking supervisor for all specifications with less learning time than the published methods.

References Powered by Scopus

Introduction to discrete event systems

2584Citations
N/AReaders
Get full text

The Control of Discrete Event Systems

2228Citations
N/AReaders
Get full text

Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers

920Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A Multifaceted Approach to Stock Market Trading Using Reinforcement Learning

2Citations
N/AReaders
Get full text

Integrating reinforcement learning and supervisory control theory for optimal directed control of discrete-event systems

1Citations
N/AReaders
Get full text

On Efficient Sampling in Supervisory Reinforcement Learning

1Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Yang, J., Tan, K., Feng, L., El-Sherbeeny, A. M., & Li, Z. (2023). Reducing the Learning Time of Reinforcement Learning for the Supervisory Control of Discrete Event Systems. IEEE Access, 11, 59840–59853. https://doi.org/10.1109/ACCESS.2023.3285432

Readers' Seniority

Tooltip

Researcher 2

100%

Readers' Discipline

Tooltip

Engineering 2

67%

Computer Science 1

33%

Article Metrics

Tooltip
Mentions
News Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free