Abstract: Inverse reinforcement learning optimal control is under the framework of learner–expert, the learner system can learn expert system's trajectory and optimal control policy via a ...
Evaluating the advantages and potential drawbacks of shielding as a method for safe RL. Bettina Könighofer is an assistant ...
Abstract: Q-learning (QL) is a widely used algorithm in reinforcement learning (RL), but its convergence can be slow, especially when the discount factor is close to one. Successive over-relaxation ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results