Q Learning Algorithm Equation

Inverse Q-Learning Optimal Control for Takagi–Sugeno Fuzzy Systems

Abstract: Inverse reinforcement learning optimal control is under the framework of learner–expert, the learner system can learn expert system's trajectory and optimal control policy via a ...

Communications of the ACM

Shields for Safe Reinforcement Learning

Evaluating the advantages and potential drawbacks of shielding as a method for safe RL. Bettina Könighofer is an assistant ...

IEEE

Double Successive Over-Relaxation Q-Learning With an Extension to Deep Reinforcement Learning

Abstract: Q-learning (QL) is a widely used algorithm in reinforcement learning (RL), but its convergence can be slow, especially when the discount factor is close to one. Successive over-relaxation ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Inverse Q-Learning Optimal Control for Takagi–Sugeno Fuzzy Systems

Shields for Safe Reinforcement Learning

Double Successive Over-Relaxation Q-Learning With an Extension to Deep Reinforcement Learning

Trending now