Reinforcement Learning: Expected SARSA
A short introduction on Expected SARSA and comparing SARSA, Expected SARSA, and Q-Learning.
Good to Know:
Reinforcement Learning: Temporal Difference Learning
Reinforcement Learning: SARSA and Q-Learning
Reinforcement Learning aims for an agent to find an optimal control policy for a sequential decision problem in an environment that maximizes its long-term reward by continually interacting with its environment. Interaction of…