Reinforcement Learning: On Policy and Off Policy
An intuitive explanation of the terms On Policy and Off Policy, and of the differences between them
The explanations in this article are deliberately simplified to make the concepts easier to understand.
You just moved to a new locality and have tried a few restaurants in your area. Today you…