In a recent blog post, Ben Recht described the reinforcement learning (RL) setup as:
Paraphrasing Thorndike’s Law of Effect, Lior defines reinforcement learning as the iterative process:
- Receive external validation on how good you’re currently doing
- Adjust what you’re currently doing so that you are better the next time around.
Whether or not this is how humans or animals learn, this is a spot-on definition of computer scientific reinforcement learning.
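To make the two-step loop concrete, here is a minimal sketch in Python. It is not taken from Recht's post: the scoring function `external_score`, its peak at 3.0, and the accept-if-better update rule in `reinforce` are all assumptions chosen for illustration, and the update is plain random search rather than any particular RL algorithm.

```python
import random


def external_score(action: float) -> float:
    """Hypothetical external evaluator: higher is better, best at action = 3.0."""
    return -(action - 3.0) ** 2


def reinforce(steps: int = 2000, step_size: float = 0.1, seed: int = 0) -> float:
    """Repeat: get a score for what you are doing, then keep any change that scores better."""
    rng = random.Random(seed)
    current = 0.0
    # Step 1: receive external validation on the current behavior.
    current_score = external_score(current)
    for _ in range(steps):
        # Try a small random variation of the current behavior.
        candidate = current + rng.uniform(-step_size, step_size)
        # Step 1 again: the learner sees only the score, not how it is computed.
        candidate_score = external_score(candidate)
        # Step 2: adjust only when the feedback says the variation was better.
        if candidate_score > current_score:
            current, current_score = candidate, candidate_score
    return current


best = reinforce()
print(f"final action: {best:.3f}")
```

The learner never looks inside `external_score`; it only receives a number and adjusts. That is the sense in which the validation is external.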