Temporal-Difference Learning: Combining Dynamic Programming and Monte Carlo Methods for Reinforcement Learning | Towards Data Science

Milestones of RL: Q-Learning and Double Q-Learning

By · · 1 min read
Temporal-Difference Learning: Combining Dynamic Programming and Monte Carlo Methods for Reinforcement Learning | Towards Data Science

Source: Towards Data Science

Milestones of RL: Q-Learning and Double Q-Learning