Reinforcement Learning Temporal-Difference Learning Temporal-Difference Learning in Reinforcement Learning, like what you love, TD(0), TD(1), TD(lambda).
Reinforcement Learning Basic model of Reinforcement Learning Basic model for Reinforcement learning, about value function V(s), Q(s, a) function and continuation value function C(s, a)