Temporal-Difference Learning

Temporal-Difference Learning in Reinforcement Learning, like what you love, TD(0), TD(1), TD(lambda).

  • TD(0)
    $$ V(s) = V(s-1) + \Gamma $$
  • TD(1)
    $$ V(s) = V(s-1) + \Gamma $$
  • TD(2)
    $$ V(s) = V(s-1) + \Gamma $$