Temporal-difference learning
(a) Construct a signal-flow graph representation of the TD(0) algorithm described in Eqs. (12.34) and (12.35)
(b) The TD(0) algorithm has a mathematical composition similar to that of the LMS algorithm described in Chapter 3. Discuss the similarities and differences between these two algorithms.