Temporal-difference learning
(a) Construct a signal-flow graph representation of the TD(0) algorithm described in Eqs. (12.34) and (12.35)
![852_9c83cac7-439f-4e29-b073-cdd13801e46e.png](https://secure.tutorsglobe.com/CMSImages/852_9c83cac7-439f-4e29-b073-cdd13801e46e.png)
![978_9efc5d7c-35ed-4e65-8b9c-22cf24010ea7.png](https://secure.tutorsglobe.com/CMSImages/978_9efc5d7c-35ed-4e65-8b9c-22cf24010ea7.png)
(b) The TD(0) algorithm has a mathematical composition similar to that of the LMS algorithm described in Chapter 3. Discuss the similarities and differences between these two algorithms.