The truncated BPTT(h) algorithm may be viewed as an approximation to the epoch wise BPTT algorithm The approximation can be improved by incorporating aspects of epoch wise BPTT into the truncated BPTT(h) algorithm. Specifically, we may let the network go through h' additional steps before performing the next BPTT computation, where h'