Backpropagation through time (BPTT) is a gradient-based technique for training certain types of recurrent neural networks. It can be used to train Elman networks. The algorithm was independently derived by numerous researchers
The training data for BPTT should be an ordered sequence of input-output pairs, . An initial value must be specified for . Typically, a vector of all zeros is used for this purpose.