An extension of the encoder-decoder model, where the decoder attends to different parts of the input sequence at each step. This helps the model learn to focus on the relevant parts of the input when generating the output.
An extension of the encoder-decoder model, where the decoder attends to different parts of the input sequence at each step. This helps the model learn to focus on the relevant parts of the input when generating the output.