Chapter 07.05: Attention and Transformers

In this subchapter, we introduce more recent techniques for modelling sequence data, such as attention and transformers.

Lecture slides