Friday, May 13, 2022

Lecture 19 (9/5/2022, 3 hours): the Transformer architecture

The Transformer architecture. Rationale. Self- and cross-attention; keys, queries, values. Encoder and decoder blocks.

No comments:

Post a Comment