Transformer

Transformer

Reference:

Transformers Explained Visually (Part 1): Overview of Functionality
Transformers Explained Visually (Part 2): How it works, step-by-step
Transformers Explained Visually (Part 3): Multi-head Attention, deep dive
Transformers Explained Visually — Not Just How, but Why They Work So Well
Foundations of NLP Explained — Bleu Score and WER Metrics
Foundations of NLP Explained Visually: Beam Search, How It Works

  1. For basic model
Figure 1 Transformer's sample vectors after embedding processing, referring to Transformer part 2