Q (T×d_k)
·
Kᵀ (d_k×T)
/ √d_k
Scores
Softmax
Attn Weights
·
V (T×d_v)
=
Output