promptdojo_

Attention and Transformer blocks — step 3 of 7

Read the editor. It turns relevance scores into attention weights by dividing each by the total. What does it print?