75.
A new lens on attention that I've been thinking:
A new lens on attention that I've been thinking: each key in attention defines a hyperplane in query space. The score qᵀk isn't just similarity — it's a signed incidence. Which side of the key-plane the query sits on, and how far.