56.
making a database index and querying it N times doesn't take N^2 complexity: it takes N + NlogN, which is what sc…
making a database index and querying it N times doesn't take N^2 complexity: it takes N + NlogN, which is what scaled dot product attention should take for a billion token context window that it's a weighted sum, rather than a lookup, is a