47.
1/ Today we're releasing AttuneBench, the first open EQ benchmark grounded in real multi-turn human-model convers… (x.com)
1/ Today we're releasing AttuneBench, the first open EQ benchmark grounded in real multi-turn human-model conversations, scored against what the person actually felt and wanted at each turn. Built by the research team at @pareto_ai in co