@xdotli on Backlist

1 appearance on the backlist front page in the last 30 days.

44.

there are levels to building evals lvl 1: using a spreadsheet qa pairs lvl 2: using public agent evals lvl 3: manually label private evals lvl 4: traces to evals and skills lvl 5: turn every prompt & traces into self healing loops almos

by (Xiangyi Li) · backlist 2026-06-23 · rubric 100.0