@xwang_lk on Backlist

3 appearances on the backlist front page in the last 30 days.

53.

We discover the Asymmetric Roles of Data Gating and Reward Grounding in Self-Play RL: data gating, not reward grounding, is the binding constraint on stability. A strict gate stabiliz

by (Xin Eric Wang (hiring postdoc)) · backlist 2026-05-22 · rubric 90.0

49.

Your agent finished the task. Did it also read files it shouldn't have, call tools outside policy, or leak data across components? If you only score final outputs, you can't tell. HarnessAudit evaluates the three safety layers

by (Xin Eric Wang (hiring postdoc)) · backlist 2026-05-19 · rubric 92.0