17.
Most chain-of-thought faithfulness detectors perform near chance
Eight proposed methods for detecting unfaithful chains of thought mostly failed when tested against ground-truth faithfulness labels
1 appearance on the backlist front page in the last 30 days.
Eight proposed methods for detecting unfaithful chains of thought mostly failed when tested against ground-truth faithfulness labels