89.
Can reasoning models become overly reliant on chain-of-thought examples? (t.co)
Can reasoning models become overly reliant on chain-of-thought examples? Our #ACL2026 work shows excessive CoT supervision is not always beneficial, and gives a recipe for tuning the CoT fraction to improve novel-task accuracy. Website: