63.
Google DeepMind interpretability team rediscovered our year old work! SFT matters more for alignment than RLHF. (x.com)
Google DeepMind interpretability team rediscovered our year old work! SFT matters more for alignment than RLHF. https:// x.com/sivareddyg/sta tus/1985715581991936073 …