63.
New Anthropic Fellows research: Classifier Context Rot
Anthropic monitors agent transcripts for dangerous actions, and those transcripts are getting very long. Can the monitors handle such long transcripts?