56.
Two scenarios can look almost identical, yet one is safe and one isn't.
Two scenarios can look almost identical, yet one is safe and one isn't. Can MLLMs tell the difference? Mostly, not well. Our #CVPR2026 paper tackles "contextual safety": not just refusing obviously unsafe inputs, but reading the subtle co