77.
How well do MLLMs and agentic video frameworks handle questions (e.g., tracking objects or abstracting recurring …
How well do MLLMs and agentic video frameworks handle questions (e.g., tracking objects or abstracting recurring behavior patterns) over long-horizon videos, which often require memory to retrieve and aggregate information across time? To