zero-shot long video question answering

2 papers with code • 0 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in zero-shot long video question answering

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Datasets

CinePile: A Long Video Question Answering Dataset and Benchmark

Most implemented papers

Most implemented Social Latest No code

MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

rese1f/MovieChat • • 31 Jul 2023

Recently, integrating video foundation models and large language models to build a video understanding system can overcome the limitations of specific pre-defined vision tasks.

Paper
Code

Understanding Long Videos in One Multimodal Language Model Pass

kahnchana/mvu • • 25 Mar 2024

In addition to faster inference, we discover the resulting models to yield surprisingly good accuracy on long-video tasks, even with no video specific information.