AS-V2 (The All-Seeing Dataset v2)

Introduced by Wang et al. in The All-Seeing Project V2: Towards General Relation Comprehension of the Open World

We propose a novel task, termed Relation Conversation (ReC), which unifies the formulation of text generation, object localization, and relation comprehension. Based on the unified formulation, we construct the AS-V2 dataset, which consists of 127K high-quality relation conversation samples, to unlock the ReC capability for Multi-modal Large Language Models (MLLMs).

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Apache 2.0 license

Modalities


Languages