Rendered synthetically using a library of standard 3D objects, and tests the ability to recognize compositions of object movements that require long-term reasoning.
Source: CATER: A diagnostic dataset for Compositional Actions and TEmporal ReasoningPaper | Code | Results | Date | Stars |
---|