UICaption is a dataset of 114k UI images paired with descriptions of their functionality. It is designed for the tasks of UI action entailment, instruction-based UI image retrieval, grounding referring expressions, and UI entity recognition.
Source: Lexi: Self-Supervised Learning of the UI Language