1 code implementation • CVPR 2022 • Hyunmin Lee, Jaesik Park
The dataset consists of 2.9M annotations of geometric orderings for class-labeled instances in 101K natural scenes.
no code implementations • NAACL 2019 • Chris Dongjoo Kim, Byeongchang Kim, Hyunmin Lee, Gunhee Kim
We explore the problem of Audio Captioning: generating a natural language description for any kind of audio in the wild, a problem that has been surprisingly unexplored in previous research.
Ranked #9 on Audio captioning on AudioCaps