Search Results for author: Michael Kidd

Found 2 papers, 2 papers with code

AISFormer: Amodal Instance Segmentation with Transformer

1 code implementation • 12 Oct 2022 • Minh Tran, Khoa Vo, Kashu Yamazaki, Arthur Fernandes, Michael Kidd, Ngan Le

AISFormer explicitly models the complex coherence between occluder, visible, amodal, and invisible masks within an object's regions of interest by treating them as learnable queries.

Amodal Instance Segmentation Decoder +2

Paper
Code

VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

1 code implementation • 26 Jun 2022 • Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le

In this paper, we leverage the human perceiving process, that involves vision and language interaction, to generate a coherent paragraph description of untrimmed videos.

Ranked #3 on Video Captioning on ActivityNet Captions

Contrastive Learning Video Captioning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.