no code implementations • 14 Feb 2023 • Siddhesh Khandelwal, Anirudth Nambirajan, Behjat Siddiquie, Jayan Eledath, Leonid Sigal
Methods for object detection and segmentation often require abundant instance-level annotations for training, which are time-consuming and expensive to collect.
no code implementations • CVPR 2022 • Jialian Wu, Sudhir Yarram, Hui Liang, Tian Lan, Junsong Yuan, Jayan Eledath, Gerard Medioni
In addition, VisTR is not fully end-to-end learnable in multiple video clips as it requires a hand-crafted data association to link instance tracklets between successive clips.
no code implementations • CVPR 2021 • N. Dinesh Reddy, Laurent Guigues, Leonid Pischulini, Jayan Eledath, Srinivasa Narasimhan
At the core of our approach is a novel spatio-temporal formulation that operates in a common voxelized feature space aggregated from single- or multiple camera views.
Ranked #1 on 3D Human Pose Estimation on Panoptic (using extra training data)
1 code implementation • CVPR 2021 • Mohammed Suhail, Abhay Mittal, Behjat Siddiquie, Chris Broaddus, Jayan Eledath, Gerard Medioni, Leonid Sigal
The proposed formulation allows for efficiently incorporating the structure of scene graphs in the output space.
Ranked #3 on Scene Graph Classification on Visual Genome (R@20 metric)
no code implementations • 18 Feb 2020 • Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni
We embed the attention module in a ``slow-fast'' architecture, where the slower network runs on sparsely sampled keyframes and the light-weight shallow network runs on non-keyframes at a high frame rate.