1 code implementation • 25 Sep 2023 • Jyoti Kini, Sarah Fleischer, Ishan Dave, Mubarak Shah
Our work focuses on recognizing actions from egocentric RGB and Depth modalities in an industry-like environment.
1 code implementation • 10 Aug 2023 • Jyoti Kini, Sarah Fleischer, Ishan Dave, Mubarak Shah
In this work, we propose an ensemble modeling approach for multimodal action recognition.
no code implementations • 14 Oct 2021 • Ishan Dave, Naman Biyani, Brandon Clark, Rohit Gupta, Yogesh Rawat, Mubarak Shah
This technical report presents our approach "Knights" to solve the action recognition task on a small subset of Kinetics-400 i. e. Kinetics400ViPriors without using any extra-data.
1 code implementation • 20 Jan 2021 • Ishan Dave, Rohit Gupta, Mamshad Nayeem Rizve, Mubarak Shah
However, prior work on contrastive learning for video data has not explored the effect of explicitly encouraging the features to be distinct across the temporal dimension.
Ranked #9 on Self-supervised Video Retrieval on UCF101
no code implementations • 23 Apr 2020 • Mamshad Nayeem Rizve, Ugur Demir, Praveen Tirupattur, Aayush Jung Rana, Kevin Duarte, Ishan Dave, Yogesh Singh Rawat, Mubarak Shah
For tubelet extraction, we propose a localization network which takes a video clip as input and spatio-temporally detects potential foreground regions at multiple scales to generate action tubelets.