no code implementations • 5 Apr 2024 • Manjin Kim, Paul Hongsuck Seo, Cordelia Schmid, Minsu Cho
We introduce a new attention mechanism, dubbed structural self-attention (StructSA), that leverages rich correlation patterns naturally emerging in key-query interactions of attention.
Ranked #4 on Action Recognition on Diving-48
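The abstract above describes attention scores built from correlation *patterns* in key-query interactions rather than from single correlation values. The following is a minimal 1-D sketch of that idea, assuming a toy setting: raw query-key correlation maps are filtered with a small local kernel before the softmax, so each score reflects the local structure of the correlation map. This is an illustrative simplification, not the paper's exact StructSA formulation.

```python
import numpy as np

def structural_self_attention(x, w_q, w_k, w_v, kernel):
    """Toy 1-D 'structural' attention sketch (hypothetical, not the
    paper's implementation): filter each row of the query-key
    correlation map with a local kernel before softmax, so attention
    weights respond to local correlation patterns."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    # standard scaled key-query correlation map, shape (T, T)
    corr = q @ k.T / np.sqrt(q.shape[-1])
    # convolve each row with a small structural kernel (same-length output)
    pad = len(kernel) // 2
    padded = np.pad(corr, ((0, 0), (pad, pad)))
    struct = np.stack([np.convolve(row, kernel, mode="valid") for row in padded])
    # softmax over the filtered scores, then the usual weighted sum of values
    attn = np.exp(struct - struct.max(axis=-1, keepdims=True))
    attn /= attn.sum(axis=-1, keepdims=True)
    return attn @ v
```

With an identity-like kernel `[0, 1, 0]` this reduces to ordinary self-attention; a smoothing kernel such as `[0.25, 0.5, 0.25]` makes each score depend on neighboring correlations, which is the intuition behind exploiting structural patterns.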
no code implementations • CVPR 2022 • Dayoung Gong, Joonseok Lee, Manjin Kim, Seong Jong Ha, Minsu Cho
The task of predicting future actions from a video is crucial for real-world agents interacting with others.

1 code implementation • NeurIPS 2021 • Manjin Kim, Heeseung Kwon, Chunyu Wang, Suha Kwak, Minsu Cho
Convolution has arguably been the most important feature transform for modern neural networks, driving the advance of deep learning.
Ranked #11 on Action Recognition on Diving-48
1 code implementation • ICCV 2021 • Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho
With a sufficiently large neighborhood in space and time, it effectively captures long-term interactions and fast motion in video, leading to robust action recognition.
Ranked #18 on Action Recognition on Something-Something V1 (using extra training data)
1 code implementation • 1 Jan 2021 • Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho
We leverage the whole volume of STSS and let our model learn to extract an effective motion representation from it.
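The two entries above both build on a spatio-temporal self-similarity (STSS) volume: for each position in a video feature map, the similarity of its feature to every feature in a local space-time neighborhood. A minimal sketch of computing such a volume, assuming cosine similarity and a cubic neighborhood (an illustrative simplification, not the exact published implementation):

```python
import numpy as np

def stss_volume(feats, window=1):
    """Sketch of a spatio-temporal self-similarity (STSS) volume:
    for each (t, h, w) position, cosine similarity between its feature
    and every feature in a (2*window+1)^3 space-time neighborhood.
    Hypothetical illustration, not the paper's exact code."""
    T, H, W, C = feats.shape
    # L2-normalize features so dot products are cosine similarities
    norm = feats / (np.linalg.norm(feats, axis=-1, keepdims=True) + 1e-8)
    k = 2 * window + 1
    out = np.zeros((T, H, W, k, k, k))
    # zero-pad the three space-time axes, leave channels untouched
    padded = np.pad(norm, ((window, window),) * 3 + ((0, 0),))
    for dt in range(k):
        for dh in range(k):
            for dw in range(k):
                shifted = padded[dt:dt + T, dh:dh + H, dw:dw + W]
                out[..., dt, dh, dw] = (norm * shifted).sum(-1)
    return out
```

The resulting `(T, H, W, k, k, k)` tensor is the "whole volume of STSS" a model can consume: the center offset is always self-similarity 1, and the off-center entries encode how appearance moves across neighboring frames, which is what makes the volume a useful motion representation.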
2 code implementations • ECCV 2020 • Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho
Because frame-by-frame optical flow requires heavy computation, incorporating motion information has remained a major computational bottleneck for video understanding.
Ranked #1 on Video Classification on Something-Something V2