no code implementations • 29 Nov 2023 • Sanghwan Kim, Daoji Huang, Yongqin Xian, Otmar Hilliges, Luc van Gool, Xi Wang
Understanding human activity is a crucial yet intricate task in egocentric vision, a field that focuses on capturing visual perspectives from the camera wearer's viewpoint.
1 code implementation • 28 Jun 2023 • Daoji Huang, Otmar Hilliges, Luc van Gool, Xi Wang
We present Palm, a solution to the Long-Term Action Anticipation (LTA) task utilizing vision-language and large language models.