Search Results for author: Zixin Guo

Found 4 papers, 1 paper with code

Impact of Design Decisions in Scanpath Modeling

No code implementations · 14 May 2024 · Parvin Emami, Yue Jiang, Zixin Guo, Luis A. Leiva

We show that even small variations of these design parameters have a noticeable impact on standard evaluation metrics such as DTW or Eyenalysis.

EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

No code implementations · 15 Apr 2024 · Yue Jiang, Zixin Guo, Hamed Rezazadegan Tavakoli, Luis A. Leiva, Antti Oulasvirta

From a visual perception perspective, modern graphical user interfaces (GUIs) comprise a complex graphics-rich two-dimensional visuospatial arrangement of text, images, and interactive objects such as buttons and menus.

reinforcement-learning

PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting

No code implementations · 14 Jul 2023 · Zixin Guo, Tzu-Jui Julius Wang, Selen Pehlivan, Abduljalil Radman, Jorma Laaksonen

To further reduce the amount of supervision, we propose Prompts-in-The-Loop (PiTL) that prompts knowledge from large language models (LLMs) to describe images.

Cross-Modal Retrieval · Object · +1

CLIP4IDC: CLIP for Image Difference Captioning

1 code implementation · 1 Jun 2022 · Zixin Guo, Tzu-Jui Julius Wang, Jorma Laaksonen

Different from directly fine-tuning CLIP to generate sentences, we introduce an adaptation training process to adapt CLIP's visual encoder to capture and align differences in image pairs based on the textual descriptions.

Domain Adaptation · Image Classification
