Search Results for author: Yi Xin

Found 8 papers, 1 papers with code

Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model

no code implementations24 May 2024 Mingyang Yi, Aoxue Li, Yi Xin, Zhenguo Li

We conclude that in the earlier generation stage, the image is mostly decided by the special token [\texttt{EOS}] in the text prompt, and the information in the text prompt is already conveyed in this stage.

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

no code implementations23 May 2024 Ting Liu, Xuyang Liu, Liangtao Shi, Zunnan Xu, Siteng Huang, Yi Xin, Quanjun Yin

Sparse-Tuning efficiently fine-tunes the pre-trained ViT by sparsely preserving the informative tokens and merging redundant ones, enabling the ViT to focus on the foreground while reducing computational costs on background regions in the images.

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey

1 code implementation3 Feb 2024 Yi Xin, Siqi Luo, Haodi Zhou, Junlong Du, Xiaohong Liu, Yue Fan, Qing Li, Yuntao Du

Large-scale pre-trained vision models (PVMs) have shown great potential for adaptability across various downstream vision tasks.

Transfer Learning

VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding

no code implementations14 Dec 2023 Yi Xin, Junlong Du, Qiang Wang, Zhiwen Lin, Ke Yan

Extensive experiments on four dense scene understanding tasks demonstrate the superiority of VMT-Adapter(-Lite), achieving a 3. 96%(1. 34%) relative improvement compared to single-task full fine-tuning, while utilizing merely ~1% (0. 36%) trainable parameters of the pre-trained model.

Scene Understanding Transfer Learning

MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning

no code implementations14 Dec 2023 Yi Xin, Junlong Du, Qiang Wang, Ke Yan, Shouhong Ding

On the one hand, to maximize the complementarity of tasks with high similarity, we utilize a gradient-driven task grouping method that partitions tasks into several disjoint groups and assign a group-shared MmAP to each group.

Decoder Language Modelling +2

Cannot find the paper you are looking for? You can Submit a new open access paper.