1 code implementation • 16 Apr 2024 • Enming Zhang, Bingke Zhu, Yingying Chen, Qinghai Miao, Ming Tang, Jinqiao Wang
This limitation restricts the capabilities of pretrained VLMs and can result in incorrect predictions in downstream tasks.
1 code implementation • 21 Mar 2024 • Zheng Zhang, Yeyao Ma, Enming Zhang, Xiang Bai
PSALM is a powerful extension of the Large Multi-modal Model (LMM) that addresses the challenges of segmentation tasks.
1 code implementation • 14 Dec 2023 • Shuailei Ma, Yuefeng Wang, Ying WEI, Jiaqi Fan, Enming Zhang, Xinyu Sun, Peihao Chen
Ablation experiments demonstrate that both of them are effective in mitigating the impact of open-world knowledge distillation on the learning of known objects.
1 code implementation • 6 Jun 2023 • Wenwen Yu, MingYu Liu, Biao Yang, Enming Zhang, Deqiang Jiang, Xing Sun, Yuliang Liu, Xiang Bai
Text recognition in the wild is a long-standing problem in computer vision.
1 code implementation • 21 Mar 2023 • Shuailei Ma, Yuefeng Wang, Ying WEI, Peihao Chen, Zhixiang Ye, Jiaqi Fan, Enming Zhang, Thomas H. Li
We propose leveraging the VL as the "Brain" of the open-world detector by simply generating unknown labels.
no code implementations • 12 Jul 2022 • Yang Tan, Enming Zhang, Yang Li, Shao-Lun Huang, Xiao-Ping Zhang
We propose two novel transferability metrics, F-OTCE (Fast Optimal Transport based Conditional Entropy) and JC-OTCE (Joint Correspondence OTCE), to evaluate how much the source model (task) can benefit the learning of the target task, and to learn more transferable representations for cross-domain cross-task transfer learning.
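The core idea behind an OTCE-style metric can be sketched in a few lines: couple source and target samples with optimal transport over their features, use the coupling to induce a joint distribution over source and target labels, and score transferability as the negative conditional entropy of target labels given source labels. The sketch below is a minimal NumPy illustration of that idea, not the authors' implementation; the squared-Euclidean cost, the entropic Sinkhorn solver, the regularization value, and the function names `sinkhorn` and `f_otce` are all assumptions made for this example.

```python
import numpy as np

def sinkhorn(a, b, C, reg=0.1, n_iter=200):
    # Entropic-regularized optimal transport (Sinkhorn iterations).
    # a, b: source/target marginal weights; C: cost matrix.
    K = np.exp(-C / reg)
    u = np.ones_like(a)
    for _ in range(n_iter):
        v = b / (K.T @ u)
        u = a / (K @ v)
    # Coupling matrix: rows sum to ~a, columns sum to ~b.
    return u[:, None] * K * v[None, :]

def f_otce(Xs, Ys, Xt, Yt):
    # Illustrative transferability score: negative conditional entropy
    # H(Yt | Ys) under the label correspondence induced by the OT plan.
    # Higher (closer to 0) suggests an easier source-to-target transfer.
    C = ((Xs[:, None, :] - Xt[None, :, :]) ** 2).sum(-1)
    C = C / C.max()  # normalize cost for numerical stability
    a = np.full(len(Xs), 1.0 / len(Xs))
    b = np.full(len(Xt), 1.0 / len(Xt))
    P = sinkhorn(a, b, C)
    # Joint label distribution: accumulate coupling mass per label pair.
    cs, ct = int(Ys.max()) + 1, int(Yt.max()) + 1
    J = np.zeros((cs, ct))
    for i in range(len(Xs)):
        for j in range(len(Xt)):
            J[Ys[i], Yt[j]] += P[i, j]
    Py = J.sum(axis=1)  # marginal over source labels
    H = 0.0
    for ys in range(cs):
        for yt in range(ct):
            if J[ys, yt] > 1e-12:
                H -= J[ys, yt] * np.log(J[ys, yt] / Py[ys])
    return -H
```

On toy data, two well-separated clusters with consistent labels on both sides score higher (less negative) than the same features with randomly assigned target labels, matching the intuition that aligned label structure means better transferability.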