Search Results for author: Mengxi Zhang

Found 8 papers, 5 papers with code

Dense Connector for MLLMs

1 code implementation • 22 May 2024 • Huanjin Yao, Wenhao Wu, Taojiannan Yang, Yuxin Song, Mengxi Zhang, Haocheng Feng, Yifan Sun, Zhiheng Li, Wanli Ouyang, Jingdong Wang

We witness the rise of larger and higher-quality instruction datasets, as well as the involvement of larger-sized LLMs.

Video Understanding

Paper
Code

Automated Multi-level Preference for MLLMs

1 code implementation • 18 May 2024 • Mengxi Zhang, Wenhao Wu, Yu Lu, Yuxin Song, Kang Rong, Huanjin Yao, Jianbo Zhao, Fanglong Liu, Yifan Sun, Haocheng Feng, Jingdong Wang

To verify our viewpoint, we present the Automated Multi-level Preference (AMP) framework for MLLMs.

Hallucination

Paper
Code

HARIS: Human-Like Attention for Reference Image Segmentation

no code implementations • 17 May 2024 • Mengxi Zhang, Heqing Lian, Yiming Liu, Jie Chen

In this paper, we propose a referring image segmentation method called HARIS, which introduces the Human-Like Attention mechanism and uses the parameter-efficient fine-tuning (PEFT) framework.

Image Segmentation Segmentation +1

Paper
Add Code

DeeDSR: Towards Real-World Image Super-Resolution via Degradation-Aware Stable Diffusion

no code implementations • 31 Mar 2024 • Chunyang Bi, Xin Luo, Sheng Shen, Mengxi Zhang, Huanjing Yue, Jingyu Yang

In the second stage, we integrate a degradation-aware module into a simplified ControlNet, enabling flexible adaptation to various degradations based on the learned representations.

Contrastive Learning Denoising +1

Paper
Add Code

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

2 code implementations • 27 Nov 2023 • Wenhao Wu, Huanjin Yao, Mengxi Zhang, Yuxin Song, Wanli Ouyang, Jingdong Wang

Our study centers on the evaluation of GPT-4's linguistic and visual capabilities in zero-shot visual recognition tasks: Firstly, we explore the potential of its generated rich textual descriptions across various categories to enhance recognition performance without any training.

Zero-Shot Learning

871

Paper
Code

RISAM: Referring Image Segmentation via Mutual-Aware Attention Features

no code implementations • 27 Nov 2023 • Mengxi Zhang, Yiming Liu, Xiangjun Yin, Huanjing Yue, Jingyu Yang

Referring image segmentation (RIS) aims to segment a particular region based on a language expression prompt.

Decoder Image Segmentation +2

Paper
Add Code

RT-SRTS: Angle-Agnostic Real-Time Simultaneous 3D Reconstruction and Tumor Segmentation from Single X-Ray Projection

1 code implementation • 12 Oct 2023 • Miao Zhu, Qiming Fu, Bo Liu, Mengxi Zhang, Bojian Li, Xiaoyan Luo, Fugen Zhou

In this study, a novel imaging method RT-SRTS is proposed which integrates 3D imaging and tumor segmentation into one network based on multi-task learning (MTL) and achieves real-time simultaneous 3D reconstruction and tumor segmentation from a single X-ray projection at any angle.

3D Reconstruction Multi-Task Learning +2

Paper
Code

Flexible Alignment Super-Resolution Network for Multi-Contrast MRI

1 code implementation • 7 Oct 2022 • Yiming Liu, Mengxi Zhang, Weiqin Zhang, Bo Jiang, Bo Hou, Dan Liu, Jie Chen, Heqing Lian

To tackle this problem, we propose the Flexible Alignment Super-Resolution Network (FASR-Net) for multi-contrast MRI Super-Resolution.

Super-Resolution

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.