Search Results for author: Wushao Wen

Found 10 papers, 7 papers with code

Mirror Gradient: Towards Robust Multimodal Recommender Systems via Exploring Flat Local Minima

1 code implementation • 17 Feb 2024 • Shanshan Zhong, Zhongzhan Huang, Daifeng Li, Wushao Wen, Jinghui Qin, Liang Lin

This strategy can implicitly enhance the model's robustness during the optimization process, mitigating instability risks arising from multimodal information inputs.

Multimodal Recommendation

Paper
Code

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

1 code implementation • 5 Dec 2023 • Shanshan Zhong, Zhongzhan Huang, ShangHua Gao, Wushao Wen, Liang Lin, Marinka Zitnik, Pan Zhou

To this end, we study LLMs on the popular Oogiri game which needs participants to have good creativity and strong associative thinking for responding unexpectedly and humorously to the given image, text, or both, and thus is suitable for LoT study.

Logical Reasoning

245

Paper
Code

LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem

1 code implementation • 9 May 2023 • Shanshan Zhong, Wushao Wen, Jinghui Qin, Qiangpu Chen, Zhongzhan Huang

In computer vision, the performance of deep neural networks (DNNs) is highly related to the feature extraction ability, i. e., the ability to recognize and focus on key pixel regions in an image.

Paper
Code

SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models

1 code implementation • 9 May 2023 • Shanshan Zhong, Zhongzhan Huang, Wushao Wen, Jinghui Qin, Liang Lin

Our approach can make text-to-image diffusion models easier to use with better user experience, which demonstrates our approach has the potential for further advancing the development of user-friendly text-to-image generation models by bridging the semantic gap between simple narrative prompts and complex keyword-based prompts.

Knowledge Distillation Text-to-Image Generation

106

Paper
Code

ASR: Attention-alike Structural Re-parameterization

no code implementations • 13 Apr 2023 • Shanshan Zhong, Zhongzhan Huang, Wushao Wen, Jinghui Qin, Liang Lin

This technique enables the mitigation of the extra costs for performance improvement during training, such as parameter size and inference time, through these transformations during inference, and therefore SRP has great potential for industrial and practical applications.

Paper
Add Code

Deepening Neural Networks Implicitly and Locally via Recurrent Attention Strategy

no code implementations • 27 Oct 2022 • Shanshan Zhong, Wushao Wen, Jinghui Qin, Zhongzhan Huang

More and more empirical and theoretical evidence shows that deepening neural networks can effectively improve their performance under suitable training settings.

Paper
Add Code

Switchable Self-attention Module

1 code implementation • 13 Sep 2022 • Shanshan Zhong, Wushao Wen, Jinghui Qin

Attention mechanism has gained great success in vision recognition.

Paper
Code

Mix-Pooling Strategy for Attention Mechanism

1 code implementation • 22 Aug 2022 • Shanshan Zhong, Wushao Wen, Jinghui Qin

Recently many effective attention modules are proposed to boot the model performance by exploiting the internal information of convolutional neural networks in computer vision.

Paper
Code

Difficulty-aware Image Super Resolution via Deep Adaptive Dual-Network

1 code implementation • 11 Apr 2019 • Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen

To identify whether a region is easy or hard, we propose a novel image difficulty recognition network based on PSNR prior.

Image Super-Resolution

Paper
Code

PIRM Challenge on Perceptual Image Enhancement on Smartphones: Report

no code implementations • 3 Oct 2018 • Andrey Ignatov, Radu Timofte, Thang Van Vu, Tung Minh Luu, Trung X. Pham, Cao Van Nguyen, Yongwoo Kim, Jae-Seok Choi, Munchurl Kim, Jie Huang, Jiewen Ran, Chen Xing, Xingguang Zhou, Pengfei Zhu, Mingrui Geng, Yawei Li, Eirikur Agustsson, Shuhang Gu, Luc van Gool, Etienne de Stoutz, Nikolay Kobyshev, Kehui Nie, Yan Zhao, Gen Li, Tong Tong, Qinquan Gao, Liu Hanwen, Pablo Navarrete Michelini, Zhu Dan, Hu Fengshuo, Zheng Hui, Xiumei Wang, Lirui Deng, Rang Meng, Jinghui Qin, Yukai Shi, Wushao Wen, Liang Lin, Ruicheng Feng, Shixiang Wu, Chao Dong, Yu Qiao, Subeesh Vasu, Nimisha Thekke Madam, Praveen Kandula, A. N. Rajagopalan, Jie Liu, Cheolkon Jung

This paper reviews the first challenge on efficient perceptual image enhancement with the focus on deploying deep learning models on smartphones.

Image Enhancement Image Super-Resolution

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.