no code implementations • 10 Apr 2024 • Jiahao Wang, Wenqi Shao, Mengzhao Chen, Chengyue Wu, Yong liu, Kaipeng Zhang, Songyang Zhang, Kai Chen, Ping Luo
We first "LLaMAfy" a standard ViT step-by-step to align with LLaMA's architecture, and find that directly applying a causal mask to the self-attention brings an attention collapse issue, resulting in the failure of network training.
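For context, "causal masking" means each token may attend only to itself and earlier positions, as in LLaMA-style decoders. A minimal NumPy sketch of this mechanism (an illustration, not the authors' code; the function name and shapes are assumptions):

```python
import numpy as np

def causal_self_attention(q, k, v):
    """Single-head attention with a causal mask.
    q, k, v: (seq_len, dim). Each position attends only to itself
    and earlier positions, never to future tokens."""
    seq_len, dim = q.shape
    scores = q @ k.T / np.sqrt(dim)  # (seq_len, seq_len) attention logits
    # Upper triangle (strictly above the diagonal) = future positions.
    future = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
    scores = np.where(future, -np.inf, scores)  # block attention to the future
    # Numerically stable softmax over each row.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

rng = np.random.default_rng(0)
q, k, v = (rng.normal(size=(4, 8)) for _ in range(3))
out = causal_self_attention(q, k, v)
# The first token can only attend to itself, so its output equals v[0].
assert np.allclose(out[0], v[0])
```

Note that the first row of the attention matrix collapses to a one-hot weight on position 0, which hints at why imposing such a mask on a ViT, whose patches have no natural left-to-right order, can starve early tokens of context.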
2 code implementations • 19 Feb 2024 • Zeyu Lu, Zidong Wang, Di Huang, Chengyue Wu, Xihui Liu, Wanli Ouyang, Lei Bai
Nature is infinitely resolution-free.
1 code implementation • 4 Jan 2024 • Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ping Luo, Ying Shan
Humans generally acquire new skills without compromising the old; however, the opposite holds for Large Language Models (LLMs), e.g., from LLaMA to CodeLLaMA.
1 code implementation • 27 Apr 2023 • Chengyue Wu, Teng Wang, Yixiao Ge, Zeyu Lu, Ruisong Zhou, Ying Shan, Ping Luo
Foundation models have achieved great advances in multi-task learning with a unified interface of unimodal and multimodal tasks.
no code implementations • 24 Apr 2023 • Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Lei Bai, Yu Qiao, Xihui Liu
To mitigate those limitations, we propose Hierarchical Diffusion Autoencoders (HDAE) that exploit the fine-grained-to-abstract and low-level-to-high-level feature hierarchy for the latent space of diffusion models.
no code implementations • 2 Dec 2022 • ZiRui Wang, Shaoming Duan, Chengyue Wu, Wenhao Lin, Xinyu Zha, Peiyi Han, Chuanyi Liu
To address this problem, we propose a generative augmentation framework in swarm learning called SL-GAN, which augments the non-IID data by generating the synthetic data from participants.
no code implementations • 3 May 2022 • Kalina P. Slavkova, Julie C. DiCarlo, Viraj Wadhwa, Chengyue Wu, John Virostko, Sidharth Kumar, Thomas E. Yankeelov, Jonathan I. Tamir
We conclude that the use of an untrained neural network together with a physics-based regularization loss shows promise as a measure for determining the optimal stopping point in training without relying on fully-sampled ground truth data.
no code implementations • 26 Mar 2022 • Chunnan Wang, Xingyu Chen, Chengyue Wu, Hongzhi Wang
We allow the effective combination of design experience from different sources, so as to create an effective search space containing a variety of TSF models to support different TSF tasks.
no code implementations • 14 Jan 2022 • Marvin Fritz, Tobias Köppl, J. Tinsley Oden, Andreas Wagner, Barbara Wohlmuth, Chengyue Wu
In this work, we present mixed dimensional models for simulating blood flow and transport processes in breast tissue and the vascular tree supplying it.