1 code implementation • 23 May 2024 • Peng Wang, Zexi Li, Ningyu Zhang, Ziwen Xu, Yunzhi Yao, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen
In WISE, we design a dual parametric memory scheme, which consists of a main memory for the pretrained knowledge and a side memory for the edited knowledge.
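To make the dual-memory idea concrete, here is a minimal, illustrative PyTorch sketch of such a layer. The activation-distance router and the `routing_threshold` parameter are assumptions made for illustration, not WISE's exact routing mechanism.

```python
# Illustrative sketch of a dual-memory layer in the spirit of the snippet
# above; the router and threshold are assumed, not the paper's mechanism.
import copy
import torch
import torch.nn as nn

class DualMemoryFFN(nn.Module):
    def __init__(self, main_ffn: nn.Module, routing_threshold: float = 0.5):
        super().__init__()
        self.main = main_ffn                 # frozen: pretrained knowledge
        self.side = copy.deepcopy(main_ffn)  # trainable: edited knowledge
        self.threshold = routing_threshold
        for p in self.main.parameters():
            p.requires_grad_(False)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        main_out, side_out = self.main(h), self.side(h)
        # Route to the side memory only when its activation departs enough
        # from the main memory's, i.e., the input falls in an edited region.
        score = (side_out - main_out).norm(dim=-1, keepdim=True)
        return torch.where(score > self.threshold, side_out, main_out)
```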
no code implementations • 23 May 2024 • Zexi Li, Lingzhi Gao, Chao Wu
Generative artificial intelligence (GenAI) has made significant progress in understanding world knowledge and generating content from human languages across various modalities, such as text-to-text large language models, text-to-image Stable Diffusion, and text-to-video Sora.
no code implementations • 16 May 2024 • Jinge Wu, Hang Dong, Zexi Li, Arijit Patra, Honghan Wu
The infrequency and heterogeneity of clinical presentations in rare diseases often lead to underdiagnosis and their exclusion from structured datasets.
no code implementations • 29 Feb 2024 • Zexi Li, Jie Lin, Zhiqi Li, Didi Zhu, Chao Wu
Bridging the gap between LMC and FL, in this paper, we leverage fixed anchor models to empirically and theoretically study the transitivity property of connectivity from two models (LMC) to a group of models (model fusion in FL).
no code implementations • 19 Feb 2024 • Didi Zhu, Zhongyi Sun, Zexi Li, Tao Shen, Ke Yan, Shouhong Ding, Kun Kuang, Chao Wu
Catastrophic forgetting emerges as a critical challenge when fine-tuning multi-modal large language models (MLLMs), where improving performance on unseen tasks often leads to a significant performance drop on the original tasks.
1 code implementation • 10 Feb 2024 • Rui Ye, Wenhao Wang, Jingyi Chai, Dihan Li, Zexi Li, Yinda Xu, Yaxin Du, Yanfeng Wang, Siheng Chen
Trained on massive publicly available data, large language models (LLMs) have demonstrated tremendous success across various fields.
no code implementations • 2 Feb 2024 • Zexi Li, Zhiqi Li, Jie Lin, Tao Shen, Tao Lin, Chao Wu
In deep learning, stochastic gradient descent often yields functionally similar yet widely scattered solutions in the weight space even under the same initialization, causing barriers in the Linear Mode Connectivity (LMC) landscape.
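The LMC barrier mentioned here is typically measured by evaluating the loss along the straight line between two weight vectors. Below is a minimal sketch of that measurement; the caller-supplied `loss_fn`, which evaluates a model on a fixed batch, is an assumed helper.

```python
# Minimal sketch of measuring an LMC barrier between two trained models.
# `loss_fn(model) -> float` is an assumed caller-supplied evaluator on a
# fixed batch; only floating-point tensors are interpolated.
import copy
import torch

def lmc_barrier(model_a, model_b, loss_fn, steps: int = 11) -> float:
    sa, sb = model_a.state_dict(), model_b.state_dict()
    probe = copy.deepcopy(model_a)
    alphas = [i / (steps - 1) for i in range(steps)]
    losses = []
    for alpha in alphas:
        interp = {k: torch.lerp(sa[k], sb[k], alpha)
                  if sa[k].is_floating_point() else sa[k] for k in sa}
        probe.load_state_dict(interp)
        losses.append(loss_fn(probe))
    # Barrier: largest gap between the path loss and the linear baseline
    # connecting the endpoint losses; near zero means the two models are
    # linearly mode-connected.
    return max(l - ((1 - a) * losses[0] + a * losses[-1])
               for l, a in zip(losses, alphas))
```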
1 code implementation • 19 Dec 2023 • Ruiyuan Zhang, Jiaxiang Liu, Zexi Li, Hao Dong, Jie Fu, Chao Wu
Therefore, there is a need to develop a scalable framework for geometric fracture assembly without relying on semantic information.
no code implementations • 30 Nov 2023 • Lingzhi Gao, Zexi Li, Yang Lu, Chao Wu
A typical line of pFL work focuses on label distribution skew and adopts a decoupling scheme, where the model is split into a common feature extractor and two prediction heads (generic and personalized).
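The decoupling described in the snippet can be sketched as follows; the layer sizes are placeholder assumptions, and the convention shown (aggregate the extractor and generic head on the server, keep the personalized head local) is the usual reading of such schemes.

```python
# Minimal sketch of a decoupled pFL client model: shared extractor plus a
# generic head (aggregated across clients) and a personalized head (local).
import torch.nn as nn

class DecoupledClientModel(nn.Module):
    def __init__(self, in_dim: int = 784, feat_dim: int = 128, num_classes: int = 10):
        super().__init__()
        self.extractor = nn.Sequential(
            nn.Flatten(), nn.Linear(in_dim, feat_dim), nn.ReLU())
        self.generic_head = nn.Linear(feat_dim, num_classes)   # sent to the server
        self.personal_head = nn.Linear(feat_dim, num_classes)  # never leaves the client

    def forward(self, x, personalized: bool = True):
        z = self.extractor(x)
        head = self.personal_head if personalized else self.generic_head
        return head(z)
```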
no code implementations • 28 Jun 2023 • Didi Zhu, Zexi Li, Min Zhang, Junkun Yuan, Yunfeng Shao, Jiashuo Liu, Kun Kuang, Yinchuan Li, Chao Wu
It is found that the NC optimality of text-to-image representations is positively correlated with downstream generalizability, a correlation that becomes more pronounced under class-imbalanced settings.
no code implementations • ICCV 2023 • Didi Zhu, Yinchuan Li, Junkun Yuan, Zexi Li, Kun Kuang, Chao Wu
To address this issue, we propose a Universal Attention Matching (UniAM) framework that exploits the self-attention mechanism in vision transformers to capture crucial object information.
Ranked #1 on Universal Domain Adaptation on Office-Home
no code implementations • 12 Apr 2023 • Zexi Li, Qunwei Li, Yi Zhou, Wenliang Zhong, Guannan Zhang, Chao Wu
Federated learning (FL) is a popular paradigm of edge computing that does not compromise users' privacy.
no code implementations • 6 Apr 2023 • Chenrui Wu, Zexi Li, Fangxin Wang, Chao Wu
It includes a noise-resilient local solver and a robust global aggregator.
1 code implementation • ICCV 2023 • Zexi Li, Xinyi Shang, Rui He, Tao Lin, Chao Wu
Recent advances in neural collapse have shown that the classifiers and feature prototypes under perfect training scenarios collapse into an optimal structure called simplex equiangular tight frame (ETF).
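For reference, a simplex ETF with K classes in d dimensions (d >= K) can be constructed as M = sqrt(K/(K-1)) * U (I_K - (1/K) 1 1^T), where U has orthonormal columns; a minimal sketch of this standard construction follows.

```python
# Minimal sketch of constructing a simplex equiangular tight frame (ETF):
# K unit-norm class prototypes with equal pairwise cosine -1/(K-1), the
# optimal structure that classifiers collapse to under neural collapse.
import torch

def simplex_etf(num_classes: int, dim: int) -> torch.Tensor:
    assert dim >= num_classes
    # Orthonormal columns U via reduced QR of a random Gaussian matrix.
    u, _ = torch.linalg.qr(torch.randn(dim, num_classes))
    center = torch.eye(num_classes) - torch.ones(num_classes, num_classes) / num_classes
    scale = (num_classes / (num_classes - 1)) ** 0.5
    return scale * u @ center  # columns are the class prototypes

M = simplex_etf(num_classes=10, dim=64)
gram = M.T @ M  # ~1 on the diagonal, ~-1/9 off-diagonal for K = 10
```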
1 code implementation • 14 Feb 2023 • Zexi Li, Tao Lin, Xinyi Shang, Chao Wu
In federated learning (FL), weighted aggregation of local models is conducted to generate a global model, and the aggregation weights are normalized (the sum of weights is 1) and proportional to the local data sizes.
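A minimal sketch of this standard (FedAvg-style) aggregation rule is shown below; it assumes floating-point parameters and that each client supplies a state dict plus its local data size.

```python
# Minimal sketch of weighted aggregation as described above: weights are
# proportional to local data sizes and normalized to sum to 1.
def aggregate(local_states, data_sizes):
    total = float(sum(data_sizes))
    weights = [n / total for n in data_sizes]  # normalized aggregation weights
    return {k: sum(w * s[k] for w, s in zip(weights, local_states))
            for k in local_states[0]}

# Usage: global_model.load_state_dict(
#     aggregate([m.state_dict() for m in client_models], [600, 400, 1000]))
```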
no code implementations • 23 Mar 2022 • Zexi Li, Jiaxun Lu, Shuang Luo, Didi Zhu, Yunfeng Shao, Yinchuan Li, Zhimeng Zhang, Yongheng Wang, Chao Wu
In the literature, centralized clustered FL algorithms require the number of clusters to be assumed in advance and hence cannot effectively explore the latent relationships among clients.
no code implementations • 26 Oct 2021 • Shuang Luo, Didi Zhu, Zexi Li, Chao Wu
Although federated learning endows distributed clients with a cooperative training mode under the premise of protecting data privacy and security, clients are still vulnerable to adversarial samples due to a lack of robustness.