no code implementations • 6 Jan 2024 • Nianwen Si, Hao Zhang, WeiQiang Zhang
Large language models are known for encoding a vast amount of factual knowledge, but this knowledge often becomes outdated due to the ever-changing nature of external information.
no code implementations • 27 Nov 2023 • Nianwen Si, Hao Zhang, Heyu Chang, Wenlin Zhang, Dan Qu, WeiQiang Zhang
We further present the evaluation datasets used in existing methods, and conclude this survey by discussing ongoing challenges and future directions.
no code implementations • 3 Oct 2023 • Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Xiaolin Jiao
The training of LST consists of two stages: (1) Modality adjustment, where the adapter is tuned to align the speech representation with the text embedding space, and (2) Downstream task fine-tuning, where both the adapter and the LLM are trained to optimize performance on the E2EST task.
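A minimal PyTorch sketch of this two-stage schedule is given below; the adapter architecture, dimensions, LLM placeholder, and loss are illustrative assumptions, not the paper's actual implementation.

```python
import torch
import torch.nn as nn

# Illustrative stand-ins (assumed, not the authors' code): a lightweight adapter
# that maps speech features into the LLM embedding space, and a small
# Transformer acting as a placeholder for the frozen/fine-tuned LLM backbone.
class Adapter(nn.Module):
    def __init__(self, speech_dim=256, llm_dim=512):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(speech_dim, llm_dim), nn.GELU(), nn.Linear(llm_dim, llm_dim)
        )

    def forward(self, speech_feats):
        # Align speech representations with the text embedding space.
        return self.proj(speech_feats)

adapter = Adapter()
llm = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=512, nhead=8, batch_first=True),
    num_layers=2,
)  # placeholder for a real LLM

def train_stage(trainable, frozen, num_steps):
    """Generic loop: only modules in `trainable` receive gradient updates."""
    for m in frozen:
        m.requires_grad_(False)
    for m in trainable:
        m.requires_grad_(True)
    params = [p for m in trainable for p in m.parameters()]
    optim = torch.optim.AdamW(params, lr=1e-4)
    for _ in range(num_steps):
        speech_feats = torch.randn(2, 50, 256)   # dummy speech features
        target = torch.randn(2, 50, 512)         # dummy supervision signal
        hidden = llm(adapter(speech_feats))
        loss = nn.functional.mse_loss(hidden, target)
        optim.zero_grad()
        loss.backward()
        optim.step()

# Stage 1: modality adjustment -- tune only the adapter, keep the LLM frozen.
train_stage([adapter], [llm], num_steps=10)
# Stage 2: downstream fine-tuning -- update both adapter and LLM on the E2EST task.
train_stage([adapter, llm], [], num_steps=10)
```

The key design point the sketch illustrates is that the LLM stays frozen while the adapter learns the modality mapping, and only afterwards are both components updated jointly.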
no code implementations • 20 Apr 2023 • Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Wei-Qiang Zhang
However, the final model often performs worse on the MT task than the MT model trained alone, which indicates that the knowledge transfer ability of this method is also limited.
no code implementations • 20 Apr 2023 • Hao Zhang, Nianwen Si, Yaqi Chen, Wenlin Zhang, Xukui Yang, Dan Qu, Zhen Li
Existing approaches often attempt to transfer knowledge from a powerful machine translation (MT) model to a speech translation (ST) model through elaborate techniques, which often require transcriptions as extra input during training.