Search Results for author: Jiahuan Li

Found 7 papers, 0 papers with code

Data Augmentation for Low-resource Word Segmentation and POS Tagging of Ancient Chinese Texts

no code implementations • LT4HALA (LREC) 2022 • Yutong Shen, Jiahuan Li, ShuJian Huang, Yi Zhou, Xiaopeng Xie, Qinxin Zhao

Although SikuRoberta significantly boosts performance on WSG and POS tasks on ancient Chinese texts, the lack of labeled data still limits the performance of the model.

Data Augmentation Language Modelling +3

Paper
Add Code

MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation

no code implementations • 14 Mar 2024 • Jiahuan Li, Shanbo Cheng, ShuJian Huang, Jiajun Chen

Large Language Models (LLM) have demonstrated their strong ability in the field of machine translation (MT), yet they suffer from high computational cost and latency.

Knowledge Distillation Machine Translation +1

Paper
Add Code

Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions

no code implementations • 24 May 2023 • Jiahuan Li, Hao Zhou, ShuJian Huang, Shanbo Cheng, Jiajun Chen

Secondly, we find that LLMs' ability to carry out translation instructions relies on the understanding of translation instructions and the alignment among different languages.

Language Modelling Translation

Paper
Add Code

Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation

no code implementations • 17 Dec 2022 • Jiahuan Li, Shanbo Cheng, Zewei Sun, Mingxuan Wang, ShuJian Huang

The effectiveness of kNNMT directly depends on the quality of retrieved neighbors.

Machine Translation NMT +2

Paper
Add Code

When is Char Better Than Subword: A Systematic Study of Segmentation Algorithms for Neural Machine Translation

no code implementations • ACL 2021 • Jiahuan Li, Yutong Shen, ShuJian Huang, Xinyu Dai, Jiajun Chen

Subword segmentation algorithms have been a \textit{de facto} choice when building neural machine translation systems.

Machine Translation NMT +2

Paper
Add Code

DirectQE: Direct Pretraining for Machine Translation Quality Estimation

no code implementations • 15 May 2021 • Qu Cui, ShuJian Huang, Jiahuan Li, Xiang Geng, Zaixiang Zheng, Guoping Huang, Jiajun Chen

However, we argue that there are gaps between the predictor and the estimator in both data quality and training objectives, which preclude QE models from benefiting from a large number of parallel corpora more directly.

Machine Translation Translation

Paper
Add Code

Explicit Semantic Decomposition for Definition Generation

no code implementations • ACL 2020 • Jiahuan Li, Yu Bao, Shu-Jian Huang, Xin-yu Dai, Jia-Jun Chen

Definition generation, which aims to automatically generate dictionary definitions for words, has recently been proposed to assist the construction of dictionaries and help people understand unfamiliar texts.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.