no code implementations • LT4HALA (LREC) 2022 • Yutong Shen, Jiahuan Li, ShuJian Huang, Yi Zhou, Xiaopeng Xie, Qinxin Zhao
Although SikuRoberta significantly boosts performance on WSG and POS tasks on ancient Chinese texts, the lack of labeled data still limits the performance of the model.
no code implementations • 14 Mar 2024 • Jiahuan Li, Shanbo Cheng, ShuJian Huang, Jiajun Chen
Large Language Models (LLM) have demonstrated their strong ability in the field of machine translation (MT), yet they suffer from high computational cost and latency.
no code implementations • 24 May 2023 • Jiahuan Li, Hao Zhou, ShuJian Huang, Shanbo Cheng, Jiajun Chen
Secondly, we find that LLMs' ability to carry out translation instructions relies on the understanding of translation instructions and the alignment among different languages.
no code implementations • 17 Dec 2022 • Jiahuan Li, Shanbo Cheng, Zewei Sun, Mingxuan Wang, ShuJian Huang
The effectiveness of kNNMT directly depends on the quality of retrieved neighbors.
no code implementations • ACL 2021 • Jiahuan Li, Yutong Shen, ShuJian Huang, Xinyu Dai, Jiajun Chen
Subword segmentation algorithms have been a \textit{de facto} choice when building neural machine translation systems.
no code implementations • 15 May 2021 • Qu Cui, ShuJian Huang, Jiahuan Li, Xiang Geng, Zaixiang Zheng, Guoping Huang, Jiajun Chen
However, we argue that there are gaps between the predictor and the estimator in both data quality and training objectives, which preclude QE models from benefiting from a large number of parallel corpora more directly.
no code implementations • ACL 2020 • Jiahuan Li, Yu Bao, Shu-Jian Huang, Xin-yu Dai, Jia-Jun Chen
Definition generation, which aims to automatically generate dictionary definitions for words, has recently been proposed to assist the construction of dictionaries and help people understand unfamiliar texts.