no code implementations • 6 Feb 2024 • Si Shen, Peijun Shen, Danhao Zhu
This paper presents RevOrder, a novel technique aimed at improving arithmetic operations in large language models (LLMs) by reversing the output digits in addition, subtraction, and n-digit by 1-digit (nD by 1D) multiplication tasks.
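As a rough illustration of the idea, the sketch below writes the answer of an addition with its digits reversed, least significant digit first, which is the order in which carries are resolved in schoolbook addition. The helper names (`reverse_digits`, `format_addition_example`, `decode_answer`) and the `a+b=answer` string layout are hypothetical choices made for this example; the listing does not specify the paper's actual data format.

```python
def reverse_digits(n: int) -> str:
    """Return the decimal digits of a non-negative integer, least significant first."""
    if n < 0:
        raise ValueError("this sketch only handles non-negative integers")
    return str(n)[::-1]


def format_addition_example(a: int, b: int) -> str:
    """Build one illustrative training string whose answer is digit-reversed.

    The "a+b=answer" layout is an assumption, not the paper's format.
    """
    return f"{a}+{b}={reverse_digits(a + b)}"


def decode_answer(reversed_digits: str) -> int:
    """Recover the ordinary integer from a digit-reversed answer string."""
    return int(reversed_digits[::-1])


if __name__ == "__main__":
    example = format_addition_example(123, 989)
    print(example)
    print(decode_answer(example.split("=")[1]))
```

Decoding is a plain string reversal, so a reversed answer can be mapped back to the usual notation after generation.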
no code implementations • 11 Jul 2023 • Dongbo Wang, Chang Liu, Zhixiao Zhao, Si Shen, Liu Liu, Bin Li, Haotian Hu, Mengcheng Wu, Litao Lin, Xue Zhao, Xiyu Wang
Amid the rapid development of large language models, we have trained and released GujiBERT and GujiGPT, foundation models designed specifically for intelligent information processing of ancient texts.
no code implementations • 16 Oct 2022 • Baijun Ji, Tong Zhang, Yicheng Zou, Bojie Hu, Si Shen
Multimodal machine translation (MMT) aims to improve translation quality by equipping the source sentence with its corresponding image.
1 code implementation • 9 Jun 2022 • Si Shen, Jiangfeng Liu, Litao Lin, Ying Huang, Lin Zhang, Chang Liu, Yutong Feng, Dongbo Wang
The academic literature of social sciences records human civilization and studies human social problems.
no code implementations • 3 May 2017 • Danhao Zhu, Si Shen, Xin-yu Dai, Jia-Jun Chen
Recurrent neural networks (RNNs) have been widely applied to sequence modeling.