no code implementations • EMNLP (ACL) 2021 • Baoli Zhang, Zhucong Li, Zhen Gan, Yubo Chen, Jing Wan, Kang Liu, Jun Zhao, Shengping Liu, Yafei Shi
2) Inconsistency Detector: CroAno employs a detector to locate corpus-level label inconsistency and provides users an interface to correct inconsistent entities in batches.
no code implementations • NAACL (SMM4H) 2021 • Tong Zhou, Zhucong Li, Zhen Gan, Baoli Zhang, Yubo Chen, Kun Niu, Jing Wan, Kang Liu, Jun Zhao, Yafei Shi, Weifeng Chong, Shengping Liu
This is the system description of the CASIA_Unisound team for Task 1, Task 7b, and Task 8 of the sixth Social Media Mining for Health Applications (SMM4H) shared task in 2021.
no code implementations • SMM4H (COLING) 2022 • Jia Fu, Sirui Li, Hui Ming Yuan, Zhucong Li, Zhen Gan, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu
This paper presents a description of our system in SMM4H-2022, where we participated in task 1a, task 4, and task 6 to task 10.
1 code implementation • EMNLP 2021 • Cheng Yan, Yuanzhe Zhang, Kang Liu, Jun Zhao, Yafei Shi, Shengping Liu
Biomedical Concept Normalization (BCN) is widely used in biomedical text processing as a fundamental module.
1 code implementation • 22 Mar 2024 • Huanxuan Liao, Shizhu He, Yao Xu, Yuanzhe Zhang, Kang Liu, Shengping Liu, Jun Zhao
Retrieval-Augmented-Generation and Generation-Augmented-Generation have been proposed to enhance the knowledge required for question answering over Large Language Models (LLMs).
no code implementations • 21 Feb 2024 • YuHeng Chen, Pengfei Cao, Yubo Chen, Yining Wang, Shengping Liu, Kang Liu, Jun Zhao
This paper provides a comprehensive definition of DKNs that covers both structural and functional aspects, pioneering the study of structures in PLMs' factual knowledge storage units.
no code implementations • 19 Feb 2024 • Xiaowei Yuan, Zhao Yang, Yequan Wang, Shengping Liu, Jun Zhao, Kang Liu
Large language models internalize enormous parametric knowledge during pre-training.
1 code implementation • 15 Feb 2024 • Yixuan Weng, Shizhu He, Kang Liu, Shengping Liu, Jun Zhao
This heightens the need to control model behaviors.
1 code implementation • 21 Nov 2023 • Tong Zhou, Yubo Chen, Pengfei Cao, Kang Liu, Jun Zhao, Shengping Liu
To this end, we present a pretraining corpus curation and assessment platform called Oasis -- a one-stop system for data quality improvement and quantification with user-friendly interactive interfaces.
no code implementations • 28 Aug 2023 • Baoli Zhang, Haining Xie, Pengfan Du, JunHao Chen, Pengfei Cao, Yubo Chen, Shengping Liu, Kang Liu, Jun Zhao
To this end, we propose the ZhuJiu benchmark, which has the following strengths: (1) Multi-dimensional ability coverage: We comprehensively evaluate LLMs across 7 ability dimensions covering 51 tasks.
1 code implementation • 20 Aug 2023 • Yixuan Weng, Zhiqi Wang, Huanxuan Liao, Shizhu He, Shengping Liu, Kang Liu, Jun Zhao
With the burgeoning development in the realm of large language models (LLMs), the demand for efficient incremental training tailored to specific industries and domains continues to increase.
1 code implementation • 19 Dec 2022 • Yixuan Weng, Minjun Zhu, Fei Xia, Bin Li, Shizhu He, Shengping Liu, Bin Sun, Kang Liu, Jun Zhao
By performing a backward verification of the answers that the LLM deduced for itself, we can obtain interpretable answer validation scores to select the candidate answer with the highest score.
1 code implementation • ACL 2021 • Tong Zhou, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Kun Niu, Weifeng Chong, Shengping Liu
The ICD coding task aims at assigning codes of the International Classification of Diseases to clinical notes.
no code implementations • 27 May 2021 • Yinyu Lan, Shizhu He, Xiangrong Zeng, Shengping Liu, Kang Liu, Jun Zhao
To address the above issues, this paper proposes two novel path-based reasoning methods to solve the sparsity issues of entities and paths respectively, which adopt the textual semantic information of entities and paths for MedKGC.
1 code implementation • 3 Nov 2020 • Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao, Xiangrong Zeng, Shengping Liu
Compared with cross-entropy loss that highly penalizes small shifts in triple order, the proposed bipartite matching loss is invariant to any permutation of predictions; thus, it can provide the proposed networks with a more accurate training signal by ignoring triple order and focusing on relation types and entities.
Ranked #1 on Joint Entity and Relation Extraction on NYT
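The permutation invariance described in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the dictionary layout of a prediction, and the brute-force search over assignments are all assumptions made for illustration (a practical system would typically use the Hungarian algorithm, and would also handle predictions matched to "no triple", which is omitted here).

```python
import itertools
import math

EPS = 1e-12  # avoids log(0); an illustrative choice


def pair_cost(pred, gold):
    """Negative log-likelihood of one gold triple under one prediction.

    pred: hypothetical layout, {"rel": {...}, "head": {...}, "tail": {...}},
          each mapping a candidate to its predicted probability.
    gold: (relation, head_entity, tail_entity)
    """
    rel, head, tail = gold
    return -(
        math.log(pred["rel"].get(rel, 0.0) + EPS)
        + math.log(pred["head"].get(head, 0.0) + EPS)
        + math.log(pred["tail"].get(tail, 0.0) + EPS)
    )


def ordered_cross_entropy(preds, golds):
    """Order-sensitive baseline: the i-th prediction is scored against the i-th gold triple."""
    return sum(pair_cost(p, g) for p, g in zip(preds, golds))


def bipartite_matching_loss(preds, golds):
    """Minimum total cost over all one-to-one assignments of gold triples to predictions.

    Brute force over permutations, so only suitable for small sets.
    """
    if len(golds) > len(preds):
        raise ValueError("need at least as many predictions as gold triples")
    best = math.inf
    for perm in itertools.permutations(range(len(preds)), len(golds)):
        cost = sum(pair_cost(preds[i], g) for i, g in zip(perm, golds))
        best = min(best, cost)
    return best
```

Because the loss takes the minimum over all assignments, reordering `preds` leaves `bipartite_matching_loss` unchanged, whereas `ordered_cross_entropy` grows sharply when correct predictions are emitted in a different order from the gold triples.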
no code implementations • ACL 2020 • Yuanzhe Zhang, Zhongtao Jiang, Tao Zhang, Shiwan Liu, Jiarun Cao, Kang Liu, Shengping Liu, Jun Zhao
Electronic Medical Records (EMRs) have become key components of modern medical care systems.
no code implementations • ACL 2020 • Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, Weifeng Chong
Specifically, we propose a hyperbolic representation method to leverage the code hierarchy.
no code implementations • ACL 2020 • Pengfei Cao, Chenwei Yan, Xiangling Fu, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, Weifeng Chong
In this paper, we introduce Clinical-Coder, an online system aiming to assign ICD codes to Chinese clinical notes.
1 code implementation • IJCNLP 2019 • Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu
The lack of word boundary information has been seen as one of the main obstacles to developing a high-performance Chinese named entity recognition (NER) system.
Ranked #11 on Chinese Named Entity Recognition on Weibo NER
Chinese Named Entity Recognition, Named Entity Recognition
no code implementations • IJCNLP 2019 • Xiangrong Zeng, Shizhu He, Daojian Zeng, Kang Liu, Shengping Liu, Jun Zhao
Existing works didn't consider the extraction order of relational facts in a sentence.
no code implementations • 21 Aug 2019 • Qingbin Liu, Shizhu He, Kang Liu, Shengping Liu, Jun Zhao
How to integrate the semantic information of pre-defined ontology and dialogue text (heterogeneous texts) to generate unknown values and improve performance becomes a severe challenge.
no code implementations • 20 Aug 2019 • Liang Zhao, Zhiyuan Ma, Yangming Zhou, Kai Wang, Shengping Liu, Ju Gao
Electronic health records are an important source for clinical research and applications, but errors inevitably occur in the data, which could cause severe harm to both patients and hospital services.
1 code implementation • EMNLP 2018 • Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu
However, existing methods for Chinese NER either do not exploit word boundary information from CWS or cannot filter the specific information of CWS.
Ranked #1 on Chinese Named Entity Recognition on SighanNER
Chinese Named Entity Recognition, Chinese Word Segmentation