1 code implementation • 20 Dec 2023 • Dan Shi, Chaobin You, Jiantao Huang, Taihao Li, Deyi Xiong
With these pre-defined domains and slots, we collect 76, 787 commonsense knowledge annotations from 19, 700 dialogues through crowdsourcing.
1 code implementation • 30 Oct 2023 • Zishan Guo, Renren Jin, Chuang Liu, Yufei Huang, Dan Shi, Supryadi, Linhao Yu, Yan Liu, Jiaxuan Li, Bojian Xiong, Deyi Xiong
We hope that this comprehensive overview will stimulate further research interests in the evaluation of LLMs, with the ultimate goal of making evaluation serve as a cornerstone in guiding the responsible development of LLMs.