1 code implementation • 18 Jun 2023 • Yan Zhuang, Qi Liu, Yuting Ning, Weizhe Huang, Rui Lv, Zhenya Huang, Guanhao Zhao, Zheng Zhang, Qingyang Mao, Shijin Wang, Enhong Chen
Giving different models different tests through efficient adaptive testing has, we believe, the potential to become a new norm in evaluating large language models.