no code implementations • 24 Apr 2024 • Dongryeol Lee, Minwoo Lee, Kyungmin Min, Joonsuk Park, Kyomin Jung
Recently, directly using large language models (LLMs) has been shown to be the most reliable method to evaluate QA models.