no code implementations • 30 May 2024 • Eric Chamoun, Michael Schlichtkrull, Andreas Vlachos
In this paper, we propose a novel task: automated focused feedback generation for scientific writing assistance.
1 code implementation • 4 Apr 2024 • Zhangdie Yuan, Chenxi Whitehouse, Eric Chamoun, Rami Aly, Andreas Vlachos
This paper introduces PRobELM (Plausibility Ranking Evaluation for Language Models), a benchmark designed to assess language models' ability to discern more plausible from less plausible scenarios through their parametric knowledge.
no code implementations • 14 Nov 2023 • Eric Chamoun, Marzieh Saeidi, Andreas Vlachos
Prior research has shown that typical fact-checking models for stand-alone claims struggle with claims made in dialogues.