no code implementations • 16 Feb 2024 • John Dougrez-Lewis, Mahmud Elahi Akhter, Yulan He, Maria Liakata
Our study contributes to the growing body of research suggesting that ChatGPT's reasoning processes are unlikely to mirror human-like reasoning, and that LLMs need to be more rigorously evaluated to distinguish between hype and actual capabilities, especially in high-stakes real-world tasks such as claim verification.
1 code implementation • 29 Jan 2022 • Ibraheem Muhammad Moosa, Mahmud Elahi Akhter, Ashfia Binte Habib
We empirically measure the effect of transliteration on MLLMs in this context.
Language Modelling • Multiple Choice Question Answering (MCQA) +5
no code implementations • 29 Sep 2021 • Ibraheem Muhammad Moosa, Mahmud Elahi Akhter, Ashfia Binte Habib
In addition, XLM-Indic establishes new SOTA results for most tasks on the IndicGLUE benchmark while remaining competitive on the rest.