no code implementations • 8 Mar 2024 • Aisha Khatun, Anisur Rahman, Md Saiful Islam, Hemayet Ahmed Chowdhury, Ayesha Tasnim
Moreover, we introduce the publicly available Bangla Authorship Attribution Dataset of 16 authors (BAAD16) containing 17, 966 sample texts and 13. 4+ million words to solve the standard dataset scarcity problem and release six variations of pre-trained language models for use in any Bangla NLP downstream task.
no code implementations • 15 Jan 2024 • Aisha Khatun, Daniel G. Brown
The widespread adoption of Large Language Models (LLMs) has become commonplace, particularly with the emergence of open-source models.
1 code implementation • 9 Jun 2023 • Aisha Khatun, Daniel G. Brown
Large language models (LLMs) have become mainstream technology with their versatile use cases and impressive performance.
no code implementations • 10 May 2023 • Piotr Sawicki, Marek Grzes, Fabricio Goes, Dan Brown, Max Peeperkorn, Aisha Khatun
This study examines the ability of GPT-3. 5, GPT-3. 5-turbo (ChatGPT) and GPT-4 models to generate poems in the style of specific authors using zero-shot and many-shot prompts (which use the maximum context length of 8192 tokens).
1 code implementation • 11 Jan 2020 • Aisha Khatun, Anisur Rahman, Md. Saiful Islam, Marium-E-Jannat
Characters are the smallest unit of text that can extract stylometric signals to determine the author of a text.
no code implementations • 11 Jan 2020 • Hemayet Ahmed Chowdhury, Md. Azizul Haque Imon, Anisur Rahman, Aisha Khatun, Md. Saiful Islam
Language models are generally employed to estimate the probability distribution of various linguistic units, making them one of the fundamental parts of natural language processing.
no code implementations • 15 Nov 2019 • Aisha Khatun, Anisur Rahman, Hemayet Ahmed Chowdhury, Md. Saiful Islam, Ayesha Tasnim
Language models are at the core of natural language processing.