Search Results for author: Agha Ali Raza

Found 6 papers, 2 papers with code

UQA: Corpus for Urdu Question Answering

1 code implementation2 May 2024 Samee Arif, Sualeha Farid, Awais Athar, Agha Ali Raza

This paper introduces UQA, a novel dataset for question answering and text comprehension in Urdu, a low-resource language with over 70 million native speakers.

Multilingual NLP Question Answering +1

To Label or Not to Label: Hybrid Active Learning for Neural Machine Translation

no code implementations14 Mar 2024 Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza

A weighted hybrid score that combines uncertainty and diversity is then used to select the top instances for annotation in each AL iteration.

Active Learning Domain Adaptation +4

Representative Subset Selection for Efficient Fine-Tuning in Self-Supervised Speech Recognition

no code implementations18 Mar 2022 Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza

Self-supervised speech recognition models require considerable labeled training data for learning high-fidelity representations for Automatic Speech Recognition (ASR) which is computationally demanding and time-consuming.

Active Learning Automatic Speech Recognition +2

SimplifyUR: Unsupervised Lexical Text Simplification for Urdu

no code implementations LREC 2020 Namoos Hayat Qasmi, Haris Bin Zia, Awais Athar, Agha Ali Raza

Being a low-resource language in terms of standard linguistic resources, recent text simplification approaches that rely on manually crafted simplified corpora or lexicons such as WordNet are not applicable to Urdu.

Lexical Simplification Text Simplification +1

Urdu Word Segmentation using Conditional Random Fields (CRFs)

1 code implementation COLING 2018 Haris Bin Zia, Agha Ali Raza, Awais Athar

State-of-the-art Natural Language Processing algorithms rely heavily on efficient word segmentation.

Segmentation

PronouncUR: An Urdu Pronunciation Lexicon Generator

no code implementations LREC 2018 Haris Bin Zia, Agha Ali Raza, Awais Athar

The tool predicts the pronunciation of words using a LSTM-based model trained on a handcrafted expert lexicon of around 39, 000 words and shows an accuracy of 64% upon internal evaluation.

Language Modelling speech-recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.