Search Results for author: Gaurish Thakkar

Accessing and understanding contemporary and historical events of global impact such as the US elections and the Olympic Games is a major prerequisite for cross-lingual event analytics that investigate event causes, perception and consequences across country borders.

Image Retrieval Knowledge Graphs +5

Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study

no code implementations • 14 Dec 2022 • Jelena Sarajlić, Gaurish Thakkar, Diego Alves, Nives Mikelic Preradović

This paper presents a corpus annotated for the task of direct-speech extraction in Croatian.

coreference-resolution Speech Extraction

Building and Evaluating Universal Named-Entity Recognition English corpus

no code implementations • 14 Dec 2022 • Diego Alves, Gaurish Thakkar, Marko Tadić

This article presents the application of the Universal Named Entity framework to generate automatically annotated corpora.

named-entity-recognition Named Entity Recognition +1

Multi-task Learning for Cross-Lingual Sentiment Analysis

1 code implementation • 14 Dec 2022 • Gaurish Thakkar, Nives Mikelic Preradovic, Marko Tadic

This paper presents a cross-lingual sentiment analysis of news articles using zero-shot and few-shot learning.

Few-Shot Learning Multi-Task Learning +2

Building Multilingual Corpora for a Complex Named Entity Recognition and Classification Hierarchy using Wikipedia and DBpedia

no code implementations • 14 Dec 2022 • Diego Alves, Gaurish Thakkar, Gabriel Amaral, Tin Kuculo, Marko Tadić

With the ever-growing popularity of the field of NLP, the demand for datasets in low-resourced languages follows suit.

Named Entity Recognition Named Entity Recognition (NER)

Natural Language Processing Chains Inside a Cross-lingual Event-Centric Knowledge Pipeline for European Union Under-resourced Languages

no code implementations • LREC 2020 • Diego Alves, Gaurish Thakkar, Marko Tadić

Due to the differences in terms of availability of language resources for each language, we have built this strategy in three steps, starting with processing chains for the well-resourced languages and finishing with the development of new modules for the under-resourced ones.

named-entity-recognition Named Entity Recognition +1

Evaluating Language Tools for Fifteen EU-official Under-resourced Languages

no code implementations • LREC 2020 • Diego Alves, Gaurish Thakkar, Marko Tadić

We treated a difference of less than one percentage point between the reported results and our tested results as within acceptable tolerance, and therefore considered such a result reproducible.

UNER: Universal Named-Entity Recognition Framework

no code implementations • 23 Oct 2020 • Diego Alves, Tin Kuculo, Gabriel Amaral, Gaurish Thakkar, Marko Tadic

We introduce the Universal Named-Entity Recognition (UNER) framework, a 4-level classification hierarchy, and the methodology that is being adopted to create the first multilingual UNER corpus: the SETimes parallel corpus annotated for named-entities.

Knowledge Graphs named-entity-recognition +2

Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets

1 code implementation • 23 Oct 2020 • Gaurish Thakkar, Marcis Pinnis

In this paper, we present various pre-training strategies that aid in improving the accuracy of the sentiment classification task.

Sentiment Analysis Sentiment Classification

Towards Normalising Konkani-English Code-Mixed Social Media Text

no code implementations • WS 2017 • Akshata Phadte, Gaurish Thakkar
