Search Results for author: Olga Kozlova

Found 7 papers, 4 papers with code

RuPAWS: A Russian Adversarial Dataset for Paraphrase Identification

1 code implementation • LREC 2022 • Nikita Martynov, Irina Krotova, Varvara Logacheva, Alexander Panchenko, Olga Kozlova, Nikita Semenov

We compare it to the largest available dataset for Russian ParaPhraser and show that the best available paraphrase identifiers for the Russian language fail on the RuPAWS dataset.

Paraphrase Identification

Paper
Code

Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company’s Reputation

no code implementations • EACL (BSNLP) 2021 • Nikolay Babakov, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labelling a dataset for appropriateness.

Paper
Add Code

Text Detoxification using Large Pre-trained Neural Models

1 code implementation • EMNLP 2021 • David Dale, Anton Voronov, Daryna Dementieva, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We compare our models with a number of methods for style transfer.

Style Transfer

Paper
Code

SkoltechNLP at SemEval-2021 Task 5: Leveraging Sentence-level Pre-training for Toxic Span Detection

no code implementations • SEMEVAL 2021 • David Dale, Igor Markov, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We show that fine-tuning a RoBERTa model for this problem is a strong baseline.

Sentence Toxic Spans Detection

Paper
Add Code

Methods for Detoxification of Texts for the Russian Language

3 code implementations • 19 May 2021 • Daryna Dementieva, Daniil Moskovskiy, Varvara Logacheva, David Dale, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We introduce the first study of automatic detoxification of Russian texts to combat offensive language.

Style Transfer

2,052

Paper
Code

Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

1 code implementation • 9 Mar 2021 • Nikolay Babakov, Varvara Logacheva, Olga Kozlova, Nikita Semenov, Alexander Panchenko

We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labeling a dataset for appropriateness.

Paper
Code

Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

no code implementations • EACL 2021 • Artem Shelmanov, Dmitri Puzyrev, Lyubov Kupriyanova, Denis Belyakov, Daniil Larionov, Nikita Khromov, Olga Kozlova, Ekaterina Artemova, Dmitry V. Dylov, Alexander Panchenko

Annotating training data for sequence tagging of texts is usually very time-consuming.

Active Learning Transfer Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.