no code implementations • 20 Feb 2024 • Branislav Pecher, Ivan Srba, Maria Bielikova
When performance variance is taken into consideration, the number of required labels increases on average by $100 - 200\%$ and even up to $1500\%$ in specific cases.
no code implementations • 20 Feb 2024 • Branislav Pecher, Ivan Srba, Maria Bielikova
To measure the true effects of an individual randomness factor, our method mitigates the effects of other factors and observes how the performance varies across multiple runs.
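The investigation procedure described above can be illustrated with a minimal sketch: vary only the seed of the investigated randomness factor across runs while holding the other factors fixed, then repeat over several fixed configurations to mitigate their influence. The `run` function below is a toy surrogate for an actual training-and-evaluation run, and all function names are hypothetical, not the authors' implementation.

```python
import random
import statistics

def run(investigated_seed, fixed_seed):
    # Toy surrogate for training + evaluating a model; a real run would
    # train with these seeds controlling e.g. data order vs. initialization.
    rng = random.Random(f"{investigated_seed}-{fixed_seed}")
    return 0.80 + rng.uniform(-0.05, 0.05)

def factor_std(n_runs=10, n_mitigations=5):
    # For each fixed configuration of the other factors (mitigation),
    # vary only the investigated factor and measure performance spread.
    stds = []
    for fixed_seed in range(n_mitigations):
        scores = [run(s, fixed_seed) for s in range(n_runs)]
        stds.append(statistics.stdev(scores))
    # Average over mitigation configurations to isolate the factor's effect.
    return statistics.mean(stds)
```

Averaging the per-configuration standard deviations separates the variance attributable to the investigated factor from variance induced by the others.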
no code implementations • 5 Feb 2024 • Branislav Pecher, Ivan Srba, Maria Bielikova, Joaquin Vanschoren
In few-shot learning, such as meta-learning, few-shot fine-tuning or in-context learning, the limited number of samples used to train a model has a significant impact on the overall success.
no code implementations • 15 Jan 2024 • Dominik Macko, Robert Moro, Adaku Uchendu, Ivan Srba, Jason Samuel Lucas, Michiharu Yamashita, Nafis Irtiza Tripto, Dongwon Lee, Jakub Simko, Maria Bielikova
However, it is susceptible to authorship obfuscation (AO) methods, such as paraphrasing, which can cause MGTs to evade detection.
1 code implementation • 12 Jan 2024 • Jan Cegin, Branislav Pecher, Jakub Simko, Ivan Srba, Maria Bielikova, Peter Brusilovsky
The latest generative large language models (LLMs) have found their application in data augmentation tasks, where small numbers of text samples are LLM-paraphrased and then used to fine-tune downstream models.
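The augmentation setup described here can be sketched as a small pipeline: each labelled seed sample is expanded with LLM-generated paraphrases that inherit the original label, and the enlarged set is then used for fine-tuning. In this minimal sketch the `paraphrase` function is a hypothetical stand-in for an actual LLM call.

```python
import random

def paraphrase(text, n=3, seed=0):
    # Hypothetical stand-in for an LLM paraphrasing call; a real pipeline
    # would query a generative model here.
    rng = random.Random(seed)
    templates = ["In other words: {}", "Put differently: {}", "That is: {}"]
    return [rng.choice(templates).format(text) for _ in range(n)]

def augment(dataset, n_per_sample=3):
    # Keep the originals and append label-preserving paraphrases.
    augmented = list(dataset)
    for text, label in dataset:
        for p in paraphrase(text, n=n_per_sample):
            augmented.append((p, label))
    return augmented

seed_data = [("the movie was great", "pos"), ("terrible acting", "neg")]
aug = augment(seed_data)  # 2 originals + 3 paraphrases each
```

The key property is that labels are carried over unchanged, so only the input distribution is diversified before fine-tuning the downstream model.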
no code implementations • 2 Dec 2023 • Branislav Pecher, Ivan Srba, Maria Bielikova
Recently, this area has started to attract research attention and the number of relevant studies is continuously growing.
1 code implementation • 15 Nov 2023 • Ivan Vykopal, Matúš Pikuliak, Ivan Srba, Robert Moro, Dominik Macko, Maria Bielikova
Automated disinformation generation is often listed as an important risk associated with large language models (LLMs).
no code implementations • 14 Nov 2023 • Nafis Irtiza Tripto, Saranya Venkatraman, Dominik Macko, Robert Moro, Ivan Srba, Adaku Uchendu, Thai Le, Dongwon Lee
In the realm of text manipulation and linguistic transformation, the question of authorship has always been a subject of fascination and philosophical inquiry.
no code implementations • 10 Nov 2023 • Martin Hyben, Sebastian Kula, Ivan Srba, Robert Moro, Jakub Simko
This study compares the performance of (1) fine-tuned models and (2) extremely large language models on the task of check-worthy claim detection.
1 code implementation • 20 Oct 2023 • Dominik Macko, Robert Moro, Adaku Uchendu, Jason Samuel Lucas, Michiharu Yamashita, Matúš Pikuliak, Ivan Srba, Thai Le, Dongwon Lee, Jakub Simko, Maria Bielikova
There is a lack of research into the capabilities of recent LLMs to generate convincing text in languages other than English, and into the performance of machine-generated text detectors in multilingual settings.
1 code implementation • 14 Aug 2023 • Olesya Razuvayevskaya, Ben Wu, Joao A. Leite, Freddy Heppell, Ivan Srba, Carolina Scarton, Kalina Bontcheva, Xingyi Song
Adapters and Low-Rank Adaptation (LoRA) are parameter-efficient fine-tuning techniques designed to make the training of language models more efficient.
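The core idea behind LoRA can be shown in a few lines of plain Python: the frozen weight matrix `W` is augmented with a low-rank update `(alpha / r) * B @ A`, so only the small matrices `A` and `B` are trained. This is a minimal illustration of the rank decomposition, with toy dimensions chosen for readability, not the authors' experimental configuration.

```python
def matmul(A, B):
    # Plain-Python matrix multiplication for small illustrative matrices.
    cols = list(zip(*B))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in A]

d, r, alpha = 4, 1, 2           # toy model dim, LoRA rank, scaling factor
W = [[1.0 if i == j else 0.0 for j in range(d)] for i in range(d)]  # frozen
B = [[0.1] for _ in range(d)]   # d x r, trainable, typically init to zero
A = [[0.2, 0.0, 0.0, 0.0]]      # r x d, trainable

scale = alpha / r
delta = matmul(B, A)            # rank-r update, shape d x d
W_merged = [[W[i][j] + scale * delta[i][j] for j in range(d)] for i in range(d)]

full_params = d * d             # parameters updated by full fine-tuning
lora_params = d * r + r * d     # parameters updated by LoRA
```

With rank `r` much smaller than `d`, the trainable parameter count drops from `d*d` to `2*d*r`, which is what makes the technique parameter-efficient; after training, the update can be merged into `W` so inference incurs no extra cost.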
1 code implementation • 13 May 2023 • Matúš Pikuliak, Ivan Srba, Robert Moro, Timo Hromadka, Timotej Smolen, Martin Melisek, Ivan Vykopal, Jakub Simko, Juraj Podrouzek, Maria Bielikova
Fact-checkers are often hampered by the sheer amount of online content that needs to be fact-checked.
1 code implementation • 24 Apr 2023 • Timo Hromadka, Timotej Smolen, Tomas Remis, Branislav Pecher, Ivan Srba
This paper presents the best-performing solution to Subtask 3 of SemEval 2023 Task 3, dedicated to persuasion technique detection.
no code implementations • 22 Nov 2022 • Andrea Hrckova, Robert Moro, Ivan Srba, Jakub Simko, Maria Bielikova
Second, we have identified fact-checkers' needs and pains, focusing on so-far-unexplored dimensions and emphasizing the needs of fact-checkers from Central and Eastern Europe as well as from low-resource language groups; these findings have implications for the development of new resources (datasets) as well as for the focus of AI research in this domain.
1 code implementation • 18 Oct 2022 • Ivan Srba, Robert Moro, Matus Tomlein, Branislav Pecher, Jakub Simko, Elena Stefancova, Michal Kompan, Andrea Hrckova, Juraj Podrouzek, Adrian Gavornik, Maria Bielikova
We also observe a sudden decrease of the misinformation filter bubble effect when misinformation-debunking videos are watched after misinformation-promoting videos, suggesting a strong contextuality of recommendations.
1 code implementation • 26 Apr 2022 • Ivan Srba, Branislav Pecher, Matus Tomlein, Robert Moro, Elena Stefancova, Jakub Simko, Maria Bielikova
It also contains 573 manually and more than 51k automatically labelled mappings between claims and articles.
1 code implementation • 25 Mar 2022 • Matus Tomlein, Branislav Pecher, Jakub Simko, Ivan Srba, Robert Moro, Elena Stefancova, Michal Kompan, Andrea Hrckova, Juraj Podrouzek, Maria Bielikova
We present a study in which pre-programmed agents (acting as YouTube users) delve into misinformation filter bubbles by watching misinformation-promoting content (on various topics).