no code implementations • 2 Jun 2024 • Bar Iluz, Yanai Elazar, Asaf Yehudai, Gabriel Stanovsky
Most works on gender bias focus on intrinsic bias -- removing traces of information about a protected group from the model's internal representation.
no code implementations • 23 May 2024 • Asaf Yehudai, Taelin Karidi, Gabriel Stanovsky, Ariel Goldstein, Omri Abend
In this paper, we adapt this task from cognitive science to evaluate the conceptualization and reasoning abilities of large language models (LLMs) through a behavioral study.
no code implementations • 1 Mar 2024 • Ariel Goldstein, Gabriel Stanovsky
Recent advances in LLMs have sparked a debate on whether they understand text.
no code implementations • 21 Feb 2024 • Gili Lior, Yoav Goldberg, Gabriel Stanovsky
Document collections of various domains, e.g., legal, medical, or financial, often share some underlying collection-wide structure, which captures information that can aid both human users and structure-aware models.
1 code implementation • 25 Jan 2024 • Itay Manes, Naama Ronn, David Cohen, Ran Ilan Ber, Zehavi Horowitz-Kugler, Gabriel Stanovsky
Ensuring the accuracy of responses provided by large language models (LLMs) is crucial, particularly in clinical settings where incorrect information may directly impact patient health.
2 code implementations • 31 Dec 2023 • Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky
Recent advances in large language models (LLMs) have led to the development of various evaluation benchmarks.
1 code implementation • 21 Sep 2023 • Bar Iluz, Tomasz Limisiewicz, Gabriel Stanovsky, David Mareček
We study the effect of tokenization on gender bias in machine translation, an aspect that has been largely overlooked in previous works.
1 code implementation • 1 Aug 2023 • Itay Itzhak, Gabriel Stanovsky, Nir Rosenfeld, Yonatan Belinkov
Recent studies show that instruction tuning (IT) and reinforcement learning from human feedback (RLHF) dramatically improve the abilities of large language models (LLMs).
1 code implementation • 1 Jun 2023 • Catherine Chen, Zejiang Shen, Dan Klein, Gabriel Stanovsky, Doug Downey, Kyle Lo
Recent work has shown that infusing layout features into language models (LMs) improves processing of visually-rich documents such as scientific papers.
1 code implementation • 24 May 2023 • Gili Lior, Gabriel Stanovsky
We approach this question through the lens of the dual-process theory for human decision-making.
1 code implementation • 23 May 2023 • Fan Bai, Junmo Kang, Gabriel Stanovsky, Dayne Freitag, Alan Ritter
We use this collection of annotated tables to evaluate the ability of open-source and API-based language models to extract information from tables covering diverse domains and data formats.
Ranked #1 on Attribute Extraction on SWDE
no code implementations • 9 May 2023 • Eliya Habba, Renana Keydar, Dan Bareket, Gabriel Stanovsky
Second, we curate a manually annotated dataset of judicial assessments of victims' credibility in the Hebrew language, as well as a model that can extract credibility labels from court cases.
no code implementations • ICCV 2023 • Nitzan Bitton-Guetta, Yonatan Bitton, Jack Hessel, Ludwig Schmidt, Yuval Elovici, Gabriel Stanovsky, Roy Schwartz
We introduce WHOOPS!, a new dataset and benchmark for visual commonsense.
Ranked #1 on Image-to-Text Retrieval on WHOOPS! (using extra training data)
no code implementations • 16 Feb 2023 • Asaf Yehudai, Arie Cattan, Omri Abend, Gabriel Stanovsky
Machine translation (MT) requires a wide range of linguistic capabilities, which current end-to-end models are expected to learn implicitly by observing aligned sentences in bilingual corpora.
no code implementations • 9 Feb 2023 • Uri Berger, Lea Frermann, Gabriel Stanovsky, Omri Abend
We study the relation between visual input and linguistic choices by training classifiers to predict the probability of expressing a property from raw images, and find evidence supporting the claim that linguistic properties are constrained by visual context across languages.
1 code implementation • 8 Dec 2022 • Yonatan Bitton, Ron Yosef, Eli Strugo, Dafna Shahaf, Roy Schwartz, Gabriel Stanovsky
We leverage situation recognition annotations and the CLIP model to generate a large set of 500k candidate analogies.
Ranked #1 on Visual Reasoning on VASR
1 code implementation • 24 Oct 2022 • Keshav Kolluru, Gabriel Stanovsky, Mausam
Proper noun compounds, e.g., "Covid vaccine", convey information in a succinct manner (a "Covid vaccine" is a "vaccine that immunizes against the Covid disease").
no code implementations • 13 Oct 2022 • Tomasz Limisiewicz, Dan Malkin, Gabriel Stanovsky
Our method outperforms standard training methods in low-resource languages and retains performance on high-resource languages while using the same amount of data.
1 code implementation • 25 Jul 2022 • Yonatan Bitton, Nitzan Bitton Guetta, Ron Yosef, Yuval Elovici, Mohit Bansal, Gabriel Stanovsky, Roy Schwartz
While vision-and-language models perform well on tasks such as visual question answering, they struggle when it comes to basic human commonsense reasoning skills.
Ranked #1 on Common Sense Reasoning on WinoGAViL
1 code implementation • NAACL 2022 • Uri Berger, Gabriel Stanovsky, Omri Abend, Lea Frermann
Recent advances in self-supervised modeling of text and images open new opportunities for computational models of child language acquisition, which is believed to rely heavily on cross-modal signals.
1 code implementation • NAACL 2022 • Dan Malkin, Tomasz Limisiewicz, Gabriel Stanovsky
We show that the choice of pretraining languages affects downstream cross-lingual transfer for BERT-based models.
no code implementations • Findings (NAACL) 2022 • Roy Schwartz, Gabriel Stanovsky
Recent work has shown that deep learning models in NLP are highly sensitive to low-level correlations between simple features and specific output labels, leading to overfitting and lack of generalization.
no code implementations • EMNLP (NLLP) 2021 • Mohr Wenger, Tom Kalir, Noga Berger, Carmit Chalamish, Renana Keydar, Gabriel Stanovsky
We present the task of Automated Punishment Extraction (APE) in sentencing decisions from criminal court cases in Hebrew.
1 code implementation • EMNLP 2021 • Koren Lazar, Benny Saret, Asaf Yehudai, Wayne Horowitz, Nathan Wasserman, Gabriel Stanovsky
We present models which complete missing text given transliterations of ancient Mesopotamian documents, originally written on cuneiform clay tablets (2500 BCE - 100 CE).
1 code implementation • Findings (EMNLP) 2021 • Shahar Levy, Koren Lazar, Gabriel Stanovsky
We manually verify the quality of our corpus and use it to evaluate gender bias in various coreference resolution and machine translation models.
1 code implementation • Findings (EMNLP) 2021 • Yonatan Bitton, Gabriel Stanovsky, Michael Elhadad, Roy Schwartz
We investigate a range of alternative masking strategies specific to the cross-modal setting that address these shortcomings, aiming for better fusion of text and image in the learned representation.
1 code implementation • Joint Conference on Lexical and Computational Semantics 2021 • Arie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi, Ido Dagan
We point out that common evaluation practices for cross-document coreference resolution have been unrealistically permissive in their assumed settings, yielding inflated results.
Cross-Document Coreference Resolution
1 code implementation • Findings (ACL) 2021 • Arie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi, Ido Dagan
Here, we introduce the first end-to-end model for CD coreference resolution from raw text, which extends the prominent model for within-document coreference to the CD setting.
Cross-Document Coreference Resolution
2 code implementations • NAACL 2021 • Yonatan Bitton, Gabriel Stanovsky, Roy Schwartz, Michael Elhadad
Recent works have shown that supervised models often exploit data artifacts to achieve good test scores while their performance severely degrades on samples outside their training distribution.
2 code implementations • EACL 2021 • Ronen Tamari, Fan Bai, Alan Ritter, Gabriel Stanovsky
We develop Process Execution Graphs (PEG), a document-level representation of real-world wet lab biochemistry protocols, addressing challenges such as cross-sentence relations, long-range coreference, grounding, and implicit arguments.
2 code implementations • 17 Jan 2021 • Daniel Khashabi, Gabriel Stanovsky, Jonathan Bragg, Nicholas Lourie, Jungo Kasai, Yejin Choi, Noah A. Smith, Daniel S. Weld
While often assumed a gold standard, effective human evaluation of text generation remains an important, open area for research.
1 code implementation • WMT (EMNLP) 2020 • Tom Kocmi, Tomasz Limisiewicz, Gabriel Stanovsky
Our work presents the largest-scale evidence for the phenomenon, covering more than 19 systems submitted to the WMT shared task across four diverse target languages: Czech, German, Polish, and Russian.
1 code implementation • EMNLP 2020 • Anthony Chen, Gabriel Stanovsky, Sameer Singh, Matt Gardner
Posing reading comprehension as a generation problem provides a great deal of flexibility, allowing for open-ended questions with few restrictions on possible answers.
2 code implementations • 23 Sep 2020 • Arie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi, Ido Dagan
Recent evaluation protocols for Cross-document (CD) coreference resolution have often been inconsistent or lenient, leading to incomparable results across works and overestimation of performance.
Cross-Document Coreference Resolution +2
1 code implementation • ACL 2020 • Belinda Z. Li, Gabriel Stanovsky, Luke Zettlemoyer
We improve upon pairwise annotation for active learning in coreference resolution, by asking annotators to identify mention antecedents if a presented mention pair is deemed not coreferent.
1 code implementation • ACL 2020 • Roy Schwartz, Gabriel Stanovsky, Swabha Swayamdipta, Jesse Dodge, Noah A. Smith
Our method presents a favorable speed/accuracy tradeoff in almost all cases, producing models which are up to five times faster than the state of the art, while preserving their accuracy.
no code implementations • 10 Mar 2020 • Ronen Tamari, Gabriel Stanovsky, Dafna Shahaf, Reut Tsarfaty
Large-scale natural language understanding (NLU) systems have made impressive progress: they can be applied flexibly across a variety of tasks, and employ minimal structural assumptions.
1 code implementation • ACL 2020 • Paul Roit, Ayal Klein, Daniela Stepanov, Jonathan Mamou, Julian Michael, Gabriel Stanovsky, Luke Zettlemoyer, Ido Dagan
Question-answer driven Semantic Role Labeling (QA-SRL) was proposed as an attractive open and natural flavour of SRL, potentially attainable from laymen.
no code implementations • WS 2019 • Gabriel Stanovsky, Ronen Tamari
Distinguishing between singular and plural {``}you{''} in English is a challenging task which has potential for downstream applications, such as machine translation or coreference resolution.
no code implementations • WS 2019 • Anthony Chen, Gabriel Stanovsky, Sameer Singh, Matt Gardner
Our study suggests that while current metrics may be suitable for existing QA datasets, they limit the complexity of QA datasets that can be created.
1 code implementation • 26 Oct 2019 • Gabriel Stanovsky, Ronen Tamari
Distinguishing between singular and plural "you" in English is a challenging task which has potential for downstream applications, such as machine translation or coreference resolution.
no code implementations • CONLL 2019 • Omri Koshorek, Gabriel Stanovsky, Yichu Zhou, Vivek Srikumar, Jonathan Berant
We conclude that the current applicability of LTAL for improving data efficiency in learning semantic meaning representations is limited.
Learning Semantic Representations • Natural Language Understanding
1 code implementation • ACL 2019 • Gabriel Stanovsky, Noah A. Smith, Luke Zettlemoyer
We present the first challenge set and evaluation protocol for the analysis of gender bias in machine translation (MT).
1 code implementation • SEMEVAL 2019 • Mark Hopkins, Ronan Le Bras, Cristian Petrescu-Prahova, Gabriel Stanovsky, Hannaneh Hajishirzi, Rik Koncel-Kedziorski
Systems were evaluated based on the percentage of correctly answered questions.
3 code implementations • NAACL 2019 • Dheeru Dua, Yizhong Wang, Pradeep Dasigi, Gabriel Stanovsky, Sameer Singh, Matt Gardner
We introduce a new English reading comprehension benchmark, DROP, which requires Discrete Reasoning Over the content of Paragraphs.
Ranked #14 on Question Answering on DROP Test
no code implementations • EMNLP 2018 • Gabriel Stanovsky, Ido Dagan
We propose a novel approach to semantic dependency parsing (SDP) by casting the task as an instance of multi-lingual machine translation, where each semantic representation is a different foreign dialect.
no code implementations • EMNLP 2018 • Gabriel Stanovsky, Mark Hopkins
We propose Odd-Man-Out, a novel task which aims to test different properties of word representations.
no code implementations • NAACL 2018 • Gabriel Stanovsky, Julian Michael, Luke Zettlemoyer, Ido Dagan
We present data and methods that enable a supervised learning approach to Open Information Extraction (Open IE).
1 code implementation • NAACL 2018 • Julian Michael, Gabriel Stanovsky, Luheng He, Ido Dagan, Luke Zettlemoyer
We introduce Question-Answer Meaning Representations (QAMRs), which represent the predicate-argument structure of a sentence as a set of question-answer pairs.
no code implementations • SEMEVAL 2017 • Vered Shwartz, Gabriel Stanovsky, Ido Dagan
We present a simple method for ever-growing extraction of predicate paraphrases from news headlines in Twitter.
no code implementations • ACL 2017 • Gabriel Stanovsky, Judith Eckle-Kohler, Yevgeniy Puzikov, Ido Dagan, Iryna Gurevych
Previous models for the assessment of commitment towards a predicate in a sentence (also known as factuality prediction) were trained and tested against a specific annotated dataset, subsequently limiting the generality of their results.
no code implementations • EACL 2017 • Gabriel Stanovsky, Daniel Gruhl, Pablo Mendes
Recognizing mentions of Adverse Drug Reactions (ADR) in social media is challenging: ADR mentions are context-dependent and include long, varied and unconventional descriptions as compared to more formal medical symptom terminology.
1 code implementation • WS 2017 • Rachel Wities, Vered Shwartz, Gabriel Stanovsky, Meni Adler, Ori Shapira, Shyam Upadhyay, Dan Roth, Eugenio Martinez Camara, Iryna Gurevych, Ido Dagan
We propose to move from Open Information Extraction (OIE) ahead to Open Knowledge Representation (OKR), aiming to represent information conveyed jointly in a set of texts in an open text-based manner.
no code implementations • COLING 2016 • Omer Levy, Ido Dagan, Gabriel Stanovsky, Judith Eckle-Kohler, Iryna Gurevych
Sentence intersection captures the semantic overlap of two texts, generalizing over paradigms such as textual entailment and semantic text similarity.
Abstractive Text Summarization • Natural Language Inference +2
no code implementations • 4 Mar 2016 • Gabriel Stanovsky, Jessica Ficler, Ido Dagan, Yoav Goldberg
Semantic NLP applications often rely on dependency trees to recognize major elements of the proposition structure of sentences.
Ranked #27 on Open Information Extraction on CaRB