no code implementations • EAMT 2022 • Ricardo Rei, Ana C Farinha, José G.C. de Souza, Pedro G. Ramos, André F.T. Martins, Luisa Coheur, Alon Lavie
In recent years, several neural fine-tuned machine translation evaluation metrics such as COMET and BLEURT have been proposed.
no code implementations • EAMT 2022 • Miguel Menezes, Vera Cabarrão, Pedro Mota, Helena Moniz, Alon Lavie
This paper describes research developed at Unbabel, a Portuguese machine-translation start-up that combines MT with human post-edition, focusing strictly on customer service content.
no code implementations • EAMT 2022 • Madalena Gonçalves, Marianna Buchicchio, Craig Stewart, Helena Moniz, Alon Lavie
This paper illustrates a new evaluation framework developed at Unbabel for measuring the quality of source language text and its effect on both Machine Translation (MT) and Human Post-Edition (PE) performed by non-professional post-editors.
no code implementations • WMT (EMNLP) 2020 • Ricardo Rei, Craig Stewart, Ana C Farinha, Alon Lavie
We present the contribution of the Unbabel team to the WMT 2020 Shared Task on Metrics.
no code implementations • WMT (EMNLP) 2021 • Markus Freitag, Ricardo Rei, Nitika Mathur, Chi-kiu Lo, Craig Stewart, George Foster, Alon Lavie, Ondřej Bojar
Contrary to previous years’ editions, this year we acquired our own human ratings based on expert-based human evaluation via Multidimensional Quality Metrics (MQM).
1 code implementation • WMT (EMNLP) 2021 • Ricardo Rei, Ana C Farinha, Chrysoula Zerva, Daan van Stigt, Craig Stewart, Pedro Ramos, Taisiya Glushkova, André F. T. Martins, Alon Lavie
In this paper, we present the joint contribution of Unbabel and IST to the WMT 2021 Metrics Shared Task.
1 code implementation • SIGDIAL (ACL) 2022 • John Mendonca, Alon Lavie, Isabel Trancoso
Despite considerable advances in open-domain neural dialogue systems, their evaluation remains a bottleneck.
1 code implementation • 23 Nov 2023 • John Mendonça, Patrícia Pereira, Miguel Menezes, Vera Cabarrão, Ana C. Farinha, Helena Moniz, João Paulo Carvalho, Alon Lavie, Isabel Trancoso
Task-oriented conversational datasets often lack topic variability and linguistic diversity.
1 code implementation • 31 Aug 2023 • John Mendonça, Patrícia Pereira, Helena Moniz, João Paulo Carvalho, Alon Lavie, Isabel Trancoso
Despite significant research effort in the development of automatic dialogue evaluation metrics, little thought is given to evaluating dialogues other than in English.
1 code implementation • 31 Aug 2023 • John Mendonça, Alon Lavie, Isabel Trancoso
The main limiting factors in the development of robust multilingual dialogue evaluation metrics are the lack of multilingual data and the limited availability of open-source multilingual dialogue systems.
1 code implementation • 19 May 2023 • Ricardo Rei, Nuno M. Guerreiro, Marcos Treviso, Luisa Coheur, Alon Lavie, André F. T. Martins
Neural metrics for machine translation evaluation, such as COMET, exhibit significant improvements in their correlation with human judgments, as compared to traditional metrics based on lexical overlap, such as BLEU.
no code implementations • 27 Apr 2023 • Hendrik Kempt, Alon Lavie, Saskia K. Nagel
To address this limitation, in this paper we argue for restricting chatbots to a range of topics they can chat about, according to the normative concept of appropriateness.
1 code implementation • 13 Sep 2022 • Ricardo Rei, Marcos Treviso, Nuno M. Guerreiro, Chrysoula Zerva, Ana C. Farinha, Christine Maroti, José G. C. de Souza, Taisiya Glushkova, Duarte M. Alves, Alon Lavie, Luisa Coheur, André F. T. Martins
We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE).
no code implementations • ACL 2021 • Ricardo Rei, Ana C Farinha, Craig Stewart, Luisa Coheur, Alon Lavie
We present MT-Telescope, a visualization platform designed to facilitate comparative analysis of the output quality of two Machine Translation (MT) systems.
1 code implementation • 29 Oct 2020 • Ricardo Rei, Craig Stewart, Catarina Farinha, Alon Lavie
Overall, our systems achieve strong results for all language pairs on previous test sets and in many cases set a new state-of-the-art.
1 code implementation • EMNLP 2020 • Ricardo Rei, Craig Stewart, Ana C Farinha, Alon Lavie
We present COMET, a neural framework for training multilingual machine translation evaluation models which obtains new state-of-the-art levels of correlation with human judgements.
no code implementations • TACL 2014 • Jonathan H. Clark, Chris Dyer, Alon Lavie
Linear models, which support efficient learning and inference, are the workhorses of statistical machine translation; however, linear decision rules are less attractive from a modeling perspective.