no code implementations • 15 Jun 2022 • Jack FitzGerald, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, Jin Cao, Rakesh Chada, Amit Chauhan, Luoxin Chen, Anurag Dwarakanath, Satyam Dwivedi, Turan Gojayev, Karthik Gopalakrishnan, Thomas Gueudre, Dilek Hakkani-Tur, Wael Hamza, Jonathan Hueser, Kevin Martin Jose, Haidar Khan, Beiye Liu, Jianhua Lu, Alessandro Manzotti, Pradeep Natarajan, Karolina Owczarzak, Gokmen Oz, Enrico Palumbo, Charith Peris, Chandana Satya Prakash, Stephen Rawls, Andy Rosenbaum, Anjali Shenoy, Saleh Soltan, Mukund Harakere Sridhar, Liz Tan, Fabian Triefenbach, Pan Wei, Haiyang Yu, Shuai Zheng, Gokhan Tur, Prem Natarajan
We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9.3B, their subsequent distillation into smaller models ranging from 17M to 170M parameters, and their application to the Natural Language Understanding (NLU) component of a virtual assistant system. (A distillation sketch follows this entry.)
Cross-Lingual Natural Language Inference • intent-classification • +5
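The distillation stage is the most code-friendly part of this abstract. Below is a minimal PyTorch sketch of soft-label knowledge distillation in the Hinton et al. style; the `temperature` value, the pure-KL objective, and the assumption that both models emit classification logits are illustrative choices, not details taken from the paper:

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Soft-label distillation: KL divergence between the temperature-scaled
    teacher and student output distributions."""
    t = temperature
    soft_teacher = F.softmax(teacher_logits / t, dim=-1)
    log_soft_student = F.log_softmax(student_logits / t, dim=-1)
    # The t**2 factor keeps gradient magnitudes comparable across temperatures.
    return F.kl_div(log_soft_student, soft_teacher, reduction="batchmean") * (t * t)
```

In practice this term is usually mixed with a hard-label cross-entropy loss on labeled data; the weighting is another hyperparameter the abstract does not specify.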
no code implementations • NAACL 2021 • Luoxin Chen, Francisco Garcia, Varun Kumar, He Xie, Jianhua Lu
This paper presents a production Semi-Supervised Learning (SSL) pipeline based on the student-teacher framework, which leverages millions of unlabeled examples to improve performance on Natural Language Understanding (NLU) tasks.
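A common instantiation of the student-teacher framework is pseudo-labeling: a frozen teacher labels unlabeled data, and the student trains on the confident subset. A minimal sketch, assuming models that return classification logits and a hypothetical confidence `threshold`; the production pipeline described in the paper will differ in its details:

```python
import torch
import torch.nn.functional as F

def pseudo_label_batch(teacher, unlabeled_inputs, threshold=0.9):
    """Run the frozen teacher on unlabeled examples and keep only
    high-confidence predictions as pseudo-labels."""
    teacher.eval()
    with torch.no_grad():
        probs = F.softmax(teacher(unlabeled_inputs), dim=-1)
    confidence, labels = probs.max(dim=-1)
    keep = confidence >= threshold
    return unlabeled_inputs[keep], labels[keep]

def student_step(student, optimizer, inputs, pseudo_labels):
    """One supervised update of the student on the pseudo-labeled subset."""
    student.train()
    loss = F.cross_entropy(student(inputs), pseudo_labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```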
no code implementations • Findings of the Association for Computational Linguistics 2020 • Luoxin Chen, Xinyue Liu, Weitong Ruan, Jianhua Lu
Adversarial training (AT) introduces small input perturbations during training and has shown strong regularization effects on deep learning models, improving robustness (see the sketch after this entry).
Ranked #3 on Chunking on CoNLL 2000 (using extra training data)
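For text models, adversarial perturbations are typically applied to the embedding layer rather than to discrete tokens. A minimal sketch of that idea, assuming a `model` that consumes embeddings directly and an illustrative perturbation norm `epsilon`; this is a generic AT recipe, not the paper's exact formulation:

```python
import torch
import torch.nn.functional as F

def adversarial_loss(model, embeddings, labels, epsilon=1.0):
    """Adversarial training on embeddings: perturb along the loss gradient,
    then add a loss term on the perturbed batch."""
    embeddings = embeddings.detach().requires_grad_(True)
    clean_loss = F.cross_entropy(model(embeddings), labels)
    grad, = torch.autograd.grad(clean_loss, embeddings, retain_graph=True)
    # L2-normalized perturbation in the direction that increases the loss.
    delta = epsilon * grad / (grad.norm(dim=-1, keepdim=True) + 1e-8)
    adv_loss = F.cross_entropy(model(embeddings + delta.detach()), labels)
    return clean_loss + adv_loss
```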
no code implementations • ACL 2020 • Luoxin Chen, Weitong Ruan, Xinyue Liu, Jianhua Lu
Virtual adversarial training (VAT) is a powerful technique for improving model robustness in both supervised and semi-supervised settings (a sketch follows this entry).
Ranked #7 on Chunking on CoNLL 2000
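VAT differs from standard adversarial training in that it needs no labels: the perturbation is chosen to maximally change the model's own output distribution, which is why it also works on unlabeled data. A sketch under the same assumptions as above (a `model` over embeddings; `xi`, `epsilon`, and a single power-iteration step are illustrative defaults), not the paper's exact implementation:

```python
import torch
import torch.nn.functional as F

def vat_loss(model, embeddings, xi=1e-6, epsilon=1.0, n_power=1):
    """Virtual adversarial loss: find the perturbation that most changes
    the model's output distribution, without using any labels."""
    with torch.no_grad():
        clean_probs = F.softmax(model(embeddings), dim=-1)
    # Power iteration to approximate the most sensitive perturbation direction.
    d = torch.randn_like(embeddings)
    for _ in range(n_power):
        d = (xi * F.normalize(d, dim=-1)).requires_grad_(True)
        adv_log_probs = F.log_softmax(model(embeddings + d), dim=-1)
        kl = F.kl_div(adv_log_probs, clean_probs, reduction="batchmean")
        grad, = torch.autograd.grad(kl, d)
        d = grad.detach()
    # Final adversarial direction, scaled to the perturbation budget.
    r_adv = epsilon * F.normalize(d, dim=-1)
    adv_log_probs = F.log_softmax(model(embeddings + r_adv), dim=-1)
    return F.kl_div(adv_log_probs, clean_probs, reduction="batchmean")
```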