Dropout is a regularization technique for neural networks that, at training time, retains each unit (along with its connections) with a specified probability $p$ and drops it otherwise (a common value is $p=0.5$). At test time, all units are present, but with weights scaled by $p$ (i.e. $w$ becomes $pw$), so that the expected input to each downstream unit matches its value during training.
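The train/test behaviour can be illustrated with a minimal sketch. It is not the reference implementation from the source paper; the names `dropout_train`, `dropout_test` and `p_keep` are hypothetical, and `p_keep` is assumed to be the probability that a unit is retained. Scaling a unit's output by `p_keep` at test time is equivalent to scaling its outgoing weights by the same factor.

```python
import random


def dropout_train(activations, p_keep, rng):
    """Training-time pass: keep each unit with probability p_keep, zero it otherwise."""
    return [a if rng.random() < p_keep else 0.0 for a in activations]


def dropout_test(activations, p_keep):
    """Test-time pass: every unit is present, scaled by the retention probability."""
    return [a * p_keep for a in activations]


rng = random.Random(0)          # fixed seed so the sketch is reproducible
activations = [1.0] * 100_000   # toy layer output, all ones (illustrative data)
p_keep = 0.5                    # assumed retention probability

# Average activation under random masking versus under test-time scaling.
train_mean = sum(dropout_train(activations, p_keep, rng)) / len(activations)
test_mean = sum(dropout_test(activations, p_keep)) / len(activations)

print(f"mean under training-time masking: {train_mean:.3f}")
print(f"mean under test-time scaling:     {test_mean:.3f}")
```

The two means should agree up to sampling noise, which is the reason for the test-time scaling.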

The idea is to prevent co-adaptation, where the neural network becomes too reliant on particular connections, as this could be symptomatic of overfitting. Intuitively, dropout can be thought of as creating an implicit ensemble of neural networks.
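The ensemble view can be written out as a short sketch. The symbols are introduced here for illustration only: $q$ is assumed to be the probability that a unit is retained, $n$ the number of units subject to dropout, and $m$ the binary mask sampled at a training step.

```latex
m = (m_1, \dots, m_n), \qquad m_i \sim \mathrm{Bernoulli}(q) \ \text{independently}, \qquad m \in \{0,1\}^n
% Each mask selects one "thinned" sub-network, so there are 2^n of them, all sharing weights.
\mathbb{E}\!\left[ m_i \, w_i \, x_i \right] = q \, w_i \, x_i
% The expected contribution of unit i equals that of a always-present unit with weight q w_i.
```

Under these assumptions, using all units with scaled weights at test time acts as an inexpensive approximation to averaging the predictions of the $2^n$ weight-sharing sub-networks.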

Source: Dropout: A Simple Way to Prevent Neural Networks from Overfitting

Latest Papers

PAPER DATE
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Yuankai QiZizheng PanYicong HongMing-Hsuan YangAnton Van Den HengelQi Wu
2021-04-09
Knowledge-Aware Graph-Enhanced GPT-2 for Dialogue State Tracking
Weizhe LinBo-Hsian TsengBill Byrne
2021-04-09
DenResCov-19: A deep transfer learning network for robust automatic classification of COVID-19, pneumonia, and tuberculosis from X-rays
Michail MamalakisAndrew J. SwiftBart VorselaarsSurajit RaySimonne WeeksWeiping DingRichard H. ClaytonLouise S. MackenzieAbhirup Banerjee
2021-04-08
Lone Pine at SemEval-2021 Task 5: Fine-Grained Detection of Hate Speech Using BERToxic
Yakoob KhanWeicheng MaSoroush Vosoughi
2021-04-08
A transfer-learning approach for lesion detection in endoscopic images from the urinary tract
Jorge F. LazoSara MocciaAldo MarzulloMichele CatellaniOttavio De CobelliBenoit RosaMichel de MathelinElena De Momi
2021-04-08
Revisiting Simple Neural Probabilistic Language Models
Simeng SunMohit Iyyer
2021-04-08
Uppsala NLP at SemEval-2021 Task 2: Multilingual Language Models for Fine-tuning and Feature Extraction in Word-in-Context Disambiguation
Huiling YouXingran ZhuSara Stymne
2021-04-08
Probing BERT in Hyperbolic Spaces
| Boli ChenYao FuGuangwei XuPengjun XieChuanqi TanMosha ChenLiping Jing
2021-04-08
Facial Attribute Transformers for Precise and Robust Makeup Transfer
Zhaoyi WanHaoran ChenJielei ZhangWentao JiangCong YaoJiebo Luo
2021-04-07
LI-Net: Large-Pose Identity-Preserving Face Reenactment Network
Jin LiuPeng ChenTao LiangZhaoxing LiCai YuShuqiao ZouJiao DaiJizhong Han
2021-04-07
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
Zhicheng HuangZhaoyang ZengYupan HuangBei LiuDongmei FuJianlong Fu
2021-04-07
Analysis Towards Classification of Infection and Ischaemia of Diabetic Foot Ulcers
Moi Hoon YapBill CassidyJoseph M. PappachanClaire O'SheaDavid GillespieNeil Reeves
2021-04-07
Sparse Oblique Decision Trees: A Tool to Understand and Manipulate Neural Net Features
Suryabhan Singh HadaMiguel Á. Carreira-PerpiñánArman Zharmagambetov
2021-04-07
Combining Pre-trained Word Embeddings and Linguistic Features for Sequential Metaphor Identification
Rui MaoChenghua LinFrank Guerin
2021-04-07
Interpreting A Pre-trained Model Is A Key For Model Architecture Optimization: A Case Study On Wav2Vec 2.0
Liu ChenMeysam Asgari
2021-04-07
Better Neural Machine Translation by Extracting Linguistic Information from BERT
| Hassan S. ShavaraniAnoop Sarkar
2021-04-07
Interpreting Verbal Metaphors by Paraphrasing
Rui MaoChenghua LinFrank Guerin
2021-04-07
MuSLCAT: Multi-Scale Multi-Level Convolutional Attention Transformer for Discriminative Music Modeling on Raw Waveforms
Kai MiddlebrookShyam SudhakaranDavid Guy Brizan
2021-04-06
hBert + BiasCorp -- Fighting Racism on the Web
Olawale OnabolaZhuang MaYang XieBenjamin AkeraAbdulrahman IbraheemJia XueDianbo LiuYoshua Bengio
2021-04-06
Attention Head Masking for Inference Time Content Selection in Abstractive Summarization
Shuyang CaoLu Wang
2021-04-06
Fourier Image Transformer
| Tim-Oliver BuchholzFlorian Jug
2021-04-06
Variational Transformer Networks for Layout Generation
Diego Martin ArroyoJanis PostelsFederico Tombari
2021-04-06
Content-Aware GAN Compression
Yuchen LiuZhixin ShuYijun LiZhe LinFederico PerazziS. Y. Kung
2021-04-06
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Anton MitrofanovMariya KorenevskayaIvan PodluzhnyYuri KhokhlovAleksandr LaptevAndrei AndrusenkoAleksei IlinMaxim KorenevskyIvan MedennikovAleksei Romanenko
2021-04-06
ODE Transformer: An Ordinary Differential Equation-Inspired Model for Neural Machine Translation
Bei LiQuan DuTao ZhouShuhan ZhouXin ZengTong XiaoJingbo Zhu
2021-04-06
CodeTrans: Towards Cracking the Language of Silicone's Code Through Self-Supervised Deep Learning and High Performance Computing
| Ahmed ElnaggarWei DingLlion JonesTom GibbsTamas FeherChristoph AngererSilvia SeveriniFlorian MatthesBurkhard Rost
2021-04-06
Efficient transfer learning for NLP with ELECTRA
| François Mercier
2021-04-06
Variable selection with missing data in both covariates and outcomes: Imputation and machine learning
| Liangyuan HuJung-Yi Joyce LinJiayi Ji
2021-04-06
Rethinking Perturbations in Encoder-Decoders for Fast Training
| Sho TakaseShun Kiyono
2021-04-05
Exploring Transformers in Emotion Recognition: a comparison of BERT, DistillBERT, RoBERTa, XLNet and ELECTRA
Diogo Cortiz
2021-04-05
What's the best place for an AI conference, Vancouver or ______: Why completing comparative questions is difficult
Avishai ZagouryEinat MinkovIdan SzpektorWilliam W. Cohen
2021-04-05
AST: Audio Spectrogram Transformer
Yuan GongYu-An ChungJames Glass
2021-04-05
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency
Yangyang ShiVarun NagarajaChunyang WuJay MahadeokarDuc LeRohit PrabhavalkarAlex XiaoChing-Feng YehJulian ChanChristian FuegenOzlem KalinliMichael L. Seltzer
2021-04-05
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
Suyoun KimAbhinav AroraDuc LeChing-Feng YehChristian FuegenOzlem KalinliMichael L. Seltzer
2021-04-05
ReCAM@IITK at SemEval-2021 Task 4: BERT and ALBERT based Ensemble for Abstract Word Prediction
| Abhishek MittalAshutosh Modi
2021-04-04
Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning
| Hui LiuDanqing ZhangBing YinXiaodan Zhu
2021-04-04
MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation using Augmented Data, Signals, and Transformers
Rohan GuptaJay MundraDeepak MahajanAshutosh Modi
2021-04-04
SimCD: Simultaneous Clustering and Differential expression analysis for single-cell transcriptomic data
| Seyednami NiyakanEhsan HajiramezanaliShahin BolukiSiamak Zamani DadanehXiaoning Qian
2021-04-04
TransfoRNN: Capturing the Sequential Information in Self-Attention Representations for Language Modeling
Tze Yuang ChongXuyang WangLin YangJunjie Wang
2021-04-04
IITK@Detox at SemEval-2021 Task 5: Semi-Supervised Learning and Dice Loss for Toxic Spans Detection
| Archit BansalAbhay KaushikAshutosh Modi
2021-04-04
Learning Mobile CNN Feature Extraction Toward Fast Computation of Visual Object Tracking
Tsubasa MurateTakashi WatanabeMasaki Yamada
2021-04-03
Unsupervised Domain Adaptation with Global and Local Graph Neural Networks in Limited Labeled Data Scenario: Application to Disaster Management
Samujjwal GhoshSubhadeep MajiMaunendra Sankar Desarkar
2021-04-03
Exploring the Role of BERT Token Representations to Explain Sentence Probing Results
Hosein MohebbiAli ModarressiMohammad Taher Pilehvar
2021-04-03
Deepfake Detection Scheme Based on Vision Transformer and Distillation
Young-Jin HeoYoung-Ju ChoiYoung-Woon LeeByung-Gyu Kim
2021-04-03
MR-Contrast-Aware Image-to-Image Translations with Generative Adversarial Networks
Jonas DenckJens GuehringAndreas MaierEva Rothgang
2021-04-03
Efficient DETR: Improving End-to-End Object Detector with Dense Prior
Zhuyu YaoJiangbo AiBoxun LiChi Zhang
2021-04-03
IITK@LCP at SemEval 2021 Task 1: Classification for Lexical Complexity Regression Task
| Neil Rajiv ShirudeSagnik MukherjeeTushar ShandhilyaAnanta MukherjeeAshutosh Modi
2021-04-02
Plot2API: Recommending Graphic API from Plot via Semantic Parsing Guided Neural Network
| Zeyu WangSheng HuangZhongxin LiuMeng YanXin XiaBei WangDan Yang
2021-04-02
Developing a New Autism Diagnosis Process Based on a Hybrid Deep Learning Architecture Through Analyzing Home Videos
Spencer HeRyan Liu
2021-04-02
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference
| Ben GrahamAlaaeldin El-NoubyHugo TouvronPierre StockArmand JoulinHervé JégouMatthijs Douze
2021-04-02
Language-based Video Editing via Multi-Modal Multi-Level Transformer
Tsu-Jui FuXin Eric WangScott T. GraftonMiguel P. EcksteinWilliam Yang Wang
2021-04-02
AAformer: Auto-Aligned Transformer for Person Re-Identification
Kuan ZhuHaiyun GuoShiliang ZhangYaoWei WangGaopan HuangHonglin QiaoJing LiuJinqiao WangMing Tang
2021-04-02
Effect of depth order on iterative nested named entity recognition models
Perceval WajsburtYoann TailléXavier Tannier
2021-04-02
The Coronavirus is a Bioweapon: Analysing Coronavirus Fact-Checked Stories
Lynnette Hui Xian NgKathleen M. Carley
2021-04-02
Bayesian Graph Convolutional Network for Traffic Prediction
Jun FuWei ZhouZhibo Chen
2021-04-01
EfficientNetV2: Smaller Models and Faster Training
| Mingxing TanQuoc V. Le
2021-04-01
Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis
Ajay JainMatthew TancikPieter Abbeel
2021-04-01
WakaVT: A Sequential Variational Transformer for Waka Generation
Yuka TakeishiMingxuan NiuJing LuoZhong JinXinyu Yang
2021-04-01
LoFTR: Detector-Free Local Feature Matching with Transformers
| Jiaming SunZehong ShenYuang WangHujun BaoXiaowei Zhou
2021-04-01
One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking
| Minghao ChenHouwen PengJianlong FuHaibin Ling
2021-04-01
TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking
Peng ChuJiang WangQuanzeng YouHaibin LingZicheng Liu
2021-04-01
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio BarnabòGiovanni TrappoliniLorenzo LastillaCesare CampagnanoAngela FanFabio PetroniFabrizio Silvestri
2021-04-01
HLE-UPC at SemEval-2021 Task 5: Multi-Depth DistilBERT for Toxic Spans Detection
| Rafel Palliser-SansAlbert Rial-Farràs
2021-04-01
Students are the Best Teacher: Exit-Ensemble Distillation with Multi-Exits
Hojung LeeJong-Seok Lee
2021-04-01
Keyword Transformer: A Self-Attention Model for Keyword Spotting
Axel BergMark O'ConnorMiguel Tairum Cruz
2021-04-01
Next Generation Multitarget Trackers: Random Finite Set Methods vs Transformer-based Deep Learning
| Juliano PintoGeorg HessWilliam LjungberghYuxuan XiaLennart SvenssonHenk Wymeersch
2021-04-01
Facial expression and attributes recognition based on multi-task learning of lightweight neural networks
| Savchenko A.V.
2021-03-31
Classification of Hematoma: Joint Learning of Semantic Segmentation and Classification
Hokuto HiranoTsuyoshi Okita
2021-03-31
Facial expression and attributes recognition based on multi-task learning of lightweight neural networks
| Andrey V. Savchenko
2021-03-31
Adversarial Attacks and Defenses for Speech Recognition Systems
Piotr ŻelaskoSonal JoshiYiwen ShaoJesus VillalbaJan TrmalNajim DehakSanjeev Khudanpur
2021-03-31
NetAdaptV2: Efficient Neural Architecture Search with Fast Super-Network Training and Architecture Optimization
Tien-Ju YangYi-Lun LiaoVivienne Sze
2021-03-31
Convolutional Dynamic Alignment Networks for Interpretable Classifications
Moritz BöhleMario FritzBernt Schiele
2021-03-31
Spatiotemporal Transformer for Video-based Person Re-identification
Tianyu ZhangLonghui WeiLingxi XieZijie ZhuangYongfei ZhangBo LiQi Tian
2021-03-30
Automatic Graph Partitioning for Very Large-scale Deep Learning
Masahiro TanakaKenjiro TauraToshihiro HanawaKentaro Torisawa
2021-03-30
Read and Attend: Temporal Localisation in Sign Language Videos
Gül VarolLiliane MomeniSamuel AlbanieTriantafyllos AfourasAndrew Zisserman
2021-03-30
Differentiable Network Adaption with Elastic Search Space
Shaopeng GuoYujie WangKun YuanQuanquan Li
2021-03-30
Automated Cleanup of the ImageNet Dataset by Model Consensus, Explainability and Confident Learning
| Csaba Kertész
2021-03-30
Rethinking Spatial Dimensions of Vision Transformers
| Byeongho HeoSangdoo YunDongyoon HanSanghyuk ChunJunsuk ChoeSeong Joon Oh
2021-03-30
Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Mingchen ZhugeDehong GaoDeng-Ping FanLinbo JinBen ChenHaoming ZhouMinghui QiuLing Shao
2021-03-30
Nonlinear Weighted Directed Acyclic Graph and A Priori Estimates for Neural Networks
Yuqing LiTao LuoChao Ma
2021-03-30
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers
| Debanjan ChaudhuriMd Rashad Al Hasan RonyJens Lehmann
2021-03-30
An In-depth Analysis of Passage-Level Label Transfer for Contextual Document Ranking
Koustav RudraZeon Trevor FernandoAvishek Anand
2021-03-30
FocusedDropout for Convolutional Neural Network
Tianshu XieMinghui LiuJiali DengXuan ChengXiaomin WangMing Liu
2021-03-29
CvT: Introducing Convolutions to Vision Transformers
| Haiping WuBin XiaoNoel CodellaMengchen LiuXiyang DaiLu YuanLei Zhang
2021-03-29
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
Pengchuan ZhangXiyang DaiJianwei YangBin XiaoLu YuanLei ZhangJianfeng Gao
2021-03-29
Transformer Tracking
| Xin ChenBin YanJiawen ZhuDong WangXiaoyun YangHuchuan Lu
2021-03-29
Retraining DistilBERT for a Voice Shopping Assistant by Using Universal Dependencies
Pratik JayaraoArpit Sharma
2021-03-29
Whitening Sentence Representations for Better Semantics and Faster Retrieval
| Jianlin SuJiarun CaoWeijie LiuYangyiwen Ou
2021-03-29
Contextual Text Embeddings for Twi
Paul AzunreSalomey OseiSalomey AddoLawrence Asamoah Adu-GyamfiStephen MooreBernard AdabankahBernard OpokuClara Asare-NyarkoSamuel NyarkoCynthia AmoabaEsther Dansoa AppiahFelix AkwerhRichard Nii Lante LawsonJoel BuduEmmanuel DebrahNana BoatengWisdom OforiEdwin Buabeng-MunkohFranklin AdjeiIsaac Kojo Essel AmpomahJoseph OtooReindorf BorkorStandylove Birago MensahLucien MensahMark Amoako MarcelAnokye Acheampong AmponsahJames Ben Hayfron-Acquah
2021-03-29
Industry Scale Semi-Supervised Learning for Natural Language Understanding
Luoxin ChenFrancisco GarciaVarun KumarHe XieJianhua Lu
2021-03-29
Noise Injection-based Regularization for Point Cloud Processing
Xiao ZangYi XieSiyu LiaoJie ChenBo Yuan
2021-03-28
PENELOPIE: Enabling Open Information Extraction for the Greek Language through Machine Translation
| Dimitris PapadopoulosNikolaos PapadakisNikolaos Matsatsinis
2021-03-28
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Ye JiaHeiga ZenJonathan ShenYu ZhangYonghui Wu
2021-03-28
HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval
Song LiuHaoqi FanShengsheng QianYiru ChenWenkui DingZhongyuan Wang
2021-03-28
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
| Chun-Fu ChenQuanfu FanRameswar Panda
2021-03-27
Face Transformer for Recognition
Yaoyao ZhongWeihong Deng
2021-03-27
Unsupervised Self-Training for Sentiment Analysis of Code-Switched Data
Akshat GuptaSargam MenghaniSai Krishna RallabandiAlan W Black
2021-03-27
Leveraging neural representations for facilitating access to untranscribed speech from endangered languages
| Nay SanMartijn BarteldsMitchell BrowneLily CliffordFiona GibsonJohn MansfieldDavid NashJane SimpsonMyfany TurpinMaria VollmerSasha WilmothDan Jurafsky
2021-03-26
A Practical Survey on Faster and Lighter Transformers
Quentin FournierGaétan Marceau CaronDaniel Aloise
2021-03-26
Training a Better Loss Function for Image Restoration
Aamir MustafaAliaksei MikhailiukDan Andrei IliescuVarun BabbarRafal K. Mantiuk
2021-03-26
Understanding Robustness of Transformers for Image Classification
Srinadh BhojanapalliAyan ChakrabartiDaniel GlasnerDaliang LiThomas UnterthinerAndreas Veit
2021-03-26
Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN
Congyi WangYu ChenBin WangYi Shi
2021-03-26
Gated Transformer Networks for Multivariate Time Series Classification
| Minghao LiuShengqi RenSiyuan MaJiahui JiaoYizhou ChenZhiguang WangWei Song
2021-03-26
Lifting Transformer for 3D Human Pose Estimation in Video
Wenhao LiHong LiuRunwei DingMengyuan LiuPichao Wang
2021-03-26
On Generating Transferable Targeted Perturbations
| Muzammal NaseerSalman KhanMunawar HayatFahad Shahbaz KhanFatih Porikli
2021-03-26
Predicting Directionality in Causal Relations in Text
| Pedram HosseiniDavid A. BroniatowskiMona Diab
2021-03-25
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
| Ze LiuYutong LinYue CaoHan HuYixuan WeiZheng ZhangStephen LinBaining Guo
2021-03-25
AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting
| Ye YuanXinshuo WengYanglan OuKris Kitani
2021-03-25
Visual Grounding Strategies for Text-Only Natural Language Processing
Damien Sileo
2021-03-25
Bertinho: Galician BERT Representations
David VilaresMarcos GarciaCarlos Gómez-Rodríguez
2021-03-25
Mask Attention Networks: Rethinking and Strengthen Transformer
Zhihao FanYeyun GongDayiheng LiuZhongyu WeiSiyuan WangJian JiaoNan DuanRuofei ZhangXuanjing Huang
2021-03-25
BERT4SO: Neural Sentence Ordering by Fine-tuning BERT
Yutao ZhuJian-Yun NieKun ZhouShengchao LiuPan Du
2021-03-25
Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning Performance of GPT-2
Gregor BetzKyle RichardsonChristian Voigt
2021-03-24
FastMoE: A Fast Mixture-of-Expert Training System
| Jiaao HeJiezhong QiuAohan ZengZhilin YangJidong ZhaiJie Tang
2021-03-24
Czert -- Czech BERT-like Model for Language Representation
| Jakub SidoOndřej PražákPavel PřibáňJan PašekMichal SejákMiloslav Konopík
2021-03-24
Multi-view 3D Reconstruction with Transformer
Dan WangXinrui CuiXun ChenZhengxia ZouTianyang ShiSeptimiu SalcudeanZ. Jane WangRabab Ward
2021-03-24
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
| Amaia SalvadorErhan GundogduLoris BazzaniMichael Donoser
2021-03-24
Efficient Multi-Objective Optimization for Deep Learning
| Michael RuchteJosif Grabocka
2021-03-24
Spatio-Temporal Sparsification for General Robust Graph Convolution Networks
Mingming LuYa zhang
2021-03-23
Are all outliers alike? On Understanding the Diversity of Outliers for Detecting OODs
Ramneet KaurSusmit JhaAnirban RoyOleg SokolskyInsup Lee
2021-03-23
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection
Jan Philip WahleTerry RuasNorman MeuschkeBela Gipp
2021-03-23
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search
| Changlin LiTao TangGuangrun WangJiefeng PengBing WangXiaodan LiangXiaojun Chang
2021-03-23
Detecting Hate Speech with GPT-3
| Ke-Li ChiuRohan Alexander
2021-03-23
TMR: Evaluating NER Recall on Tough Mentions
Jingxuan TuConstantine Lignos
2021-03-23
Repairing Pronouns in Translation with BERT-Based Post-Editing
Reid Pryzant
2021-03-23
Variable Name Recovery in Decompiled Binary Code using Constrained Masked Language Modeling
Pratyay BanerjeeKuntal Kumar PalFish WangChitta Baral
2021-03-23
On the Robustness of Monte Carlo Dropout Trained with Noisy Labels
Purvi GoelLi Chen
2021-03-22
Comprehensive process-molten pool relations modeling using CNN for wire-feed laser additive manufacturing
Noopur JamnikarSen LiuCraig BriceXiaoli Zhang
2021-03-22
Open Domain Question Answering over Tables via Dense Retrieval
| Jonathan HerzigThomas MüllerSyrine KricheneJulian Martin Eisenschlos
2021-03-22
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
2021-03-22
Hybrid Model for Patent Classification using Augmented SBERT and KNN
| Hamid BekamiriDaniel S. HainRoman Jurowetzki
2021-03-22
Identifying Machine-Paraphrased Plagiarism
| Jan Philip WahleTerry RuasTomáš FoltýnekNorman MeuschkeBela Gipp
2021-03-22
Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
Mikael BrunilaRosie ZhaoAndrei MirceaSam LumleyRenee Sieber
2021-03-22
Incorporating Convolution Designs into Visual Transformers
| Kun YuanShaopeng GuoZiwei LiuAojun ZhouFengwei YuWei Wu
2021-03-22
Tiny Transformers for Environmental Sound Classification at the Edge
David ElliottCarlos E. OteroSteven WyattEvan Martino
2021-03-22
End-to-End Trainable Multi-Instance Pose Estimation with Transformers
Lucas StofflMaxime VidalAlexander Mathis
2021-03-22
ProgressiveSpinalNet architecture for FC layers
| Praveen Chopra
2021-03-21
Paying Attention to Activation Maps in Camera Pose Regression
Yoli ShavitRon FerensYosi Keller
2021-03-21
L3CubeMahaSent: A Marathi Tweet-based Sentiment Analysis Dataset
Atharva KulkarniMeet MandhaneManali LikhitkarGayatri KshirsagarRaviraj Joshi
2021-03-21
Non-Autoregressive Translation by Learning Target Categorical Codes
Yu BaoShuJian HuangTong XiaoDongqi WangXinyu DaiJiajun Chen
2021-03-21
MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation
Zachary SeymourKowshik ThopalliNiluthpol MithunHan-Pang ChiuSupun SamarasekeraRakesh Kumar
2021-03-21
ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques
| Yuanxin LiuZheng LinFengcheng Yuan
2021-03-21
NameRec*: Highly Accurate and Fine-grained Person Name Recognition
Rui ZhangYimeng DaiShijie Liu
2021-03-21
An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information
Zejun LiZhongyu WeiZhihao FanHaijun ShanXuanjing Huang
2021-03-21
Paying Attention to Multiscale Feature Maps in Multimodal Image Matching
Aviad MoreshetYosi Keller
2021-03-20
Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
| Honglu ZhouAsim KadavFarley LaiAlexandru Niculescu-MizilMartin Renqiang MinMubbasir KapadiaHans Peter Graf
2021-03-19
Play the Shannon Game With Language Models: A Human-Free Approach to Summary Evaluation
Nicholas EganOleg VasilyevJohn Bohannon
2021-03-19
Transferable Model for Shape Optimization subject to Physical Constraints
Lukas HarschJohannes BurgbacherStefan Riedelbauch
2021-03-19
MuRIL: Multilingual Representations for Indian Languages
Simran KhanujaDiksha BansalSarvesh MehtaniSavya KhoslaAtreyee DeyBalaji GopalanDilip Kumar MargamPooja AggarwalRajiv Teja NagipoguShachi DaveShruti GuptaSubhash Chandra Bose GaliVish SubramanianPartha Talukdar
2021-03-19
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
| Stéphane d'AscoliHugo TouvronMatthew LeavittAri MorcosGiulio BiroliLevent Sagun
2021-03-19
Cost-effective Deployment of BERT Models in Serverless Environment
Katarína BenešováAndrej ŠvecMarek Šuppa
2021-03-19
API2Com: On the Improvement of Automatically Generated Code Comments Using API Documentations
Ramin ShahbaziRishab SharmaFatemeh H. Fard
2021-03-19
Cascade Weight Shedding in Deep Neural Networks: Benefits and Pitfalls for Network Pruning
Kambiz AzarianFatih Porikli
2021-03-19
HW-NAS-Bench:Hardware-Aware Neural Architecture Search Benchmark
| Chaojian LiZhongzhi YuYonggan FuYongan ZhangYang ZhaoHaoran YouQixuan YuYue WangYingyan Lin
2021-03-19
Let Your Heart Speak in its Mother Tongue: Multilingual Captioning of Cardiac Signals
| Dani KiyassehTingting ZhuDavid Clifton
2021-03-19
GPT Understands, Too
| Xiao LiuYanan ZhengZhengxiao DuMing DingYujie QianZhilin YangJie Tang
2021-03-18
All NLP Tasks Are Generation Tasks: A General Pretraining Framework
| Zhengxiao DuYujie QianXiao LiuMing DingJiezhong QiuZhilin YangJie Tang
2021-03-18
Contextual Biasing of Language Models for Speech Recognition in Goal-Oriented Conversational Agents
Ashish ShenoySravan BodapatiKatrin Kirchhoff
2021-03-18
Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training
Saurabh SahuPalash Goyal
2021-03-18
Model Extraction and Adversarial Transferability, Your BERT is Vulnerable!
Xuanli HeLingjuan LyuQiongkai XuLichao Sun
2021-03-18
Danish Fungi 2020 -- Not Just Another Image Recognition Dataset
| Lukáš PicekMilan ŠulcJiří MatasJacob Heilmann-ClausenThomas S. JeppesenThomas LæssøeTobias Frøslev
2021-03-18
On the Role of Images for Analyzing Claims in Social Media
| Gullal S. CheemaSherzod HakimovEric Müller-BudackRalph Ewerth
2021-03-17
Trans-SVNet: Accurate Phase Recognition from Surgical Videos via Hybrid Embedding Aggregation Transformer
Xiaojie GaoYueming JinYonghao LongQi DouPheng-Ann Heng
2021-03-17
UniParma at SemEval-2021 Task 5: Toxic Spans Detection Using CharacterBERT and Bag-of-Words Model
Akbar KarimiLeonardo RossiAndrea Prati
2021-03-17
Code Word Detection in Fraud Investigations using a Deep-Learning Approach
Youri van der ZeeJan C. ScholtesMarcel WesterhoudJulien Rossi
2021-03-17
You Only Look One-level Feature
| Qiang ChenYingming WangTong YangXiangyu ZhangJian ChengJian Sun
2021-03-17
Predicting Early Dropout: Calibration and Algorithmic Fairness Considerations
Marzieh Karimi-HaghighiCarlos CastilloDavinia Hernandez-LeoVeronica Moreno Oliver
2021-03-16
The Influence of Dropout on Membership Inference in Differentially Private Models
Erick Galinkin
2021-03-16
Dense Interaction Learning for Video-based Person Re-identification
Tianyu HeXin JinXu ShenJianqiang HuangZhibo ChenXian-Sheng Hua
2021-03-16
KGSynNet: A Novel Entity Synonyms Discovery Framework with Knowledge Graph
Yiying YangXi YinHaiqin YangXingjian FeiHao PengKaijie ZhouKunfeng LaiJianping Shen
2021-03-16
Robustly Optimized and Distilled Training for Natural Language Understanding
Haytham ElFadeelStan Peshterliev
2021-03-16
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
| Siqi SunYen-Chun ChenLinjie LiShuohang WangYuwei FangJingjing Liu
2021-03-16
Knowledge driven Description Synthesis for Floor Plan Interpretation
Shreya GoyalChiranjoy ChattopadhyayGaurav Bhatnagar
2021-03-15
SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels
Chenliang LiMing YanHaiyang XuFuli LuoWei WangBin BiSongfang Huang
2021-03-14
Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting
| Chen LinZhichao OuyangJunqing ZhuangJianqiang ChenHui LiRongxin Wu
2021-03-14
Embedding Calibration for Music Semantic Similarity using Auto-regressive Transformer
Xinran ZhangMaosong SunJiafeng LiuXiaobing Li
2021-03-13
Revisiting ResNets: Improved Training and Scaling Strategies
| Irwan BelloWilliam FedusXianzhi DuEkin D. CubukAravind SrinivasTsung-Yi LinJonathon ShlensBarret Zoph
2021-03-13
Text Mining of Stocktwits Data for Predicting Stock Prices
Mukul JaggiPriyanka MandalShreya NarangUsman NaseemMatloob Khushi
2021-03-13
Bilingual Dictionary-based Language Model Pretraining for Neural Machine Translation
Yusen LinJiayong LinShuaicheng ZhangHaoying Dai
2021-03-12
Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of Pre-trained Models' Transferability
Wei-Tsung KaoHung-Yi Lee
2021-03-12
Explaining and Improving BERT Performance on Lexical Semantic Change Detection
Severin LaicherSinan KurtyigitDominik SchlechtwegJonas KuhnSabine Schulte im Walde
2021-03-12
Vision Transformer for COVID-19 CXR Diagnosis using Chest X-ray Feature Corpus
Sangjoon ParkGwanghyun KimYujin OhJoon Beom SeoSang Min LeeJin Hwan KimSungjun MoonJae-Kwang LimJong Chul Ye
2021-03-12
Severity Quantification and Lesion Localization of COVID-19 on CXR using Vision Transformer
Gwanghyun KimSangjoon ParkYujin OhJoon Beom SeoSang Min LeeJin Hwan KimSungjun MoonJae-Kwang LimJong Chul Ye
2021-03-12
Sequential Random Network for Fine-grained Image Classification
Chaorong LiMalu ZhangWei HuangFengqing QinAnping ZengYuanyuan Huang
2021-03-12
Predicting the Behavior of Dealers in Over-The-Counter Corporate Bond Markets
Yusen LinJinming XueLouiqa Raschid
2021-03-12
Comparing the Performance of NLP Toolkits and Evaluation measures in Legal Tech
Muhammad Zohaib Khan
2021-03-12
Intraclass clustering: an implicit learning ability that regularizes DNNs
Carbonnelle SimonChristophe De Vleeschouwer
2021-03-11
Evaluation of Morphological Embeddings for the Russian Language
Vitaly RomanovAlbina Khusainova
2021-03-11
Improving Bi-encoder Document Ranking Models with Two Rankers and Multi-teacher Distillation
Jaekeol ChoiEuna JungJangwon SuhWonjong Rhee
2021-03-11
Composite Re-Ranking for Efficient Document Search with BERT
Yingrui YangYifan QiaoJinjin ShaoMayuresh AnandXifeng YanTao Yang
2021-03-11
Towards Multi-Sense Cross-Lingual Alignment of Contextual Embeddings
Linlin LiuThien Hai NguyenShafiq JotyLidong BingLuo Si
2021-03-11
LightMBERT: A Simple Yet Effective Method for Multilingual BERT Distillation
Xiaoqi JiaoYichun YinLifeng ShangXin JiangXiao ChenLinlin LiFang WangQun Liu
2021-03-11
FairFil: Contrastive Neural Debiasing Method for Pretrained Text Encoders
Pengyu ChengWeituo HaoSiyang YuanShijing SiLawrence Carin
2021-03-11
Self-supervised Text-to-SQL Learning with Header Alignment Training
Donggyu KimSeanie Lee
2021-03-11
Unknown Object Segmentation from Stereo Images
Maximilian DurnerWout BoerdijkMartin SundermeyerWerner FriedlZoltan-Csaba MartonRudolph Triebel
2021-03-11
On Improving Deep Learning Trace Analysis with System Call Arguments
Quentin FournierDaniel AloiseSeyed Vahid AzhariFrançois Tetreault
2021-03-11
Continuous 3D Multi-Channel Sign Language Production via Progressive Transformers and Mixture Density Networks
Ben SaundersNecati Cihan CamgozRichard Bowden
2021-03-11
CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
| Dan HendrycksCollin BurnsAnya ChenSpencer Ball
2021-03-10
Majority Voting with Bidirectional Pre-translation For Bitext Retrieval
| Alex JonesDerry Tanti Wijaya
2021-03-10
Hurdles to Progress in Long-form Question Answering
Kalpesh KrishnaAurko RoyMohit Iyyer
2021-03-10
CEQE: Contextualized Embeddings for Query Expansion
Shahrzad NaseriJeffrey DaltonAndrew YatesJames Allan
2021-03-09
Enhancing sensor resolution improves CNN accuracy given the same number of parameters or FLOPS
| Ali Borji
2021-03-09
Pretrained Transformers as Universal Computation Engines
| Kevin LuAditya GroverPieter AbbeelIgor Mordatch
2021-03-09
Active Testing: Sample-Efficient Model Evaluation
| Jannik KossenSebastian FarquharYarin GalTom Rainforth
2021-03-09
Language Models have a Moral Dimension
Patrick SchramowskiCigdem TuranNico AndersenConstantin RothkopfKristian Kersting
2021-03-08
Syntax-BERT: Improving Pre-trained Transformers with Syntax Trees
| Jiangang BaiYujing WangYiren ChenYaming YangJing BaiJing YuYunhai Tong
2021-03-07
Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain
Jinyu TianJiantao ZhouYuanman LiJia Duan
2021-03-07
TransBTS: Multimodal Brain Tumor Segmentation Using Transformer
| Wenxuan WangChen ChenMeng DingJiangyun LiHong YuSen Zha
2021-03-07
Orthogonal Attention: A Cloze-Style Approach to Negation Scope Resolution
Aditya KhandelwalVahida Attar
2021-03-07
MTLHealth: A Deep Learning System for Detecting Disturbing Content in Student Essays
Joseph ValenciaErin Yao
2021-03-07
Contextual Dropout: An Efficient Sample-Dependent Dropout Module
Xinjie FanShujian ZhangKorawat TanwisuthXiaoning QianMingyuan Zhou
2021-03-06
WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition
Zheng ZhuGuan HuangJiankang DengYun YeJunJie HuangXinze ChenJiagang ZhuTian YangJiwen LuDalong DuJie zhou
2021-03-06
MalBERT: Using Transformers for Cybersecurity and Malicious Software Detection
Abir RahaliMoulay A. Akhloufi
2021-03-05
Fine-tuning Pretrained Multilingual BERT Model for Indonesian Aspect-based Sentiment Analysis
Annisa Nurul AzharMasayu Leylia Khodra
2021-03-05
Non-invasive Self-attention for Side Information Fusion in Sequential Recommendation
Chang LiuXiaoguang LiGuohao CaiZhenhua DongHong ZhuLifeng Shang
2021-03-05
Measuring Mathematical Problem Solving With the MATH Dataset
| Dan HendrycksCollin BurnsSaurav KadavathAkul AroraSteven BasartEric TangDawn SongJacob Steinhardt
2021-03-05
SpecTr: Spectral Transformer for Hyperspectral Pathology Image Segmentation
Boxiang Yun, Yan Wang, Jieneng Chen, Huiyu Wang, Wei Shen, Qingli Li
2021-03-05
Hierarchical Transformer for Multilingual Machine Translation
Albina Khusainova, Adil Khan, Adín Ramírez Rivera, Vitaly Romanov
2021-03-05
IOT: Instance-wise Layer Reordering for Transformer Structures
Jinhua Zhu, Lijun Wu, Yingce Xia, Shufang Xie, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu
2021-03-05
CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
Yutong Xie, Jianpeng Zhang, Chunhua Shen, Yong Xia
2021-03-04
The Transformer Network for the Traveling Salesman Problem
Xavier Bresson, Thomas Laurent
2021-03-04
Coordinate Attention for Efficient Mobile Network Design
Qibin Hou, Daquan Zhou, Jiashi Feng
2021-03-04
Hardware Acceleration of Fully Quantized BERT for Efficient Natural Language Processing
Zejian Liu, Gang Li, Jian Cheng
2021-03-04
End-to-end acoustic modelling for phone recognition of young readers
Lucile Gelin, Morgane Daniel, Julien Pinquier, Thomas Pellegrini
2021-03-04
Sensing population distribution from satellite imagery via deep learning: model selection, neighboring effect, and systematic biases
Xiao Huang, Di Zhu, Fan Zhang, Tao Liu, Xiao Li, Lei Zou
2021-03-03
University of Copenhagen Participation in TREC Health Misinformation Track 2020
Lucas Chaves Lima, Dustin Brandon Wright, Isabelle Augenstein, Maria Maistro
2021-03-03
Few-shot Learning for Slot Tagging with Attentive Relational Network
Cennet Oguz, Ngoc Thang Vu
2021-03-03
Listen, Read, and Identify: Multimodal Singing Language Identification of Music
Keunwoo Choi, Yuxuan Wang
2021-03-02
Dual Reinforcement-Based Specification Generation for Image De-Rendering
Ramakanth Pasunuru, David Rosenberg, Gideon Mann, Mohit Bansal
2021-03-02
Hate Towards the Political Opponent: A Twitter Corpus Study of the 2020 US Elections on the Basis of Offensive Speech and Stance Detection
Lara Grimminger, Roman Klinger
2021-03-02
Probing Product Description Generation via Posterior Distillation
Haolan Zhan, Hainan Zhang, Hongshen Chen, Lei Shen, Zhuoye Ding, Yongjun Bao, Weipeng Yan, Yanyan Lan
2021-03-02
A HINT from Arithmetic: On Systematic Generalization of Perception, Syntax, and Semantics
Qing Li, Siyuan Huang, Yining Hong, Yixin Zhu, Ying Nian Wu, Song-Chun Zhu
2021-03-02
A Body Part Embedding Model With Datasets for Measuring 2D Human Motion Similarity
Jonghyuk Park, Sukhyun Cho, Dongwoo Kim, Oleksandr Bailo, Heewoong Park, Sanghoon Hong, Jonghun Park
2021-03-02
LocalDrop: A Hybrid Regularization for Deep Neural Networks
Ziqing Lu, Chang Xu, Bo Du, Takashi Ishida, Lefei Zhang, Masashi Sugiyama
2021-03-01
BERT-based knowledge extraction method of unstructured domain text
Wang Zijia, Li Ye, Zhu Zhongkai
2021-03-01
Combat COVID-19 Infodemic Using Explainable Natural Language Processing Models
Jackie Ayoub, X. Jessie Yang, Feng Zhou
2021-03-01
Long Document Summarization in a Low Resource Setting using Pretrained Language Models
Ahsaas Bajaj, Pavitra Dangati, Kalpesh Krishna, Pradhiksha Ashok Kumar, Rheeya Uppaal, Bradford Windsor, Eliot Brenner, Dominic Dotterrer, Rajarshi Das, Andrew McCallum
2021-03-01
BERT based patent novelty search by training claims to their own description
Michael Freunek, André Bodmer
2021-03-01
CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation
Aly Magassouba, Komei Sugiura, Hisashi Kawai
2021-03-01
Brain Programming is Immune to Adversarial Attacks: Towards Accurate and Robust Image Classification using Symbolic Learning
Gerardo Ibarra-Vazquez, Gustavo Olague, Mariana Chan-Ley, Cesar Puente, Carlos Soubervielle-Montalvo
2021-03-01
NLP-CUET@LT-EDI-EACL2021: Multilingual Code-Mixed Hope Speech Detection using Cross-lingual Representation Learner
Eftekhar Hossain, Omar Sharif, Mohammed Moshiul Hoque
2021-02-28
NLP-CUET@DravidianLangTech-EACL2021: Investigating Visual and Textual Features to Identify Trolls from Multimodal Social Media Memes
Eftekhar Hossain, Omar Sharif, Mohammed Moshiul Hoque
2021-02-28
NLP-CUET@DravidianLangTech-EACL2021: Offensive Language Detection from Multilingual Code-Mixed Text using Transformers
Omar Sharif, Eftekhar Hossain, Mohammed Moshiul Hoque
2021-02-28
Transformer in Transformer
Kai Han, An Xiao, Enhua Wu, Jianyuan Guo, Chunjing Xu, Yunhe Wang
2021-02-27
Transformers with Competitive Ensembles of Independent Mechanisms
Alex Lamb, Di He, Anirudh Goyal, Guolin Ke, Chien-Feng Liao, Mirco Ravanelli, Yoshua Bengio
2021-02-27
A Novel Adaptive Deep Network for Building Footprint Segmentation
A. Ziaee, R. Dehbozorgi, M. Döller
2021-02-27
Generative chemical transformer: attention makes neural machine learn molecular geometric structures via text
Hyunseung Kim, Jonggeol Na, Won Bo Lee
2021-02-27
COVID-19 Tweets Analysis through Transformer Language Models
Abdul Hameed Azeemi, Adeel Waheed
2021-02-27
FjORD: Fair and Accurate Federated Learning under heterogeneous targets with Ordered Dropout
Samuel Horvath, Stefanos Laskaridis, Mario Almeida, Ilias Leontiadis, Stylianos I. Venieris, Nicholas D. Lane
2021-02-26
Multi-task transfer learning for finding actionable information from crisis-related messages on social media
Congcong Wang, David Lillis
2021-02-26
MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition
Linghui Meng, Jin Xu, Xu Tan, Jindong Wang, Tao Qin, Bo Xu
2021-02-25
LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching
Boer Lyu, Lu Chen, Su Zhu, Kai Yu
2021-02-25
Sentiment Analysis of Persian-English Code-mixed Texts
Nazanin Sabri, Ali Edalat, Behnam Bahrak
2021-02-25
LazyFormer: Self Attention with Lazy Update
Chengxuan Ying, Guolin Ke, Di He, Tie-Yan Liu
2021-02-25
Emotion-Aware, Emotion-Agnostic, or Automatic: Corpus Creation Strategies to Obtain Cognitive Event Appraisal Annotations
Jan Hofmann, Enrica Troiano, Roman Klinger
2021-02-25
Robust Pollen Imagery Classification with Generative Modeling and Mixup Training
Jaideep Murkute
2021-02-25
A Framework For Pruning Deep Neural Networks Using Energy-Based Models
Hojjat Salehinejad, Shahrokh Valaee
2021-02-25
PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning
Nasi Jofche, Kostadin Mishev, Riste Stojanov, Milos Jovanovik, Dimitar Trajanov
2021-02-25
BERT-based Acronym Disambiguation with Multiple Training Strategies
Chunguang Pan, Bingyan Song, Shengguang Wang, Zhipeng Luo
2021-02-25
Task-Specific Pre-Training and Cross Lingual Transfer for Code-Switched Data
Akshat Gupta, Sai Krishna Rallabandi, Alan Black
2021-02-24
LRG at SemEval-2021 Task 4: Improving Reading Comprehension with Abstract Words using Augmentation, Linguistic Features and Voting
Abheesht Sharma, Harshit Pandey, Gunjan Chhablani, Yash Bhartia, Tirtharaj Dash
2021-02-24
NLRG at SemEval-2021 Task 5: Toxic Spans Detection Leveraging BERT-based Token Classification and Span Prediction Techniques
Gunjan Chhablani, Yash Bhartia, Abheesht Sharma, Harshit Pandey, Shan Suthaharan
2021-02-24
From Universal Language Model to Downstream Task: Improving RoBERTa-Based Vietnamese Hate Speech Detection
Quang Huu Pham, Viet Anh Nguyen, Linh Bao Doan, Ngoc N. Tran, Ta Minh Thanh
2021-02-24
When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute
Tao Lei
2021-02-24
PADA: A Prompt-based Autoregressive Approach for Adaptation to Unseen Domains
Eyal Ben-David, Nadav Oved, Roi Reichart
2021-02-24
Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers
Ishan Sanjeev Upadhyay, Nikhil E, Anshul Wadhawan, Radhika Mamidi
2021-02-24
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao
2021-02-24
Accurate Learning of Graph Representations with Graph Multiset Pooling
Jinheon Baek, Minki Kang, Sung Ju Hwang
2021-02-23
Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models
Harold Ott, Jasmin Bogatinovski, Alexander Acker, Sasho Nedelkoski, Odej Kao
2021-02-23
Automatic Ship Classification Utilizing Bag of Deep Features
Sadegh Soleimani Pour, Ata Jodeiri, Hossein Rashidi, Seyed Mostafa Mirhassani, Hoda Kheradfallah, Hadi Seyedarabi
2021-02-23
Minimally-Supervised Structure-Rich Text Categorization via Learning on Text-Rich Networks
Xinyang Zhang, Chenwei Zhang, Luna Xin Dong, Jingbo Shang, Jiawei Han
2021-02-23
VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels
Saahil Jain, Akshay Smit, Steven QH Truong, Chanh DT Nguyen, Minh-Thanh Huynh, Mudit Jain, Victoria A. Young, Andrew Y. Ng, Matthew P. Lungren, Pranav Rajpurkar
2021-02-23
Deep Deformation Detail Synthesis for Thin Shell Models
Lan Chen, Lin Gao, Jie Yang, Shibiao Xu, Juntao Ye, Xiaopeng Zhang, Yu-Kun Lai
2021-02-23
Do Transformer Modifications Transfer Across Implementations and Applications?
Sharan Narang, Hyung Won Chung, Yi Tay, William Fedus, Thibault Fevry, Michael Matena, Karishma Malkan, Noah Fiedel, Noam Shazeer, Zhenzhong Lan, Yanqi Zhou, Wei Li, Nan Ding, Jake Marcus, Adam Roberts, Colin Raffel
2021-02-23
RCoNet: Deformable Mutual Information Maximization and High-order Uncertainty-aware Learning for Robust COVID-19 Detection
Shunjie Dong, Qianqian Yang, Yu Fu, Mei Tian, Cheng Zhuo
2021-02-22
Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks
Tingyu Xia, Yue Wang, Yuan Tian, Yi Chang
2021-02-22
Evaluating Contextualized Language Models for Hungarian
Judit Ács, Dániel Lévai, Dávid Márk Nemeskey, András Kornai
2021-02-22
Deepfake Video Detection Using Convolutional Vision Transformer
Deressa Wodajo, Solomon Atnafu
2021-02-22
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model
Junwei Liao, Yu Shi, Ming Gong, Linjun Shou, Sefik Eskimez, Liyang Lu, Hong Qu, Michael Zeng
2021-02-22
Position Information in Transformers: An Overview
Philipp Dufter, Martin Schmitt, Hinrich Schütze
2021-02-22
Determination of Fault Location in Transmission Lines with Image Processing and Artificial Neural Networks
Serkan Budak, Bahadir Akbal
2021-02-22
Few Shot Learning for Information Verification
Usama Khalid, Mirza Omer Beg
2021-02-22
Conditional Positional Encodings for Vision Transformers
Xiangxiang Chu, Zhi Tian, Bo Zhang, Xinlong Wang, Xiaolin Wei, Huaxia Xia, Chunhua Shen
2021-02-22
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang Hu, Amanpreet Singh
2021-02-22
MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Wancong Zhang, Ieshan Vaidya
2021-02-22
Lightweight Combinational Machine Learning Algorithm for Sorting Canine Torso Radiographs
Masuda Akter Tonima, Fatemeh Esfahani, Austin Dehart, Youmin Zhang
2021-02-22
RUBERT: A Bilingual Roman Urdu BERT Using Cross Lingual Transfer Learning
Usama Khalid, Mirza Omer Beg, Muhammad Umair Arshad
2021-02-22
Parallelizing Legendre Memory Unit Training
Narsimha Chilkuri, Chris Eliasmith
2021-02-22
Pre-Training BERT on Arabic Tweets: Practical Considerations
Ahmed Abdelali, Sabit Hassan, Hamdy Mubarak, Kareem Darwish, Younes Samih
2021-02-21
Web-based Application for Detecting Indonesian Clickbait Headlines using IndoBERT
Muhammad Noor Fakhruzzaman, Sie Wildan Gunawan
2021-02-21
Medical Transformer: Gated Axial-Attention for Medical Image Segmentation
Jeya Maria Jose Valanarasu, Poojan Oza, Ilker Hacihaliloglu, Vishal M. Patel
2021-02-21
Unsupervised Medical Image Alignment with Curriculum Learning
Mihail Burduja, Radu Tudor Ionescu
2021-02-20
Towards Accurate and Compact Architectures via Neural Architecture Transformer
Yong Guo, Yin Zheng, Mingkui Tan, Qi Chen, Zhipeng Li, Jian Chen, Peilin Zhao, Junzhou Huang
2021-02-20
Multilingual Answer Sentence Reranking via Automatically Translated Data
Thuy Vu, Alessandro Moschitti
2021-02-20
Learning Dynamic BERT via Trainable Gate Variables and a Bi-modal Regularizer
Seohyeong Jeong, Nojun Kwak
2021-02-19
Towards Emotion Recognition in Hindi-English Code-Mixed Data: A Transformer Based Approach
Anshul Wadhawan, Akshita Aggarwal
2021-02-19
Using Transformer based Ensemble Learning to classify Scientific Articles
Sohom Ghosh, Ankush Chopra
2021-02-19
Calibrate Before Use: Improving Few-Shot Performance of Language Models
Tony Z. Zhao, Eric Wallace, Shi Feng, Dan Klein, Sameer Singh
2021-02-19
Dialect Identification in Nuanced Arabic Tweets Using Farasa Segmentation and AraBERT
Anshul Wadhawan
2021-02-19
Latent Variable Nested Set Transformers & AutoBots
Roger Girgis, Florian Golemo, Felipe Codevilla, Jim Aldon D'Souza, Samira Ebrahimi Kahou, Felix Heide, Christopher Pal
2021-02-19
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer
Rafał Powalski, Łukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michał Pietruszka, Gabriela Pałka
2021-02-18
UnibucKernel: Geolocating Swiss German Jodels Using Ensemble Learning
Mihaela Gaman, Sebastian Cojocariu, Radu Tudor Ionescu
2021-02-18
Training Large-Scale News Recommenders with Pretrained Language Models in the Loop
Shitao Xiao, Zheng Liu, Yingxia Shao, Tao Di, Xing Xie
2021-02-18
Densely Nested Top-Down Flows for Salient Object Detection
Chaowei Fang, HaiBin Tian, Dingwen Zhang, Qiang Zhang, Jungong Han, Junwei Han
2021-02-18
Quiz-Style Question Generation for News Stories
Adam D. Lelkes, Vinh Q. Tran, Cong Yu
2021-02-18
On Connectivity of Solutions in Deep Learning: The Role of Over-parameterization and Feature Quality
Quynh Nguyen, Pierre Brechet, Marco Mondelli
2021-02-18
SciDr at SDU-2020: IDEAS -- Identifying and Disambiguating Everyday Acronyms for Scientific Domain
Aadarsh Singh, Priyanshu Kumar
2021-02-17
Leveraging Query Resolution and Reading Comprehension for Conversational Passage Retrieval
Svitlana Vakulenko, Nikos Voskarides, Zhucheng Tu, Shayne Longpre
2021-02-17
A Dataset and Benchmark for Malaria Life-Cycle Classification in Thin Blood Smear Images
Qazi Ammar Arshad, Mohsen Ali, Saeed-Ul Hassan, Chen Chen, Ayisha Imran, Ghulam Rasul, Waqas Sultani
2021-02-17
Rethinking Co-design of Neural Architectures and Hardware Accelerators
Yanqi Zhou, Xuanyi Dong, Berkin Akin, Mingxing Tan, Daiyi Peng, Tianjian Meng, Amir Yazdanbakhsh, Da Huang, Ravi Narayanaswami, James Laudon
2021-02-17
THEaiTRE 1.0: Interactive generation of theatre play scripts
Rudolf Rosa, Tomáš Musil, Ondřej Dušek, Dominik Jurko, Patrícia Schmidtová, David Mareček, Ondřej Bojar, Tom Kocmi, Daniel Hrbek, David Košťák, Martina Kinská, Marie Nováková, Josef Doležal, Klára Vosecká, Tomáš Studeník, Petr Žabka
2021-02-17
Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters
Aston Zhang, Yi Tay, Shuai Zhang, Alvin Chan, Anh Tuan Luu, Siu Cheung Hui, Jie Fu
2021-02-17
TCN: Table Convolutional Network for Web Table Interpretation
Daheng Wang, Prashant Shiralkar, Colin Lockard, Binxuan Huang, Xin Luna Dong, Meng Jiang
2021-02-17
Non-Autoregressive Text Generation with Pre-trained Language Models
Yixuan Su, Deng Cai, Yan Wang, David Vandyke, Simon Baker, Piji Li, Nigel Collier
2021-02-16
Exploring Transformers in Natural Language Generation: GPT, BERT, and XLNet
M. Onat Topal, Anil Bas, Imke van Heerden
2021-02-16
TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
Zhuohan Li, Siyuan Zhuang, Shiyuan Guo, Danyang Zhuo, Hao Zhang, Dawn Song, Ion Stoica
2021-02-16
Improving Bayesian Inference in Deep Neural Networks with Variational Structured Dropout
Son Nguyen, Duong Nguyen, Khai Nguyen, Nhat Ho, Khoat Than, Hung Bui
2021-02-16
Have Attention Heads in BERT Learned Constituency Grammar?
Ziyang Luo
2021-02-16
Revisiting Language Encoding in Learning Multilingual Representations
Shengjie Luo, Kaiyuan Gao, Shuxin Zheng, Guolin Ke, Di He, LiWei Wang, Tie-Yan Liu
2021-02-16
GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training
Chen Zhu, Renkun Ni, Zheng Xu, Kezhi Kong, W. Ronny Huang, Tom Goldstein
2021-02-16
Training Larger Networks for Deep Reinforcement Learning
Kei Ota, Devesh K. Jha, Asako Kanezaki
2021-02-16
An AutoML-based Approach to Multimodal Image Sentiment Analysis
Vasco Lopes, António Gaspar, Luís A. Alexandre, João Cordeiro
2021-02-16
The corruptive force of AI-generated advice
Margarita Leib, Nils C. Köbis, Rainer Michael Rilke, Marloes Hagens, Bernd Irlenbusch
2021-02-15
Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm
Laria Reynolds, Kyle McDonell
2021-02-15
Fast End-to-End Speech Recognition via a Non-Autoregressive Model and Cross-Modal Knowledge Transferring from BERT
Ye Bai, Jiangyan Yi, JianHua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang
2021-02-15
DOBF: A Deobfuscation Pre-Training Objective for Programming Languages
Baptiste Roziere, Marie-Anne Lachaux, Marc Szafraniec, Guillaume Lample
2021-02-15
Improved Customer Transaction Classification using Semi-Supervised Knowledge Distillation
Rohan Sukumaran
2021-02-15
Federated Dropout Learning for Hybrid Beamforming With Spatial Path Index Modulation In Multi-User mmWave-MIMO Systems
Ahmet M. Elbir, Sinem Coleri, Kumar Vijay Mishra
2021-02-15
Detection and severity classification of COVID-19 in CT images using deep learning
Yazan Qiblawey, Anas Tahir, Muhammad E. H. Chowdhury, Amith Khandakar, Serkan Kiranyaz, Tawsifur Rahman, Nabil Ibtehaz, Sakib Mahmud, Somaya Al-Madeed, Farayi Musharavati
2021-02-15
Colored Kimia Path24 Dataset: Configurations and Benchmarks with Deep Embeddings
Sobhan Shafiei, Morteza Babaie, Shivam Kalra, H. R. Tizhoosh
2021-02-15
Translational Equivariance in Kernelizable Attention
Max Horn, Kumar Shridhar, Elrich Groenewald, Philipp F. M. Baumann
2021-02-15
Within-Document Event Coreference with BERT-Based Contextualized Representations
Shafiuddin Rehan Ahmed, James H. Martin
2021-02-15
indicnlp@kgp at DravidianLangTech-EACL2021: Offensive Language Identification in Dravidian Languages
Kushal Kedia, Abhilash Nandy
2021-02-14
Naturalizing Neuromorphic Vision Event Streams Using GANs
Dennis Robey, Wesley Thio, Herbert Iu, Jason Eshraghian
2021-02-14
Doping: A technique for efficient compression of LSTM models using sparse structured additive matrices
Urmish Thakker, Paul N. Whatmough, ZhiGang Liu, Matthew Mattina, Jesse Beu
2021-02-14
Optimizing Inference Performance of Transformers on CPUs
Dave Dice, Alex Kogan
2021-02-12
Dancing along Battery: Enabling Transformer with Run-time Reconfigurability on Mobile Devices
Yuhong Song, Weiwen Jiang, Bingbing Li, Panjie Qi, Qingfeng Zhuge, Edwin Hsing-Mean Sha, Sakyasingha Dasgupta, Yiyu Shi, Caiwen Ding
2021-02-12
Dynamic Precision Analog Computing for Neural Networks
Sahaj Garg, Joe Lou, Anirudh Jain, Mitchell Nahmias
2021-02-12
Multiversal views on language models
Laria Reynolds, Kyle McDonell
2021-02-12
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention
Peng Liu, Yuewen Cao, Songxiang Liu, Na Hu, Guangzhi Li, Chao Weng, Dan Su
2021-02-12
Improving Zero-shot Neural Machine Translation on Language-specific Encoders-Decoders
Junwei Liao, Yu Shi, Ming Gong, Linjun Shou, Hong Qu, Michael Zeng
2021-02-12
Transformer Language Models with LSTM-based Cross-utterance Information Representation
G. Sun, C. Zhang, P. C. Woodland
2021-02-12
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits
Leonid Boytsov, Zico Kolter
2021-02-12
Characterizing English Variation across Social Media Communities with BERT
Li Lucy, David Bamman
2021-02-12
Multimodal Punctuation Prediction with Contextual Dropout
Andrew Silva, Barry-John Theobald, Nicholas Apostoloff
2021-02-12
The Benefit of the Doubt: Uncertainty Aware Sensing for Edge Computing Platforms
Lorena Qendro, Jagmohan Chauhan, Alberto Gil C. P. Ramos, Cecilia Mascolo
2021-02-11
Proof Artifact Co-training for Theorem Proving with Language Models
Jesse Michael Han, Jason Rute, Yuhuai Wu, Edward W. Ayers, Stanislas Polu
2021-02-11
Text Compression-aided Transformer Encoding
Zuchao Li, Zhuosheng Zhang, Hai Zhao, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita
2021-02-11
Quartile-based Prediction of Event Types and Event Time in Business Processes using Deep Learning
Ishwar Venugopal
2021-02-11
NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting
Kai Chen, Guang Chen, Dan Xu, Lijun Zhang, Yuyao Huang, Alois Knoll
2021-02-10
Searching for Fast Model Families on Datacenter Accelerators
Sheng Li, Mingxing Tan, Ruoming Pang, Andrew Li, Liqun Cheng, Quoc Le, Norman P. Jouppi
2021-02-10
Pruning of Convolutional Neural Networks Using Ising Energy Model
Hojjat Salehinejad, Shahrokh Valaee
2021-02-10
Emojis Predict Dropouts of Remote Workers: An Empirical Study of Emoji Usage on GitHub
Xuan Lu, Wei Ai, Zhenpeng Chen, Yanbin Cao, Xuanzhe Liu, Qiaozhu Mei
2021-02-10
Enhancing Real-World Adversarial Patches with 3D Modeling Techniques
Yael Mathov, Lior Rokach, Yuval Elovici
2021-02-10
Joint Intent Detection and Slot Filling with Wheel-Graph Attention Networks
Pengfei Wei, Bi Zeng, Wenxiong Liao
2021-02-09
Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers
Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Erfan Loweimi, Peter Bell, Steve Renals
2021-02-09
Distribution Adaptive INT8 Quantization for Training CNNs
Kang Zhao, Sida Huang, Pan Pan, Yinghan Li, Yingya Zhang, Zhenyu Gu, Yinghui Xu
2021-02-09
Conversational Query Rewriting with Self-supervised Learning
Hang Liu, Meng Chen, Youzheng Wu, Xiaodong He, BoWen Zhou
2021-02-09
Bayesian Transformer Language Models for Speech Recognition
Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng
2021-02-09
NewsBERT: Distilling Pre-trained Language Model for Intelligent News Application
Chuhan Wu, Fangzhao Wu, Yang Yu, Tao Qi, Yongfeng Huang, Qi Liu
2021-02-09
AuGPT: Dialogue with Pre-trained Language Models and Data Augmentation
Jonáš Kulhánek, Vojtěch Hudeček, Tomáš Nekvinda, Ondřej Dušek
2021-02-09
Point Cloud Transformers applied to Collider Physics
Vinicius Mikuni, Florencia Canelli
2021-02-09
Transfer Learning Approach for Arabic Offensive Language Detection System -- BERT-Based Model
Fatemah Husain, Ozlem Uzuner
2021-02-09
Generating Fake Cyber Threat Intelligence Using Transformer-Based Models
Priyanka Ranade, Aritran Piplai, Sudip Mittal, Anupam Joshi, Tim Finin
2021-02-08
Exploiting epistemic uncertainty of the deep learning models to generate adversarial samples
Omer Faruk Tuna, Ferhat Ozgur Catak, M. Taner Eskil
2021-02-08
APS: A Large-Scale Multi-Modal Indoor Camera Positioning System
Ali Ghofrani, Rahil Mahdian Toroghi, Seyed Mojtaba Tabatabaie
2021-02-08
How True is GPT-2? An Empirical Analysis of Intersectional Occupational Biases
Hannah Kirk, Yennie Jun, Haider Iqbal, Elias Benussi, Filippo Volpin, Frederic A. Dreyer, Aleksandar Shtedritski, Yuki M. Asano
2021-02-08
Colorization Transformer
Manoj Kumar, Dirk Weissenborn, Nal Kalchbrenner
2021-02-08
TransReID: Transformer-based Object Re-Identification
Shuting He, Hao Luo, Pichao Wang, Fan Wang, Hao Li, Wei Jiang
2021-02-08
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
Jieneng Chen, Yongyi Lu, Qihang Yu, Xiangde Luo, Ehsan Adeli, Yan Wang, Le Lu, Alan L. Yuille, Yuyin Zhou
2021-02-08
A Hybrid Task-Oriented Dialog System with Domain and Task Adaptive Pretraining
Boliang Zhang, Ying Lyu, Ning Ding, Tianhao Shen, Zhaoyang Jia, Kun Han, Kevin Knight
2021-02-08
Wake Word Detection with Streaming Transformers
Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur
2021-02-08
Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention
Yunyang Xiong, Zhanpeng Zeng, Rudrasis Chakraborty, Mingxing Tan, Glenn Fung, Yin Li, Vikas Singh
2021-02-07
Spoiler Alert: Using Natural Language Processing to Detect Spoilers in Book Reviews
Allen Bao, Marshall Ho, Saarthak Sangamnerkar
2021-02-07
Neural Data-to-Text Generation with LM-based Text Augmentation
Ernie Chang, Xiaoyu Shen, Dawei Zhu, Vera Demberg, Hui Su
2021-02-06
Jointly Improving Language Understanding and Generation with Quality-Weighted Weak Supervision of Automatic Labeling
Ernie Chang, Vera Demberg, Alex Marin
2021-02-06
baller2vec: A Multi-Entity Transformer For Multi-Agent Spatiotemporal Modeling
Michael A. Alcorn, Anh Nguyen
2021-02-05
PipeTransformer: Automated Elastic Pipelining for Distributed Training of Transformers
Chaoyang He, Shen Li, Mahdi Soltanolkotabi, Salman Avestimehr
2021-02-05
RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER
Lin Sun, Jiquan Wang, Kai Zhang, Yindu Su, Fangsheng Weng
2021-02-05
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
Wonjae Kim, Bokyung Son, Ildoo Kim
2021-02-05
Understanding Emails and Drafting Responses -- An Approach Using GPT-3
Jonas Thiergart, Stefan Huber, Thomas Übellacker
2021-02-05
Adaptive Semiparametric Language Models
Dani Yogatama, Cyprien de Masson d'Autume, Lingpeng Kong
2021-02-04
Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models
Alex Tamkin, Miles Brundage, Jack Clark, Deep Ganguli
2021-02-04
1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed
Hanlin Tang, Shaoduo Gan, Ammar Ahmad Awan, Samyam Rajbhandari, Conglong Li, Xiangru Lian, Ji Liu, Ce Zhang, Yuxiong He
2021-02-04
Hierarchical Multi-head Attentive Network for Evidence-aware Fake News Detection
Nguyen Vo, Kyumin Lee
2021-02-04
A Bayesian Neural Network based on Dropout Regulation
Claire Theobald, Frédéric Pennerath, Brieuc Conan-Guez, Miguel Couceiro, Amedeo Napoli
2021-02-03
Pitfalls of Static Language Modelling
Angeliki Lazaridou, Adhiguna Kuncoro, Elena Gribovskaya, Devang Agrawal, Adam Liska, Tayfun Terzi, Mai Gimenez, Cyprien de Masson d'Autume, Sebastian Ruder, Dani Yogatama, Kris Cao, Tomas Kocisky, Susannah Young, Phil Blunsom
2021-02-03
Bootstrapping Multilingual AMR with Contextual Word Alignments
Janaki Sheth, Young-suk Lee, Ramon Fernandez Astudillo, Tahira Naseem, Radu Florian, Salim Roukos, Todd Ward
2021-02-03
HeBERT & HebEMO: a Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition
Avihay Chriqui, Inbal Yahav
2021-02-03
Neural Transfer Learning with Transformers for Social Science Text Analysis
Sandra Wankmüller
2021-02-03
Relaxed Transformer Decoders for Direct Action Proposal Generation
Jing Tan, Jiaqi Tang, LiMin Wang, Gangshan Wu
2021-02-03
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram
Shengkui Zhao, Hao Wang, Trung Hieu Nguyen, Bin Ma
2021-02-03
MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records
Zhen Xu, David R. So, Andrew M. Dai
2021-02-03
Single Model Deep Learning on Imbalanced Small Datasets for Skin Lesion Classification
Peng Yao, Shuwei Shen, Mengjuan Xu, Peng Liu, Fan Zhang, Jinyu Xing, Pengfei Shao, Benjamin Kaffenberger, Ronald X. Xu
2021-02-02
Clickbait Headline Detection in Indonesian News Sites using Multilingual Bidirectional Encoder Representations from Transformers (M-BERT)
Muhammad N. Fakhruzzaman, Saidah Z. Jannah, Ratih A. Ningrum, Indah Fahmiyah
2021-02-02
AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning
YuHan Liu, Saurabh Agarwal, Shivaram Venkataraman
2021-02-02
PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong, Yu-An Chung, James Glass
2021-02-02
ConvNets for Counting: Object Detection of Transient Phenomena in Steelpan Drums
Scott H. Hawley, Andrew C. Morrison
2021-02-01
SJ_AJ@DravidianLangTech-EACL2021: Task-Adaptive Pre-Training of Multilingual BERT models for Offensive Language Identification
Sai Muralidhar Jayanthi, Akshat Gupta
2021-02-01
Text-to-hashtag Generation using Seq2seq Learning
Augusto Camargo, Wesley Carvalho, Felipe Peressim
2021-02-01
Automated Query Reformulation for Efficient Search based on Query Logs From Stack Overflow
Kaibo Cao, Chunyang Chen, Sebastian Baltes, Christoph Treude, Xiang Chen
2021-02-01
Neural Network architectures to classify emotions in Indian Classical Music
Uddalok Sarkar, Sayan Nag, Medha Basu, Archi Banerjee, Shankha Sanyal, Ranjan Sengupta, Dipak Ghosh
2021-02-01
GTAE: Graph-Transformer based Auto-Encoders for Linguistic-Constrained Text Style Transfer
Yukai Shi, Sen Zhang, Chenxing Zhou, Xiaodan Liang, Xiaojun Yang, Liang Lin
2021-02-01
Polyphone Disambiguition in Mandarin Chinese with Semi-Supervised Learning
Yi Shi, Congyi Wang, Yu Chen, Bin Wang
2021-02-01
Scaling Federated Learning for Fine-tuning of Large Language Models
Agrin Hilmkil, Sebastian Callh, Matteo Barbieri, Leon René Sütfeld, Edvin Listo Zec, Olof Mogren
2021-02-01
Improving Distantly-Supervised Relation Extraction through BERT-based Label & Instance Embeddings
Despina Christou, Grigorios Tsoumakas
2021-02-01
"Is depression related to cannabis?": A knowledge-infused model for Entity and Relation Extraction with Limited Supervision
Kaushik Roy, Usha Lokala, Vedant Khandelwal, Amit Sheth
2021-02-01
Computational Performance Predictions for Deep Neural Network Training: A Runtime-Based Approach
Geoffrey X. Yu, Yubo Gao, Pavel Golikov, Gennady Pekhimenko
2021-01-31
Characterizing Student Engagement Moods for Dropout Prediction in Question Pool Websites
Reza Hadi Mogavi, Xiaojuan Ma, Pan Hui
2021-01-31
Fine-tuning Handwriting Recognition systems with Temporal Dropout
Edgard Chammas, Chafic Mokbel
2021-01-31
Classification of Shoulder X-Ray Images with Deep Learning Ensemble Models
Fatih Uysal, Fırat Hardalaç, Ozan Peker, Tolga Tolunay, Nil Tokgöz
2021-01-31
Short Text Clustering with Transformers
Leonid Pugachev, Mikhail Burtsev
2021-01-31
ICodeNet -- A Hierarchical Neural Network Approach for Source Code Author Identification
Pranali Bora, Tulika Awalgaonkar, Himanshu Palve, Raviraj Joshi, Purvi Goel
2021-01-30
EmpathBERT: A BERT-based Framework for Demographic-aware Empathy Prediction
Bhanu Prakash Reddy Guda, Aparna Garimella, Niyati Chhaya
2021-01-30
ShufText: A Simple Black Box Approach to Evaluate the Fragility of Text Classification Models
Rutuja Taware, Shraddha Varat, Gaurav Salunke, Chaitanya Gawande, Geetanjali Kale, Rahul Khengare, Raviraj Joshi
2021-01-30
ObjectAug: Object-level Data Augmentation for Semantic Image Segmentation
Jiawei Zhang, Yanchun Zhang, Xiaowei Xu
2021-01-30
Learning From How Human Correct
Tong Guo
2021-01-30
Speech Recognition by Simply Fine-tuning BERT
Wen-Chin Huang, Chia-Hua Wu, Shang-Bao Luo, Kuan-Yu Chen, Hsin-Min Wang, Tomoki Toda
2021-01-30
Segmentation of skin lesions and their attributes using Generative Adversarial Networks
Cristian Lazo
2021-01-30
Transition based Graph Decoder for Neural Machine Translation
Leshem Choshen, Omri Abend
2021-01-29
Synthesizing Monolingual Data for Neural Machine Translation
Benjamin Marie, Atsushi Fujita
2021-01-29
Fine-tuning BERT-based models for Plant Health Bulletin Classification
Shufan Jiang, Rafael Angarita, Stephane Cormier, Francis Rousseaux
2021-01-29
BERTaú: Itaú BERT for digital customer service
Paulo Finardi, José Dié Viegas, Gustavo T. Ferreira, Alex F. Mansano, Vinicius F. Caridá
2021-01-28
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Li Yuan, Yunpeng Chen, Tao Wang, Weihao Yu, Yujun Shi, Zihang Jiang, Francis EH Tay, Jiashi Feng, Shuicheng Yan
2021-01-28
A Graph-based Relevance Matching Model for Ad-hoc Retrieval
Yufeng Zhang, Jinghao Zhang, Zeyu Cui, Shu Wu, Liang Wang
2021-01-28
LSTM-SAKT: LSTM-Encoded SAKT-like Transformer for Knowledge Tracing
Takashi Oya, Shigeo Morishima
2021-01-28
On the Evolution of Syntactic Information Encoded by BERT's Contextualized Representations
Laura Pérez-Mayos, Roberto Carlini, Miguel Ballesteros, Leo Wanner
2021-01-27
Spatial-Channel Transformer Network for Trajectory Prediction on the Traffic Scenes
Jingwen Zhao, Xuanpeng Li, Qifan Xue, Weigong Zhang
2021-01-27
KoreALBERT: Pretraining a Lite BERT Model for Korean Language Understanding
Hyunjae Lee, Jaewoong Yoon, Bonggyu Hwang, Seongho Joe, Seungjai Min, Youngjune Gwon
2021-01-27
An explainable Transformer-based deep learning model for the prediction of incident heart failure
Shishir Rao, Yikuan Li, Rema Ramakrishnan, Abdelaali Hassaine, Dexter Canoy, John Cleland, Thomas Lukasiewicz, Gholamreza Salimi-Khorshidi, Kazem Rahimi
2021-01-27
Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression
Yufei Cui, Ziquan Liu, Qiao Li, Yu Mao, Antoni B. Chan, Chun Jason Xue
2021-01-27
Exploring multi-task multi-lingual learning of transformer models for hate speech and offensive speech identification in social media
Sudhanshu Mishra, Shivangi Prasad, Shubhanshu Mishra
2021-01-27
G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for Biomarker Identification and Disease Classification
Sayan Ghosal, Qiang Chen, Giulio Pergola, Aaron L. Goldman, William Ulrich, Karen F. Berman, Giuseppe Blasi, Leonardo Fazio, Antonio Rampino, Alessandro Bertolino, Daniel R. Weinberger, Venkata S. Mattay, Archana Venkataraman
2021-01-27
Attention Can Reflect Syntactic Structure (If You Let It)
Vinit Ravishankar, Artur Kulmizev, Mostafa Abdou, Anders Søgaard, Joakim Nivre
2021-01-26
CPTR: Full Transformer Network for Image Captioning
Wei Liu, Sihan Chen, Longteng Guo, Xinxin Zhu, Jing Liu
2021-01-26
Regulatory Compliance through Doc2Doc Information Retrieval: A case study in EU/UK legislation where text similarity has limitations
Ilias Chalkidis, Manos Fergadiotis, Nikolaos Manginas, Eva Katakalou, Prodromos Malakasiotis
2021-01-26
Evaluation of BERT and ALBERT Sentence Embedding Performance on Downstream NLP Tasks
Hyunjin Choi, Judong Kim, Seongho Joe, Youngjune Gwon
2021-01-26
Analyzing Zero-shot Cross-lingual Transfer in Supervised NLP Tasks
Hyunjin Choi, Judong Kim, Seongho Joe, Seungjai Min, Youngjune Gwon
2021-01-26
Variational Information Bottleneck Model for Accurate Indoor Position Recognition
Weizhu Qian, Franck Gechter
2021-01-26
CLiMP: A Benchmark for Chinese Language Model Evaluation
Beilei Xiang, Changbing Yang, Yu Li, Alex Warstadt, Katharina Kann
2021-01-26
Named Entity Recognition in the Style of Object Detection
Bing Li
2021-01-26
First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT
Benjamin Muller, Yanai Elazar, Benoît Sagot, Djamé Seddah
2021-01-26
Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT
Isabel Papadimitriou, Ethan A. Chi, Richard Futrell, Kyle Mahowald
2021-01-26
Uncertainty aware and explainable diagnosis of retinal disease
Amitojdeep Singh, Sourya Sengupta, Mohammed Abdul Rasheed, Varadharajan Jayakumar, Vasudevan Lakshminarayanan
2021-01-26
A Hybrid Approach to Measure Semantic Relatedness in Biomedical Concepts
Katikapalli Subramanyam Kalyan, Sivanesan Sangeetha
2021-01-25
Randomized Deep Structured Prediction for Discourse-Level Processing
Manuel Widmoser, Maria Leonor Pacheco, Jean Honorio, Dan Goldwasser
2021-01-25
Stereotype and Skew: Quantifying Gender Bias in Pre-trained and Fine-tuned Language Models
Daniel de Vassimon Manela, David Errington, Thomas Fisher, Boris van Breugel, Pasquale Minervini
2021-01-24
cGANs for Cartoon to Real-life Images
Pranjal Singh Rajput, Kanya Satis, Sonnya Dellarosa, Wenxuan Huang, Obinna Agba
2021-01-24
Multi-Task Time Series Forecasting With Shared Attention
Zekai Chen, Jiaze E, Xiao Zhang, Hao Sheng, Xiuzheng Cheng
2021-01-24
Does Dialog Length matter for Next Response Selection task? An Empirical Study
Jatin Ganhotra, Sachindra Joshi
2021-01-24
RomeBERT: Robust Training of Multi-Exit BERT
Shijie Geng, Peng Gao, Zuohui Fu, Yongfeng Zhang
2021-01-24
DenseNet for Breast Tumor Classification in Mammographic Images
Yuliana Jiménez Gaona, María José Rodriguez-Alvarez, Hector Espinó Morató, Darwin Castillo Malla, Vasudevan Lakshminarayanan
2021-01-24
Training Multilingual Pre-trained Language Model with Byte-level Subwords
Junqiu Wei, Qun Liu, Yinpeng Guo, Xin Jiang
2021-01-23
Extracting Lifestyle Factors for Alzheimer's Disease from Clinical Notes Using Deep Learning with Weak Supervision
Zitao Shen, Yoonkwon Yi, Anusha Bompelli, Fang Yu, Yanshan Wang, Rui Zhang
2021-01-22
Artificial intelligence prediction of stock prices using social media
Kavyashree Ranawat, Stefano Giani
2021-01-22
Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation
Ye Liu, Yao Wan, Jian-Guo Zhang, Wenting Zhao, Philip S. Yu
2021-01-22
Solving the Same-Different Task with Convolutional Neural Networks
Nicola Messina, Giuseppe Amato, Fabio Carrara, Claudio Gennaro, Fabrizio Falchi
2021-01-22
HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection
Suman Dowlagar, Radhika Mamidi
2021-01-22
Multilingual Pre-Trained Transformers and Convolutional NN Classification Models for Technical Domain Identification
Suman Dowlagar, Radhika Mamidi
2021-01-22
A multi-perspective combined recall and rank framework for Chinese procedure terminology normalization
Ming Liang, Kui Xue, Tong Ruan
2021-01-22
The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT
Madhura Pande, Aakriti Budhraja, Preksha Nema, Pratyush Kumar, Mitesh M. Khapra
2021-01-22
Drug and Disease Interpretation Learning with Biomedical Entity Representation Transformer
Zulfat Miftahutdinov, Artur Kadurin, Roman Kudrin, Elena Tutubalina
2021-01-22
BERT Transformer model for Detecting Arabic GPT2 Auto-Generated Tweets
Fouzi Harrag, Maria Debbah, Kareem Darwish, Ahmed Abdelali
2021-01-22
DAF:re: A Challenging, Crowd-Sourced, Large-Scale, Long-Tailed Dataset For Anime Character Recognition
Edwin Arkel Rios, Wen-Huang Cheng, Bo-Cheng Lai
2021-01-21
Activity Graph Transformer for Temporal Action Localization
Megha Nawhal, Greg Mori
2021-01-21
Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval
Robert Litschko, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš
2021-01-21
Classifying Scientific Publications with BERT -- Is Self-Attention a Feature Selection Method?
Andres Garcia-Silva, Jose Manuel Gomez-Perez
2021-01-20
Divide and Conquer: An Ensemble Approach for Hostile Post Detection in Hindi
Varad Bhatnagar, Prince Kumar, Sairam Moghili, Pushpak Bhattacharyya
2021-01-20
Learning to Augment for Data-Scarce Domain BERT Knowledge Distillation
Lingyun Feng, Minghui Qiu, Yaliang Li, Hai-Tao Zheng, Ying Shen
2021-01-20
Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates
Artem Shelmanov, Dmitri Puzyrev, Lyubov Kupriyanova, Denis Belyakov, Daniil Larionov, Nikita Khromov, Olga Kozlova, Ekaterina Artemova, Dmitry V. Dylov, Alexander Panchenko
2021-01-20
PGT: Pseudo Relevance Feedback Using a Graph-Based Transformer
HongChien Yu, Zhuyun Dai, Jamie Callan
2021-01-20
UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers
Siyi Hu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang
2021-01-20
Open-Domain Conversational Search Assistant with Transformers
Rafael Ferreira, Mariana Leite, David Semedo, Joao Magalhaes
2021-01-20
Situation and Behavior Understanding by Trope Detection on Films
Chen-Hsi Chang, Hung-Ting Su, Jui-heng Hsu, Yu-Siang Wang, Yu-Cheng Chang, Zhe Yu Liu, Ya-Liang Chang, Wen-Feng Cheng, Ke-Jyun Wang, Winston H. Hsu
2021-01-19
Towards Facilitating Empathic Conversations in Online Mental Health Support: A Reinforcement Learning Approach
ASHISH SHARMA, Inna W. Lin, Adam S. Miner, David C. Atkins, Tim Althoff
2021-01-19
Deep Learning Models for Calculation of Cardiothoracic Ratio from Chest Radiographs for Assisted Diagnosis of Cardiomegaly
Tanveer Gupte, Mrunmai Niljikar, Manish Gawali, Viraj Kulkarni, Amit Kharat, Aniruddha Pant
2021-01-19
Fast Convergence of DETR with Spatially Modulated Co-Attention
Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li
2021-01-19
Inference for BART with Multinomial Outcomes
Yizhen Xu, Joseph W. Hogan, Michael J. Daniels, Rami Kantor, Ann Mwangi
2021-01-18
TLU-Net: A Deep Learning Approach for Automatic Steel Surface Defect Detection
Praveen Damacharla, Achuth Rao M. V., Jordan Ringenberg, Ahmad Y Javaid
2021-01-18
Can a Fruit Fly Learn Word Embeddings?
Yuchen Liang, Chaitanya K. Ryali, Benjamin Hoover, Leopold Grinberg, Saket Navlakha, Mohammed J. Zaki, Dmitry Krotov
2021-01-18
Automatic punctuation restoration with BERT models
Attila Nagy, Bence Bial, Judit Ács
2021-01-18
Energy-based Dropout in Restricted Boltzmann Machines: Why not go random
Mateus Roder, Gustavo H. de Rosa, Victor Hugo C. de Albuquerque, André L. D. Rossi, João P. Papa
2021-01-17
Cost-Efficient Online Hyperparameter Optimization
Jingkang Wang, Mengye Ren, Ilija Bogunovic, Yuwen Xiong, Raquel Urtasun
2021-01-17
Dual-Level Collaborative Transformer for Image Captioning
Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji