no code implementations • 14 Sep 2023 • Pu Miao, Zeyao Du, Junlin Zhang
Contrastive learning schemes such as SimCSE and ConSERT have already been successfully adopted for unsupervised sentence embedding, improving embedding quality by reducing this bias.
2 code implementations • NeurIPS 2023 • Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira
We propose a novel multimodal video benchmark - the Perception Test - to evaluate the perception and reasoning skills of pre-trained multimodal models (e.g., Flamingo, SeViLA, or GPT-4).
1 code implementation • 25 Oct 2022 • PengTao Zhang, Junlin Zhang
In this paper, we propose multi-Hash Codebook NETwork (HCNet) as the memory mechanism for efficiently learning and memorizing representations of cross features in CTR tasks.
Ranked #1 on Click-Through Rate Prediction on KDD12
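The HCNet entry above describes a multi-hash codebook as a memory for cross-feature representations. The following is a minimal, hypothetical sketch of the lookup idea only: a cross feature is mapped to several codebook slots via independent hash functions, and the retrieved codewords jointly represent it. All names, sizes, and the choice of md5 are illustrative assumptions, not the paper's design.

```python
import hashlib

CODEBOOK_SIZE = 8  # slots per codebook (tiny, for illustration)
NUM_HASHES = 2     # number of independent hash functions

def hash_slot(cross_feature: str, seed: int) -> int:
    """Map a cross-feature string to a codebook slot for a given hash seed."""
    digest = hashlib.md5(f"{seed}:{cross_feature}".encode()).hexdigest()
    return int(digest, 16) % CODEBOOK_SIZE

def lookup(cross_feature: str) -> list[int]:
    """Return the slot indices whose codewords jointly represent the feature."""
    return [hash_slot(cross_feature, seed) for seed in range(NUM_HASHES)]

# Example: a user-item cross feature maps deterministically to two slots.
slots = lookup("user_42|item_7")
```

Because the mapping is hash-based, arbitrarily many cross features share a fixed-size codebook, which is what makes the memory efficient.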
1 code implementation • DeepMind 2022 • Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Skanda Koppula, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira
We propose a novel multimodal benchmark – the Perception Test – that aims to extensively evaluate perception and reasoning skills of multimodal models.
4 code implementations • 12 Sep 2022 • PengTao Zhang, Zheng Zheng, Junlin Zhang
Click-Through Rate (CTR) estimation has become one of the most fundamental tasks in many real-world applications and various deep models have been proposed.
Ranked #16 on Click-Through Rate Prediction on Criteo
no code implementations • 26 Jul 2021 • Qingyun She, Zhiqiang Wang, Junlin Zhang
For example, a continuous feature is usually transformed into its power forms, which are added as new features so that the model can easily form non-linear functions of the original feature.
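The power-form transformation mentioned above can be sketched in a few lines; the feature naming below is illustrative, not taken from the paper.

```python
# Augment a continuous feature with its power forms so that a linear
# model over the augmented features can fit non-linear functions of x.
def add_power_features(x: float, max_power: int = 3) -> dict:
    """Return the original value plus its powers up to max_power."""
    return {f"x^{p}": x ** p for p in range(1, max_power + 1)}

# For x = 2.0 this yields {'x^1': 2.0, 'x^2': 4.0, 'x^3': 8.0}.
features = add_power_features(2.0)
```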
3 code implementations • 26 Jul 2021 • Zhiqiang Wang, Qingyun She, PengTao Zhang, Junlin Zhang
In this paper, we propose a novel CTR framework named ContextNet that implicitly models high-order feature interactions by dynamically refining each feature's embedding according to the input context.
Ranked #15 on Click-Through Rate Prediction on Criteo
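The context-dependent embedding refinement that ContextNet describes can be illustrated with a deliberately simplified sketch: pool all field embeddings into a context vector, then gate each embedding elementwise by it. The mean pooling and sigmoid gate here are stand-ins for the paper's learned layers, not its actual architecture.

```python
import math

def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))

def refine(embeddings: list[list[float]]) -> list[list[float]]:
    """Gate each field embedding elementwise by the mean-pooled context."""
    dim = len(embeddings[0])
    # Context vector: mean of all field embeddings over the fields.
    context = [sum(e[d] for e in embeddings) / len(embeddings) for d in range(dim)]
    # Each embedding is rescaled dimension-wise by a sigmoid gate of the context.
    return [[e[d] * sigmoid(context[d]) for d in range(dim)] for e in embeddings]

refined = refine([[1.0, -1.0], [0.5, 2.0]])
```

Because the gate depends on all fields, every refined embedding implicitly carries information about the whole input, which is what lets stacked refinements model high-order interactions.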
15 code implementations • 9 Feb 2021 • Zhiqiang Wang, Qingyun She, Junlin Zhang
We also turn the feed-forward layer in the DNN model into a mixture of additive and multiplicative feature interactions by proposing MaskBlock in this paper.
Ranked #9 on Click-Through Rate Prediction on Criteo
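The additive-plus-multiplicative mixing that the MaskBlock entry describes can be sketched as an input-dependent mask multiplying a feed-forward output elementwise. The toy weight and the squared-input mask below are illustrative assumptions; the paper's block uses learned projections and normalization.

```python
def mask_block(x: list[float], w: float = 0.5) -> list[float]:
    """Multiplicative instance-guided mask applied to an additive (linear) feed-forward output."""
    hidden = [w * xi for xi in x]   # additive/linear part of the layer
    mask = [xi * xi for xi in x]    # instance-guided mask (toy stand-in)
    return [m * h for m, h in zip(mask, hidden)]

# For x = [1.0, 2.0]: hidden = [0.5, 1.0], mask = [1.0, 4.0] -> [0.5, 4.0].
out = mask_block([1.0, 2.0])
```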
no code implementations • 13 Sep 2020 • Tongwen Huang, Qingyun She, Junlin Zhang
Our proposed model uses a pre-trained Transformer as the base classifier, selects harder training sets for fine-tuning, and gains the benefits of both pre-trained language knowledge and boosting ensembles in NLP tasks.
3 code implementations • 6 Jul 2020 • Tongwen Huang, Qingyun She, Zhiqiang Wang, Junlin Zhang
Inspired by these observations, we propose a novel model named GateNet which introduces either the feature embedding gate or the hidden gate to the embedding layer or hidden layers of DNN CTR models, respectively.
Ranked #21 on Click-Through Rate Prediction on Criteo
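The feature embedding gate mentioned in the GateNet entry can be sketched minimally: each field's embedding is rescaled by a sigmoid gate before entering the rest of the network. The single scalar gate per field and the fixed logit below are simplifying assumptions standing in for learned parameters.

```python
import math

def embedding_gate(embedding: list[float], gate_logit: float) -> list[float]:
    """Scale one field's embedding by a sigmoid gate (one gate value per field)."""
    gate = 1.0 / (1.0 + math.exp(-gate_logit))
    return [gate * e for e in embedding]

# With gate_logit = 0.0 the gate is sigmoid(0) = 0.5, halving the embedding.
gated = embedding_gate([2.0, -4.0], gate_logit=0.0)
```

The hidden gate described in the same entry applies the analogous rescaling to hidden-layer activations instead of embeddings.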
1 code implementation • 23 Jun 2020 • Zhiqiang Wang, Qingyun She, PengTao Zhang, Junlin Zhang
Normalization has become one of the most fundamental components of many deep neural networks for machine learning tasks, and deep neural networks have also been widely used in the CTR estimation field.
31 code implementations • 23 May 2019 • Tongwen Huang, Zhiqi Zhang, Junlin Zhang
In this paper, a new model named FiBiNET (an abbreviation of Feature Importance and Bilinear feature Interaction NETwork) is proposed to dynamically learn feature importance and fine-grained feature interactions.
Ranked #19 on Click-Through Rate Prediction on Criteo
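The bilinear feature interaction named in the FiBiNET entry combines two field embeddings through a weight matrix followed by a Hadamard product. The sketch below shows that computation; the identity weight matrix is a toy stand-in for a learned parameter, and the paper additionally offers field-shared and field-specific weight variants not shown here.

```python
def bilinear_interaction(vi: list[float], vj: list[float], W: list[list[float]]) -> list[float]:
    """p_ij = (vi @ W) ⊙ vj for two field embeddings vi, vj."""
    dim = len(vi)
    # Transform vi by the bilinear weight matrix W.
    transformed = [sum(vi[k] * W[k][d] for k in range(dim)) for d in range(dim)]
    # Hadamard (elementwise) product with vj.
    return [transformed[d] * vj[d] for d in range(dim)]

identity = [[1.0, 0.0], [0.0, 1.0]]  # toy weight matrix
# With the identity weight this reduces to the plain Hadamard product.
p = bilinear_interaction([1.0, 2.0], [3.0, 4.0], identity)
```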
12 code implementations • 15 May 2019 • Junlin Zhang, Tongwen Huang, Zhiqi Zhang
Although some CTR models, such as the Attentional Factorization Machine (AFM), have been proposed to model the weights of second-order interaction features, we posit that evaluating feature importance before the explicit feature interaction procedure is also important for CTR prediction tasks. When a task has many input features, the model can then learn to selectively highlight the informative features and suppress the less useful ones.
Ranked #18 on Click-Through Rate Prediction on Criteo
no code implementations • 29 Sep 2016 • Lei Shen, Junlin Zhang
Recurrent Neural Networks have achieved state-of-the-art results on many NLP problems, and the two most popular RNN architectures are the Tail Model and the Pooling Model.
no code implementations • 23 Mar 2015 • Junlin Zhang, Jose Garcia
To solve this problem, we propose to learn a new classifier by adding an adaptation function to the base classifier and updating the adaptation function's parameters according to the streaming data samples.