Search Results for author: Shijing Si

Found 29 papers, 4 papers with code

Evaluating the Performance of ChatGPT for Spam Email Detection

no code implementations23 Feb 2024 Yuwei Wu, Shijing Si, Yugui Zhang, Jiawen Gu, Jedrek Wosik

To fill in the gap, this study attempts to evaluate ChatGPT's capabilities for spam identification in both English and Chinese email datasets.

In-Context Learning Question Answering +1

Exploring the Capabilities of ChatGPT in Ancient Chinese Translation and Person Name Recognition

no code implementations23 Dec 2023 Shijing Si, Siqing Zhou, Le Tang, Xiaoqing Cheng, Yugui Zhang

ChatGPT's proficiency in handling modern standard languages suggests potential for its use in understanding ancient Chinese.

Translation

Revisiting the Role of Label Smoothing in Enhanced Text Sentiment Classification

no code implementations11 Dec 2023 Yijie Gao, Shijing Si, Hua Luo, Haixia Sun, Yugui Zhang

Label smoothing is a widely used technique in various domains, such as text classification, image classification and speech recognition, known for effectively combating model overfitting.

Image Classification Sentiment Analysis +5

On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models

no code implementations15 Mar 2023 Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng, Zhitao Li, Jing Xiao

Deep neural retrieval models have amply demonstrated their power but estimating the reliability of their predictions remains challenging.

Retrieval

Efficient Document Retrieval by End-to-End Refining and Quantizing BERT Embedding with Contrastive Product Quantization

1 code implementation31 Oct 2022 Zexuan Qiu, Qinliang Su, Jianxing Yu, Shijing Si

Efficient document retrieval heavily relies on the technique of semantic hashing, which learns a binary code for every document and employs Hamming distance to evaluate document distances.

Quantization Retrieval

Pushing the Efficiency Limit Using Structured Sparse Convolutions

no code implementations23 Oct 2022 Vinay Kumar Verma, Nikhil Mehta, Shijing Si, Ricardo Henao, Lawrence Carin

Weight pruning is among the most popular approaches for compressing deep convolutional neural networks.

Pose Guided Human Image Synthesis with Partially Decoupled GAN

no code implementations7 Oct 2022 Jianhan Wu, Jianzong Wang, Shijing Si, Xiaoyang Qu, Jing Xiao

Most existing methods encode the texture of the whole reference human image into a latent space, and then utilize a decoder to synthesize the image texture of the target pose.

Long-range modeling Pose Transfer

Machine Unlearning Method Based On Projection Residual

no code implementations30 Sep 2022 Zihao Cao, Jianzong Wang, Shijing Si, Zhangcheng Huang, Jing Xiao

Even when data is removed from the dataset, the effects of these data persist in the model.

Machine Unlearning

RL-MD: A Novel Reinforcement Learning Approach for DNA Motif Discovery

no code implementations30 Sep 2022 Wen Wang, Jianzong Wang, Shijing Si, Zhangcheng Huang, Jing Xiao

The extraction of sequence patterns from a collection of functionally linked unlabeled DNA sequences is known as DNA motif discovery, and it is a key task in computational biology.

reinforcement-learning Reinforcement Learning (RL)

Boosting Star-GANs for Voice Conversion with Contrastive Discriminator

no code implementations21 Sep 2022 Shijing Si, Jianzong Wang, xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao

Nonparallel multi-domain voice conversion methods such as the StarGAN-VCs have been widely applied in many scenarios.

Contrastive Learning Voice Conversion

Debias the Black-box: A Fair Ranking Framework via Knowledge Distillation

no code implementations24 Aug 2022 Zhitao Zhu, Shijing Si, Jianzong Wang, Yaodong Yang, Jing Xiao

Deep neural networks can capture the intricate interaction history information between queries and documents, because of their many complicated nonlinear units, allowing them to provide correct search recommendations.

Fairness Information Retrieval +2

Leveraging Causal Inference for Explainable Automatic Program Repair

no code implementations26 May 2022 Jianzong Wang, Shijing Si, Zhitao Zhu, Xiaoyang Qu, Zhenhou Hong, Jing Xiao

The experiments on four programming languages (Java, C, Python, and JavaScript) show that CPR can generate causal graphs for reasonable interpretations and boost the performance of bug fixing in automatic program repair.

Bug fixing Causal Inference +3

Cali3F: Calibrated Fast Fair Federated Recommendation System

no code implementations26 May 2022 Zhitao Zhu, Shijing Si, Jianzong Wang, Jing Xiao

Specific to recommendation systems, many federated recommendation algorithms have been proposed to realize the privacy-preserving collaborative recommendation.

Fairness Federated Learning +2

A Fair Federated Learning Framework With Reinforcement Learning

no code implementations26 May 2022 Yaqi Sun, Shijing Si, Jianzong Wang, Yuhan Dong, Zhitao Zhu, Jing Xiao

More importantly, we apply the Gini coefficient and validation accuracy of clients in each communication round to construct a reward function for the reinforcement learning.

Fairness Federated Learning +2

Federated Split BERT for Heterogeneous Text Classification

no code implementations26 May 2022 Zhengyang Li, Shijing Si, Jianzong Wang, Jing Xiao

To address this issue, we propose a framework, FedSplitBERT, which handles heterogeneous data and decreases the communication cost by splitting the BERT encoder layers into local part and global part.

Federated Learning Quantization +2

Federated Non-negative Matrix Factorization for Short Texts Topic Modeling with Mutual Information

no code implementations26 May 2022 Shijing Si, Jianzong Wang, Ruiyi Zhang, Qinliang Su, Jing Xiao

Non-negative matrix factorization (NMF) based topic modeling is widely used in natural language processing (NLP) to uncover hidden topics of short text documents.

Federated Learning text-classification +1

Augmentation-induced Consistency Regularization for Classification

no code implementations25 May 2022 Jianhan Wu, Shijing Si, Jianzong Wang, Jing Xiao

In this paper, we propose a consistency regularization framework based on data augmentation, called CR-Aug, which forces the output distributions of different sub models generated by data augmentation to be consistent with each other.

Audio Classification Data Augmentation

Towards Speaker Age Estimation with Label Distribution Learning

no code implementations23 Feb 2022 Shijing Si, Jianzong Wang, Junqing Peng, Jing Xiao

To address this, we utilize the ambiguous information among the age labels, convert each age label into a discrete label distribution and leverage the label distribution learning (LDL) method to fit the data.

Age Classification Age Estimation +2

VU-BERT: A Unified framework for Visual Dialog

no code implementations22 Feb 2022 Tong Ye, Shijing Si, Jianzong Wang, Rui Wang, Ning Cheng, Jing Xiao

The visual dialog task attempts to train an agent to answer multi-turn questions given an image, which requires the deep understanding of interactions between the image and dialog history.

Language Modelling Masked Language Modeling +2

FairFil: Contrastive Neural Debiasing Method for Pretrained Text Encoders

no code implementations ICLR 2021 Pengyu Cheng, Weituo Hao, Siyang Yuan, Shijing Si, Lawrence Carin

Pretrained text encoders, such as BERT, have been applied increasingly in various natural language processing (NLP) tasks, and have recently demonstrated significant performance gains.

Contrastive Learning Fairness +1

Reinforcement Learning for Flexibility Design Problems

no code implementations2 Jan 2021 Yehua Wei, Lei Zhang, Ruiyi Zhang, Shijing Si, Hao Zhang, Lawrence Carin

Flexibility design problems are a class of problems that appear in strategic decision-making across industries, where the objective is to design a ($e. g.$, manufacturing) network that affords flexibility and adaptivity.

Decision Making reinforcement-learning +1

Students Need More Attention: BERT-based AttentionModel for Small Data with Application to AutomaticPatient Message Triage

1 code implementation22 Jun 2020 Shijing Si, Rui Wang, Jedrek Wosik, Hao Zhang, David Dov, Guoyin Wang, Ricardo Henao, Lawrence Carin

Small and imbalanced datasets commonly seen in healthcare represent a challenge when training classifiers based on deep learning models.

Cannot find the paper you are looking for? You can Submit a new open access paper.