Search Results for author: Xingjiao Wu

Found 19 papers, 4 papers with code

FairMonitor: A Dual-framework for Detecting Stereotypes and Biases in Large Language Models

no code implementations • 6 May 2024 • Yanhong Bai, Jiabao Zhao, Jinxin Shi, Zhentao Xie, Xingjiao Wu, Liang He

Detecting stereotypes and biases in Large Language Models (LLMs) is crucial for enhancing fairness and reducing adverse impacts on individuals or groups when these models are applied.

Fairness

Paper
Add Code

UMAAF: Unveiling Aesthetics via Multifarious Attributes of Images

no code implementations • 19 Nov 2023 • Weijie Li, Yitian Wan, Xingjiao Wu, Junjie Xu, Cheng Jin, Liang He

Then, to better utilize image attributes in aesthetic assessment, we propose the Unified Multi-attribute Aesthetic Assessment Framework (UMAAF) to model both absolute and relative attributes of images.

Attribute

Paper
Add Code

DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding

1 code implementation • 29 Oct 2023 • Anran Wu, Luwei Xiao, Xingjiao Wu, Shuwen Yang, Junjie Xu, Zisong Zhuang, Nian Xie, Cheng Jin, Liang He

Our DCQA dataset is expected to foster research on understanding visualizations in documents, especially for scenarios that require complex reasoning for charts in the visually-rich document.

Answer Generation Chart Question Answering +5

Paper
Code

Progressive Evidence Refinement for Open-domain Multimodal Retrieval Question Answering

no code implementations • 15 Oct 2023 • Shuwen Yang, Anran Wu, Xingjiao Wu, Luwei Xiao, Tianlong Ma, Cheng Jin, Liang He

Firstly, utilizing compressed evidence features as input to the model results in the loss of fine-grained information within the evidence.

Contrastive Learning Logical Sequence +2

Paper
Add Code

FairMonitor: A Four-Stage Automatic Framework for Detecting Stereotypes and Biases in Large Language Models

no code implementations • 21 Aug 2023 • Yanhong Bai, Jiabao Zhao, Jinxin Shi, Tingjiang Wei, Xingjiao Wu, Liang He

Detecting stereotypes and biases in Large Language Models (LLMs) can enhance fairness and reduce adverse impacts on individuals or groups when these LLMs are applied.

Fairness

Paper
Add Code

DDT: Dual-branch Deformable Transformer for Image Denoising

1 code implementation • 13 Apr 2023 • Kangliang Liu, Xiangcheng Du, Sijie Liu, Yingbin Zheng, Xingjiao Wu, Cheng Jin

Transformer is beneficial for image denoising tasks since it can model long-range dependencies to overcome the limitations presented by inductive convolutional biases.

Image Denoising

Paper
Code

LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion

1 code implementation • CVPR 2023 • Xin Li, Tao Ma, Yuenan Hou, Botian Shi, Yuchen Yang, Youquan Liu, Xingjiao Wu, Qin Chen, Yikang Li, Yu Qiao, Liang He

Notably, LoGoNet ranks 1st on Waymo 3D object detection leaderboard and obtains 81. 02 mAPH (L2) detection performance.

3D Object Detection object-detection +1

Paper
Code

Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection

no code implementations • 18 Oct 2022 • Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He

To address these problems, we construct the homogeneous structure between the point cloud and images to avoid projective information loss by transforming the camera features into the LiDAR 3D space.

3D Object Detection Autonomous Driving +1

Paper
Add Code

Progressive Scene Text Erasing with Self-Supervision

no code implementations • 23 Jul 2022 • Xiangcheng Du, Zhao Zhou, Yingbin Zheng, Xingjiao Wu, Tianlong Ma, Cheng Jin

Scene text erasing seeks to erase text contents from scene images and current state-of-the-art text erasing models are trained on large-scale synthetic data.

Paper
Add Code

Multi-channel Attentive Graph Convolutional Network With Sentiment Fusion For Multimodal Sentiment Analysis

no code implementations • 25 Jan 2022 • Luwei Xiao, Xingjiao Wu, Wen Wu, Jing Yang, Liang He

This paper proposes a Multi-channel Attentive Graph Convolutional Network (MAGCN), consisting of two main components: cross-modality interactive learning and sentimental feature fusion.

Multimodal Sentiment Analysis

Paper
Add Code

Cross-Domain Document Layout Analysis via Unsupervised Document Style Guide

no code implementations • 24 Jan 2022 • Xingjiao Wu, Luwei Xiao, Xiangcheng Du, Yingbin Zheng, Xin Li, Tianlong Ma, Liang He

Our framework is an unsupervised document layout analysis framework.

Contrastive Learning Document Layout Analysis

Paper
Add Code

Document Layout Analysis with Aesthetic-Guided Image Augmentation

no code implementations • 27 Nov 2021 • Tianlong Ma, Xingjiao Wu, Xin Li, Xiangcheng Du, Zhao Zhou, Liang Xue, Cheng Jin

To measure the proposed image layer modeling method, we propose a manually-labeled non-Manhattan layout fine-grained segmentation dataset named FPD.

Document Layout Analysis document understanding +2

Paper
Add Code

Document Image Layout Analysis via Explicit Edge Embedding Network

no code implementations • Information Sciences 2021 • Xingjiao Wu, Yingbin Zheng, Tianlong Ma, Hao Ye, Liang He

Layout analysis from a document image plays an important role in document content understanding and information extraction systems.

Data Augmentation Document Layout Analysis

Paper
Add Code

Human-In-The-Loop Document Layout Analysis

no code implementations • 4 Aug 2021 • Xingjiao Wu, Tianlong Ma, Xin Li, Qin Chen, Liang He

The HITL select key samples by using confidence.

Document Layout Analysis Semantic Segmentation

Paper
Add Code

A Survey of Human-in-the-loop for Machine Learning

no code implementations • 2 Aug 2021 • Xingjiao Wu, Luwei Xiao, Yixuan Sun, Junhang Zhang, Tianlong Ma, Liang He

Humans can provide training data for machine learning applications and directly accomplish tasks that are hard for computers in the pipeline with the help of machine-based approaches.

BIG-bench Machine Learning

Paper
Add Code

Document Layout Analysis via Dynamic Residual Feature Fusion

no code implementations • 7 Apr 2021 • Xingjiao Wu, Ziling Hu, Xiangcheng Du, Jing Yang, Liang He

The document layout analysis (DLA) aims to split the document image into different interest regions and understand the role of each region, which has wide application such as optical character recognition (OCR) systems and document retrieval.

Document Layout Analysis Optical Character Recognition +2

Paper
Add Code

Scene Text Recognition with Temporal Convolutional Encoder

no code implementations • 4 Nov 2019 • Xiangcheng Du, Tianlong Ma, Yingbin Zheng, Hao Ye, Xingjiao Wu, Liang He

In this paper, we study text recognition framework by considering the long-term temporal dependencies in the encoder stage.

Decoder Scene Text Recognition

Paper
Add Code

Fast Video Crowd Counting with a Temporal Aware Network

no code implementations • 4 Jul 2019 • Xingjiao Wu, Baohan Xu, Yingbin Zheng, Hao Ye, Jing Yang, Liang He

Crowd counting aims to count the number of instantaneous people in a crowded space, and many promising solutions have been proposed for single image crowd counting.

Crowd Counting

Paper
Add Code

Adaptive Scenario Discovery for Crowd Counting

1 code implementation • 6 Dec 2018 • Xingjiao Wu, Yingbin Zheng, Hao Ye, Wenxin Hu, Jing Yang, Liang He

Crowd counting, i. e., estimation number of the pedestrian in crowd images, is emerging as an important research problem with the public security applications.

Crowd Counting

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.