Search Results for author: Junjie Wang

Found 39 papers, 13 papers with code

MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering

1 code implementation • Findings (EMNLP) 2021 • Junjie Wang, Yatai Ji, Jiaqi Sun, Yujiu Yang, Tetsuya Sakai

On the other hand, trilinear models such as the CTI model efficiently utilize the inter-modality information between answers, questions, and images, while ignoring intra-modality information.

Multiple-choice Question Answering +1

Paper
Code

VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing

no code implementations • 5 Mar 2024 • Zhiyuan Chang, Mingyang Li, Junjie Wang, Cheng Li, Qing Wang

Visual entailment (VE) is a multimodal reasoning task consisting of image-sentence pairs whereby a promise is defined by an image, and a hypothesis is described by a sentence.

Multimodal Reasoning Sentence +1

Paper
Add Code

Adversarial Testing for Visual Grounding via Image-Aware Property Reduction

no code implementations • 2 Mar 2024 • Zhiyuan Chang, Mingyang Li, Junjie Wang, Cheng Li, Boyu Wu, Fanjiang Xu, Qing Wang

To this end, we propose PEELING, a text perturbation approach via image-aware property reduction for adversarial testing of the VG model.

Visual Grounding

Paper
Add Code

Evaluating Decision Optimality of Autonomous Driving via Metamorphic Testing

no code implementations • 28 Feb 2024 • Mingfei Cheng, Yuan Zhou, Xiaofei Xie, Junjie Wang, Guozhu Meng, Kairui Yang

In this paper, we focus on evaluating the decision-making quality of an ADS and propose the first method for detecting non-optimal decision scenarios (NoDSs), where the ADS does not compute optimal paths for AVs.

Autonomous Driving Decision Making

Paper
Add Code

StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

no code implementations • 26 Feb 2024 • Alex Zhuang, Ge Zhang, Tianyu Zheng, Xinrun Du, Junjie Wang, Weiming Ren, Stephen W. Huang, Jie Fu, Xiang Yue, Wenhu Chen

Utilizing this dataset, we train a series of models, referred to as StructLM, based on the Mistral and the CodeLlama model family, ranging from 7B to 34B parameters.

Paper
Add Code

Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues

no code implementations • 14 Feb 2024 • Zhiyuan Chang, Mingyang Li, Yi Liu, Junjie Wang, Qing Wang, Yang Liu

With the development of LLMs, the security threats of LLMs are getting more and more attention.

Paper
Add Code

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

1 code implementation • 22 Jan 2024 • Ge Zhang, Xinrun Du, Bei Chen, Yiming Liang, Tongxu Luo, Tianyu Zheng, Kang Zhu, Yuyang Cheng, Chunpu Xu, Shuyue Guo, Haoran Zhang, Xingwei Qu, Junjie Wang, Ruibin Yuan, Yizhi Li, Zekun Wang, Yudong Liu, Yu-Hsuan Tsai, Fengji Zhang, Chenghua Lin, Wenhao Huang, Wenhu Chen, Jie Fu

We introduce CMMMU, a new Chinese Massive Multi-discipline Multimodal Understanding benchmark designed to evaluate LMMs on tasks demanding college-level subject knowledge and deliberate reasoning in a Chinese context.

7,249

Paper
Code

Know Your Needs Better: Towards Structured Understanding of Marketer Demands with Analogical Reasoning Augmented LLMs

2 code implementations • 9 Jan 2024 • Junjie Wang, Dan Yang, Binbin Hu, Yue Shen, Ziqi Liu, Wen Zhang, Jinjie Gu, Zhiqiang Zhang

Considering the impressive natural language processing ability of large language models (LLMs), we try to leverage LLMs to solve this issue.

22,381

Paper
Code

AdapterDistillation: Non-Destructive Task Composition with Knowledge Distillation

no code implementations • 26 Dec 2023 • Junjie Wang, Yicheng Chen, Wangshu Zhang, Sen Hu, Teng Xu, Jing Zheng

In the second stage, we distill the knowledge from the existing teacher adapters into the student adapter to help its inference.

Knowledge Distillation Retrieval

Paper
Add Code

A Survey on Query-based API Recommendation

no code implementations • 17 Dec 2023 • Moshi Wei, Nima Shiri Harzevili, Alvine Boaye Belle, Junjie Wang, Lin Shi, Jinqiu Yang, Song Wang, Ming Zhen, Jiang

We also investigate the typical data extraction procedures and collection approaches employed by the existing approaches.

Paper
Add Code

From Beginner to Expert: Modeling Medical Knowledge into General LLMs

no code implementations • 2 Dec 2023 • Qiang Li, Xiaoyan Yang, Haowen Wang, Qin Wang, Lei Liu, Junjie Wang, Yang Zhang, Mingyuan Chu, Sen Hu, Yicheng Chen, Yue Shen, Cong Fan, Wangshu Zhang, Teng Xu, Jinjie Gu, Jing Zheng, Guannan Zhang Ant Group

(3) Specifically for multi-choice questions in the medical domain, we propose a novel Verification-of-Choice approach for prompting engineering, which significantly enhances the reasoning ability of LLMs.

Language Modelling Large Language Model +3

Paper
Add Code

GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions

no code implementations • 27 Nov 2023 • Jiemin Fang, Junjie Wang, Xiaopeng Zhang, Lingxi Xie, Qi Tian

Specifically, we first extract the region of interest (RoI) corresponding to the text instruction, aligning it to 3D Gaussians.

3D scene Editing

Paper
Add Code

Reliable Academic Conference Question Answering: A Study Based on Large Language Model

no code implementations • 19 Oct 2023 • Zhiwei Huang, Long Jin, Junjie Wang, Mingchen Tu, Yin Hua, Zhiqiang Liu, Jiawei Meng, Huajun Chen, Wen Zhang

To address this need, we have developed the ConferenceQA dataset for 7 diverse academic conferences with human annotations.

Hallucination Language Modelling +3

Paper
Add Code

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models

1 code implementation • 12 Oct 2023 • Taoran Yi, Jiemin Fang, Junjie Wang, Guanjun Wu, Lingxi Xie, Xiaopeng Zhang, Wenyu Liu, Qi Tian, Xinggang Wang

In recent times, the generation of 3D assets from text prompts has shown impressive results.

Text to 3D

541

Paper
Code

EALM: Introducing Multidimensional Ethical Alignment in Conversational Information Retrieval

1 code implementation • 2 Oct 2023 • Yiyao Yu, Junjie Wang, Yuxiang Zhang, Lin Zhang, Yujiu Yang, Tetsuya Sakai

Artificial intelligence (AI) technologies should adhere to human norms to better serve our society and avoid disseminating harmful or misleading information, particularly in Conversational Information Retrieval (CIR).

Ethics Information Retrieval +1

Paper
Code

UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective

no code implementations • 17 May 2023 • Ping Yang, Junyu Lu, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Jiaxing Zhang, Pingjian Zhang

We propose a new paradigm for universal information extraction (IE) that is compatible with any schema format and applicable to a list of IE tasks, such as named entity recognition, relation extraction, event extraction and sentiment analysis.

Event Extraction named-entity-recognition +3

Paper
Add Code

NER-to-MRC: Named-Entity Recognition Completely Solving as Machine Reading Comprehension

no code implementations • 6 May 2023 • Yuxiang Zhang, Junjie Wang, Xinyu Zhu, Tetsuya Sakai, Hayato Yamana

Named-entity recognition (NER) detects texts with predefined semantic labels and is an essential building block for natural language processing (NLP).

Machine Reading Comprehension named-entity-recognition +2

Paper
Add Code

Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning

no code implementations • 23 Nov 2022 • Junjie Wang, Yao Mu, Dong Li, Qichao Zhang, Dongbin Zhao, Yuzheng Zhuang, Ping Luo, Bin Wang, Jianye Hao

The latent world model provides a promising way to learn policies in a compact latent space for tasks with high-dimensional observations, however, its generalization across diverse environments with unseen dynamics remains challenging.

Model-based Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Solving Math Word Problems via Cooperative Reasoning induced Language Models

1 code implementation • 28 Oct 2022 • Xinyu Zhu, Junjie Wang, Lin Zhang, Yuxiang Zhang, Ruyi Gan, Jiaxing Zhang, Yujiu Yang

This inspires us to develop a cooperative reasoning-induced PLM for solving MWPs, called Cooperative Reasoning (CoRe), resulting in a human-like reasoning architecture with system 1 as the generator and system 2 as the verifier.

Ranked #104 on Arithmetic Reasoning on GSM8K

Arithmetic Reasoning Math

Paper
Code

Corrected Evaluation Results of the NTCIR WWW-2, WWW-3, and WWW-4 English Subtasks

no code implementations • 19 Oct 2022 • Tetsuya Sakai, Sijie Tao, Maria Maistro, Zhumin Chu, Yujing Li, Nuo Chen, Nicola Ferro, Junjie Wang, Ian Soboroff, Yiqun Liu

The noise is due to a fatal bug in the backend of our relevance assessment interface.

Paper
Add Code

Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective

1 code implementation • 16 Oct 2022 • Ping Yang, Junjie Wang, Ruyi Gan, Xinyu Zhu, Lin Zhang, Ziwei Wu, Xinyu Gao, Jiaxing Zhang, Tetsuya Sakai

We propose a new paradigm for zero-shot learners that is format agnostic, i. e., it is compatible with any format and applicable to a list of language tasks, such as text classification, commonsense reasoning, coreference resolution, and sentiment analysis.

Multiple-choice Natural Language Inference +4

3,918

Paper
Code

LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge

no code implementations • 14 Oct 2022 • Yan Jia, Mi Hong, Jingyu Hou, Kailong Ren, Sifan Ma, Jin Wang, Fangzhen Peng, Yinglin Ji, Lin Yang, Junjie Wang

This paper describes LeVoice automatic speech recognition systems to track2 of intelligent cockpit speech recognition challenge 2022.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model

1 code implementation • CVPR 2023 • Yatai Ji, Junjie Wang, Yuan Gong, Lin Zhang, Yanru Zhu, Hongfa Wang, Jiaxing Zhang, Tetsuya Sakai, Yujiu Yang

Multimodal semantic understanding often has to deal with uncertainty, which means the obtained messages tend to refer to multiple targets.

Contrastive Learning Image-text matching +9

Paper
Code

TC-SKNet with GridMask for Low-complexity Classification of Acoustic scene

no code implementations • 5 Oct 2022 • Luyuan Xie, Yan Zhong, Lin Yang, Zhaoyu Yan, Zhonghai Wu, Junjie Wang

In our experiments, the performance gain brought by GridMask is stronger than spectrum augmentation in ASCs.

AutoML Data Augmentation

Paper
Add Code

Im2Oil: Stroke-Based Oil Painting Rendering with Linearly Controllable Fineness Via Adaptive Sampling

1 code implementation • 27 Sep 2022 • Zhengyan Tong, Xiaohang Wang, Shengchao Yuan, Xuanhong Chen, Junjie Wang, Xiangzhong Fang

Comparison with existing state-of-the-art oil painting techniques shows that our results have higher fidelity and more realistic textures.

Paper
Code

Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence

1 code implementation • 7 Sep 2022 • Jiaxing Zhang, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Lin Zhang, Ping Yang, Xinyu Gao, Ziwei Wu, Xiaoqun Dong, Junqing He, Jianheng Zhuo, Qi Yang, Yongfeng Huang, Xiayu Li, Yanghan Wu, Junyu Lu, Xinyu Zhu, Weifeng Chen, Ting Han, Kunhao Pan, Rui Wang, Hao Wang, XiaoJun Wu, Zhongshen Zeng, Chongpei Chen

We hope that this project will be the foundation of Chinese cognitive intelligence.

3,918

Paper
Code

Towards No.1 in CLUE Semantic Matching Challenge: Pre-trained Language Model Erlangshen with Propensity-Corrected Loss

1 code implementation • 5 Aug 2022 • Junjie Wang, Yuxiang Zhang, Ping Yang, Ruyi Gan

This report describes a pre-trained language model Erlangshen with propensity-corrected loss, the No. 1 in CLUE Semantic Matching Challenge.

Language Modelling Masked Language Modeling

3,918

Paper
Code

Change Detection from Synthetic Aperture Radar Images via Dual Path Denoising Network

no code implementations • 13 Mar 2022 • Junjie Wang, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

We also propose the distinctive patch convolution for feature representation learning to reduce the time consumption.

Change Detection Computational Efficiency +2

Paper
Add Code

Adaptive DropBlock Enhanced Generative Adversarial Networks for Hyperspectral Image Classification

1 code implementation • 22 Jan 2022 • Junjie Wang, Feng Gao, Junyu Dong, Qian Du

Second, an adaptive DropBlock (AdapDrop) is proposed as a regularization method employed in the generator and discriminator to alleviate the mode collapse issue.

Classification Hyperspectral Image Classification

Paper
Code

Change Detection from Synthetic Aperture Radar Images via Graph-Based Knowledge Supplement Network

1 code implementation • 22 Jan 2022 • Junjie Wang, Feng Gao, Junyu Dong, Shan Zhang, Qian Du

Synthetic aperture radar (SAR) image change detection is a vital yet challenging task in the field of remote sensing image analysis.

Change Detection Feature Correlation

Paper
Code

Low-Latency Online Speaker Diarization with Graph-Based Label Generation

no code implementations • 27 Nov 2021 • Yucong Zhang, Qinjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li

To ensure the low latency in the online setting, we introduce a variant of AHC, namely chkpt-AHC, to cluster the speakers.

Clustering speaker-diarization +1

Paper
Add Code

Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning

no code implementations • 22 Sep 2021 • Junjie Wang, Qichao Zhang, Dongbin Zhao

We train several state-of-the-art deep reinforcement learning methods in the designed training scenarios and provide the benchmark metrics evaluation results of the trained models in the test scenarios.

Autonomous Driving Benchmarking +4

Paper
Add Code

The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge

no code implementations • 5 Sep 2021 • Weiqing Wang, Danwei Cai, Qingjian Lin, Lin Yang, Junjie Wang, Jin Wang, Ming Li

This report describes the submission of the DKU-DukeECE-Lenovo team to the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2021 track 4.

Action Detection Activity Detection +4

Paper
Add Code

Sparsely Overlapped Speech Training in the Time Domain: Joint Learning of Target Speech Separation and Personal VAD Benefits

no code implementations • 28 Jun 2021 • Qingjian Lin, Lin Yang, Xuyang Wang, Luyuan Xie, Chen Jia, Junjie Wang

This paper proposes the weighted SI-SNR loss, together with the joint learning of target speech separation and personal VAD.

Speech Separation

Paper
Add Code

Change Detection from SAR Images Based on Deformable Residual Convolutional Neural Networks

no code implementations • 6 Apr 2021 • Junjie Wang, Feng Gao, Junyu Dong

Convolutional neural networks (CNN) have made great progress for synthetic aperture radar (SAR) images change detection.

Change Detection

Paper
Add Code

TransfoRNN: Capturing the Sequential Information in Self-Attention Representations for Language Modeling

no code implementations • 4 Apr 2021 • Tze Yuang Chong, Xuyang Wang, Lin Yang, Junjie Wang

Also, the TransfoRNN model was applied on the LibriSpeech speech recognition task and has shown comparable results with the Transformer models.

Language Modelling speech-recognition +1

Paper
Add Code

Skeleton2Mesh: Kinematics Prior Injected Unsupervised Human Mesh Recovery

no code implementations • ICCV 2021 • Zhenbo Yu, Junjie Wang, Jingwei Xu, Bingbing Ni, Chenglong Zhao, Minsi Wang, Wenjun Zhang

The challenges of the latter task are two folds: (1) pose failure (i. e., pose mismatching -- different skeleton definitions in dataset and SMPL , and pose ambiguity -- endpoints have arbitrary joint angle configurations for the same 3D joint coordinates).

3D Pose Estimation Human Mesh Recovery

Paper
Add Code

Towards Alleviating the Modeling Ambiguity of Unsupervised Monocular 3D Human Pose Estimation

no code implementations • ICCV 2021 • Zhenbo Yu, Bingbing Ni, Jingwei Xu, Junjie Wang, Chenglong Zhao, Wenjun Zhang

Furthermore, two temporal constraints are proposed to alleviate the scale and pose ambiguity respectively.

Monocular 3D Human Pose Estimation Unsupervised 3D Human Pose Estimation

Paper
Add Code

Training Wake Word Detection with Synthesized Speech Data on Confusion Words

no code implementations • 3 Nov 2020 • Yan Jia, Zexin Cai, Murong Ma, Zeqing Zhao, Xuyang Wang, Junjie Wang, Ming Li

Confusing-words are commonly encountered in real-life keyword spotting applications, which causes severe degradation of performance due to complex spoken terms and various kinds of words that sound similar to the predefined keywords.

Data Augmentation Keyword Spotting +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.