Search Results for author: Dingkang Yang

Found 30 papers, 9 papers with code

SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion

no code implementations • 5 May 2024 • Ziyun Qian, Zeyu Xiao, Zhenyi Wu, Dingkang Yang, Mingcheng Li, Shunli Wang, Shuaibing Wang, Dongliang Kou, Lihua Zhang

To address these problems, we consider style motion as a condition and propose the Style Motion Conditioned Diffusion (SMCD) framework for the first time, which can more comprehensively learn the style features of motion.

Motion Style Transfer Style Transfer

Paper
Add Code

Multi-Scale Heterogeneity-Aware Hypergraph Representation for Histopathology Whole Slide Images

1 code implementation • 30 Apr 2024 • Minghao Han, Xukun Zhang, Dingkang Yang, Tao Liu, Haopeng Kuang, Jinghui Feng, Lihua Zhang

Survival prediction is a complex ordinal regression task that aims to predict the survival coefficient ranking among a cohort of patients, typically achieved by analyzing patients' whole slide images.

Multiple Instance Learning Survival Prediction +1

Paper
Code

Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities

no code implementations • 25 Apr 2024 • Mingcheng Li, Dingkang Yang, Xiao Zhao, Shuaibing Wang, Yan Wang, Kun Yang, Mingyang Sun, Dongliang Kou, Ziyun Qian, Lihua Zhang

Specifically, we present a sample-level contrastive distillation mechanism that transfers comprehensive knowledge containing cross-sample correlations to reconstruct missing semantics.

Disentanglement Knowledge Distillation +1

Paper
Add Code

Efficiency in Focus: LayerNorm as a Catalyst for Fine-tuning Medical Visual Language Pre-trained Models

no code implementations • 25 Apr 2024 • Jiawei Chen, Dingkang Yang, Yue Jiang, Mingcheng Li, Jinjie Wei, Xiaolu Hou, Lihua Zhang

In the realm of Medical Visual Language Models (Med-VLMs), the quest for universal efficient fine-tuning mechanisms remains paramount, especially given researchers in interdisciplinary fields are often extremely short of training resources, yet largely unexplored.

Medical Visual Question Answering Question Answering +1

Paper
Add Code

De-confounded Data-free Knowledge Distillation for Handling Distribution Shifts

no code implementations • 28 Mar 2024 • Yuzheng Wang, Dingkang Yang, Zhaoyu Chen, Yang Liu, Siao Liu, Wenqiang Zhang, Lihua Zhang, Lizhe Qi

Data-Free Knowledge Distillation (DFKD) is a promising task to train high-performance small models to enhance actual deployment without relying on the original training data.

Causal Inference Data-free Knowledge Distillation

Paper
Add Code

Can LLMs' Tuning Methods Work in Medical Multimodal Domain?

no code implementations • 11 Mar 2024 • Jiawei Chen, Yue Jiang, Dingkang Yang, Mingcheng Li, Jinjie Wei, Ziyun Qian, Lihua Zhang

We show the different impacts of fine-tuning methods for large models on medical VLMs and develop the most efficient ways to fine-tune medical VLP models.

Transfer Learning World Knowledge

Paper
Add Code

Robust Emotion Recognition in Context Debiasing

no code implementations • 9 Mar 2024 • Dingkang Yang, Kun Yang, Mingcheng Li, Shunli Wang, Shuaibing Wang, Lihua Zhang

Following the causal graph, CLEF introduces a non-invasive context branch to capture the adverse direct effect caused by the context bias.

counterfactual Emotion Recognition in Context

Paper
Add Code

Towards Multimodal Human Intention Understanding Debiasing via Subject-Deconfounding

no code implementations • 8 Mar 2024 • Dingkang Yang, Dongling Xiao, Ke Li, Yuzheng Wang, Zhaoyu Chen, Jinjie Wei, Lihua Zhang

Multimodal intention understanding (MIU) is an indispensable component of human expression analysis (e. g., sentiment or humor) from heterogeneous modalities, including visual postures, linguistic contents, and acoustic behaviors.

Paper
Add Code

Towards Multimodal Sentiment Analysis Debiasing via Bias Purification

no code implementations • 8 Mar 2024 • Dingkang Yang, Mingcheng Li, Dongling Xiao, Yang Liu, Kun Yang, Zhaoyu Chen, Yuzheng Wang, Peng Zhai, Ke Li, Lihua Zhang

In the inference phase, given a factual multimodal input, MCIS imagines two counterfactual scenarios to purify and mitigate these biases.

counterfactual Counterfactual Inference +1

Paper
Add Code

HandGCAT: Occlusion-Robust 3D Hand Mesh Reconstruction from Monocular Images

1 code implementation • 27 Feb 2024 • Shuaibing Wang, Shunli Wang, Dingkang Yang, Mingcheng Li, Ziyun Qian, Liuzhen Su, Lihua Zhang

KGC extracts hand prior information from 2D hand pose by graph convolution.

Paper
Code

Delving into Decision-based Black-box Attacks on Semantic Segmentation

no code implementations • 2 Feb 2024 • Zhaoyu Chen, Zhengyang Shan, Jingwen Chang, Kaixun Jiang, Dingkang Yang, Yiting Cheng, Wenqiang Zhang

We conduct adversarial robustness evaluation on 5 models from Cityscapes and ADE20K under 8 attacks.

Adversarial Robustness Segmentation +1

Paper
Add Code

MISS: A Generative Pretraining and Finetuning Approach for Med-VQA

no code implementations • 10 Jan 2024 • Jiawei Chen, Dingkang Yang, Yue Jiang, Yuxuan Lei, Lihua Zhang

However, most methods in the medical field treat VQA as an answer classification task which is difficult to transfer to practical application scenarios.

Medical Visual Question Answering Multi-Task Learning +3

Paper
Add Code

CPR-Coach: Recognizing Composite Error Actions based on Single-class Training

no code implementations • 21 Sep 2023 • Shunli Wang, Qing Yu, Shuaibing Wang, Dingkang Yang, Liuzhen Su, Xiao Zhao, Haopeng Kuang, Peixuan Zhang, Peng Zhai, Lihua Zhang

For the first time, this paper constructs a vision-based system to complete error action recognition and skill assessment in CPR.

Action Analysis Action Recognition

Paper
Add Code

Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation

no code implementations • ICCV 2023 • Siao Liu, Zhaoyu Chen, Yang Liu, Yuzheng Wang, Dingkang Yang, Zhile Zhao, Ziqing Zhou, Xie Yi, Wei Li, Wenqiang Zhang, Zhongxue Gan

In particular, CG2A develops a Gradient Agreement Solver to adaptively balance the varying gradient magnitudes, and introduces a Soft Gradient Surgery strategy to alleviate the gradient conflicts.

reinforcement-learning

Paper
Add Code

Sampling to Distill: Knowledge Transfer from Open-World Data

no code implementations • 31 Jul 2023 • Yuzheng Wang, Zhaoyu Chen, Jie Zhang, Dingkang Yang, Zuhao Ge, Yang Liu, Siao Liu, Yunquan Sun, Wenqiang Zhang, Lizhe Qi

Then, we introduce a low-noise representation to alleviate the domain shifts and build a structured relationship of multiple data examples to exploit data knowledge.

Data-free Knowledge Distillation Transfer Learning

Paper
Add Code

AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception

1 code implementation • ICCV 2023 • Dingkang Yang, Shuai Huang, Zhi Xu, Zhenpeng Li, Shunli Wang, Mingcheng Li, Yuzheng Wang, Yang Liu, Kun Yang, Zhaoyu Chen, Yan Wang, Jing Liu, Peixuan Zhang, Peng Zhai, Lihua Zhang

Driver distraction has become a significant cause of severe traffic accidents over the past decade.

Paper
Code

Spatio-Temporal Domain Awareness for Multi-Agent Collaborative Perception

1 code implementation • ICCV 2023 • Kun Yang, Dingkang Yang, Jingyu Zhang, Mingcheng Li, Yang Liu, Jing Liu, Hanqi Wang, Peng Sun, Liang Song

In this paper, we propose SCOPE, a novel collaborative perception framework that aggregates the spatio-temporal awareness characteristics across on-road agents in an end-to-end manner.

3D Object Detection Autonomous Vehicles +1

Paper
Code

Human 3D Avatar Modeling with Implicit Neural Representation: A Brief Survey

no code implementations • 6 Jun 2023 • Mingyang Sun, Dingkang Yang, Dongliang Kou, Yang Jiang, Weihua Shan, Zhe Yan, Lihua Zhang

This paper comprehensively reviews the application of implicit neural representation in human body modeling.

Paper
Add Code

Context De-confounded Emotion Recognition

1 code implementation • CVPR 2023 • Dingkang Yang, Zhaoyu Chen, Yuzheng Wang, Shunli Wang, Mingcheng Li, Siao Liu, Xiao Zhao, Shuai Huang, Zhiyan Dong, Peng Zhai, Lihua Zhang

However, a long-overlooked issue is that a context bias in existing datasets leads to a significantly unbalanced distribution of emotional states among different context scenarios.

Emotion Recognition

Paper
Code

Out of Thin Air: Exploring Data-Free Adversarial Robustness Distillation

no code implementations • 21 Mar 2023 • Yuzheng Wang, Zhaoyu Chen, Dingkang Yang, Pinxue Guo, Kaixun Jiang, Wenqiang Zhang, Lizhe Qi

Adversarial Robustness Distillation (ARD) is a promising task to solve the issue of limited adversarial robustness of small capacity models while optimizing the expensive computational costs of Adversarial Training (AT).

Adversarial Robustness Knowledge Distillation +1

Paper
Add Code

Efficient Decision-based Black-box Patch Attacks on Video Recognition

no code implementations • ICCV 2023 • Kaixun Jiang, Zhaoyu Chen, Hao Huang, Jiafeng Wang, Dingkang Yang, Bo Li, Yan Wang, Wenqiang Zhang

First, STDE introduces target videos as patch textures and only adds patches on keyframes that are adaptively selected by temporal difference.

Video Recognition

Paper
Add Code

A novel efficient Multi-view traffic-related object detection framework

no code implementations • 23 Feb 2023 • Kun Yang, Jing Liu, Dingkang Yang, Hanqi Wang, Peng Sun, Yanni Zhang, Yan Liu, Liang Song

With the rapid development of intelligent transportation system applications, a tremendous amount of multi-view video data has emerged to enhance vehicle perception.

Model Selection object-detection +1

Paper
Add Code

Towards Simultaneous Segmentation of Liver Tumors and Intrahepatic Vessels via Cross-attention Mechanism

no code implementations • 20 Feb 2023 • Haopeng Kuang, Dingkang Yang, Shunli Wang, Xiaoying Wang, Lihua Zhang

Accurate visualization of liver tumors and their surrounding blood vessels is essential for noninvasive diagnosis and prognosis prediction of tumors.

Decoder Image Segmentation +3

Paper
Add Code

Adversarial Contrastive Distillation with Adaptive Denoising

no code implementations • 17 Feb 2023 • Yuzheng Wang, Zhaoyu Chen, Dingkang Yang, Yang Liu, Siao Liu, Wenqiang Zhang, Lizhe Qi

To this end, we propose a novel structured ARD method called Contrastive Relationship DeNoise Distillation (CRDND).

Adversarial Robustness Denoising +1

Paper
Add Code

Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models

1 code implementation • 10 Feb 2023 • Yang Liu, Dingkang Yang, Yan Wang, Jing Liu, Jun Liu, Azzedine Boukerche, Peng Sun, Liang Song

Video Anomaly Detection (VAD) serves as a pivotal technology in the intelligent surveillance systems, enabling the temporal or spatial identification of anomalous events within videos.

Anomaly Detection Event Detection +1

Paper
Code

Boosting the Transferability of Adversarial Attacks with Global Momentum Initialization

2 code implementations • 21 Nov 2022 • Jiafeng Wang, Zhaoyu Chen, Kaixun Jiang, Dingkang Yang, Lingyi Hong, Pinxue Guo, Haijing Guo, Wenqiang Zhang

To tackle these issues, we propose Global Momentum Initialization (GI) to suppress gradient elimination and help search for the global optimum.

144

Paper
Code

Learning Appearance-motion Normality for Video Anomaly Detection

no code implementations • 27 Jul 2022 • Yang Liu, Jing Liu, Mengyang Zhao, Dingkang Yang, Xiaoguang Zhu, Liang Song

Video anomaly detection is a challenging task in the computer vision community.

Anomaly Detection Video Anomaly Detection

Paper
Add Code

CA-SpaceNet: Counterfactual Analysis for 6D Pose Estimation in Space

1 code implementation • 16 Jul 2022 • Shunli Wang, Shuaibing Wang, Bo Jiao, Dingkang Yang, Liuzhen Su, Peng Zhai, Chixiao Chen, Lihua Zhang

Considering that the pose estimator is sensitive to background interference, this paper proposes a counterfactual analysis framework named CASpaceNet to complete robust 6D pose estimation of the spaceborne targets under complicated background.

6D Pose Estimation Causal Inference +2

Paper
Code

A Survey of Video-based Action Quality Assessment

no code implementations • 20 Apr 2022 • Shunli Wang, Dingkang Yang, Peng Zhai, Qing Yu, Tao Suo, Zhan Sun, Ka Li, Lihua Zhang

Most of the existing work focuses on sports and medical care.

Action Quality Assessment Action Recognition +3

Paper
Add Code

TSA-Net: Tube Self-Attention Network for Action Quality Assessment

1 code implementation • 11 Jan 2022 • Shunli Wang, Dingkang Yang, Peng Zhai, Chixiao Chen, Lihua Zhang

Specifically, we introduce a single object tracker into AQA and propose the Tube Self-Attention Module (TSA), which can efficiently generate rich spatio-temporal contextual information by adopting sparse feature interactions.

Action Assessment Action Quality Assessment +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.