Search Results for author: Dongyun Lin

Found 8 papers, 0 papers with code

Bridging the Intent Gap: Knowledge-Enhanced Visual Generation

no code implementations • 21 May 2024 • Yi Cheng, Ziwei Xu, Dongyun Lin, Harry Cheng, Yongkang Wong, Ying Sun, Joo Hwee Lim, Mohan Kankanhalli

To address these challenges, we propose a knowledge-enhanced iterative refinement framework for visual content generation.

World Knowledge

PEVA-Net: Prompt-Enhanced View Aggregation Network for Zero/Few-Shot Multi-View 3D Shape Recognition

no code implementations • 30 Apr 2024 • Dongyun Lin, Yi Cheng, Shangbo Mao, Aiyuan Guo, Yiqun Li

Specifically, using the descriptor that is effective for zero-shot inference to guide the tuning of the aggregated descriptor during few-shot training can significantly improve few-shot learning efficacy.

3D Shape Recognition, Few-Shot Learning (+2 more)

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering

no code implementations • 25 Jul 2023 • Yi Cheng, Hehe Fan, Dongyun Lin, Ying Sun, Mohan Kankanhalli, Joo-Hwee Lim

The main challenge in video question answering (VideoQA) is to capture and understand the complex spatial and temporal relations between objects based on given questions.

Graph Construction, Question Answering (+2 more)

SCA-PVNet: Self-and-Cross Attention Based Aggregation of Point Cloud and Multi-View for 3D Object Retrieval

no code implementations • 20 Jul 2023 • Dongyun Lin, Yi Cheng, Aiyuan Guo, Shangbo Mao, Yiqun Li

With deep features extracted from point clouds and multi-view images, we design two types of feature aggregation modules, namely the In-Modality Aggregation Module (IMAM) and the Cross-Modality Aggregation Module (CMAM), for effective feature fusion.

3D Object Retrieval, Object (+1 more)

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023

no code implementations • 13 Jul 2023 • Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lin, Hehe Fan, Yongkang Wong, Ying Sun, Mohan Kankanhalli

Our research focuses on applying a differentiable logic loss during training to leverage the co-occurrence relations between verbs and nouns, and on using pre-trained Large Language Models (LLMs) to generate the logic rules for adaptation to unseen action labels.

Action Recognition, Unsupervised Domain Adaptation

Team VI-I2R Technical Report on EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2022

no code implementations • 29 Jan 2023 • Yi Cheng, Dongyun Lin, Fen Fang, Hao Xuan Woon, Qianli Xu, Ying Sun

In this report, we present the technical details of our submission to the EPIC-KITCHENS-100 Unsupervised Domain Adaptation (UDA) Challenge for Action Recognition 2022.

Action Recognition, Unsupervised Domain Adaptation

DDR-ID: Dual Deep Reconstruction Networks Based Image Decomposition for Anomaly Detection

no code implementations • 18 Jul 2020 • Dongyun Lin, Yiqun Li, Shudong Xie, Tin Lay Nwe, Sheng Dong

One pivotal challenge for image anomaly detection (AD) is to learn discriminative information only from normal-class training images.

Adversarial Attack Detection, Anomaly Detection (+2 more)

Few-Shot Defect Segmentation Leveraging Abundant Normal Training Samples Through Normal Background Regularization and Crop-and-Paste Operation

no code implementations • 18 Jul 2020 • Dongyun Lin, Yanpeng Cao, Wenbing Zhu, Yiqun Li

In industrial inspection tasks, it is common to capture abundant defect-free image samples but very limited anomalous ones.

Anomaly Detection, Benchmarking (+3 more)
