Search Results for author: Zhiyong Wang

Found 42 papers, 14 papers with code

Variance-Dependent Regret Bounds for Non-stationary Linear Bandits

no code implementations • 15 Mar 2024 • Zhiyong Wang, Jize Xie, Yi Chen, John C. S. Lui, Dongruo Zhou

We investigate the non-stationary stochastic linear bandit problem where the reward distribution evolves each round.

Paper
Add Code

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

no code implementations • 26 Feb 2024 • Hantao Yang, Xutong Liu, Zhiyong Wang, Hong Xie, John C. S. Lui, Defu Lian, Enhong Chen

We study the problem of federated contextual combinatorial cascading bandits, where $|\mathcal{U}|$ agents collaborate under the coordination of a central server to provide tailored recommendations to the $|\mathcal{U}|$ corresponding users.

Paper
Add Code

Design Your Own Universe: A Physics-Informed Agnostic Method for Enhancing Graph Neural Networks

no code implementations • 26 Jan 2024 • Dai Shi, Andi Han, Lequan Lin, Yi Guo, Zhiyong Wang, Junbin Gao

Physics-informed Graph Neural Networks have achieved remarkable performance in learning through graph-structured data by mitigating common GNN challenges such as over-smoothing, over-squashing, and heterophily adaption.

Paper
Add Code

Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection

no code implementations • 11 Jan 2024 • Weibo Jiang, Weihong Ren, Jiandong Tian, Liangqiong Qu, Zhiyong Wang, Honghai Liu

In this work, we propose to explore Self- and Cross-Triplet Correlations (SCTC) for HOI detection.

Human-Object Interaction Detection Knowledge Distillation +2

Paper
Add Code

XAI for In-hospital Mortality Prediction via Multimodal ICU Data

1 code implementation • 29 Dec 2023 • Xingqiao Li, Jindong Gu, Zhiyong Wang, Yancheng Yuan, Bo Du, Fengxiang He

To address this issue, this paper proposes an eXplainable Multimodal Mortality Predictor (X-MMP) approaching an efficient, explainable AI solution for predicting in-hospital mortality via multimodal ICU data.

Decision Making Mortality Prediction

Paper
Code

SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation

2 code implementations • 22 Dec 2023 • Wenxi Yue, Jing Zhang, Kun Hu, Qiuxia Wu, ZongYuan Ge, Yong Xia, Jiebo Luo, Zhiyong Wang

Specifically, we achieve this by proposing (1) Collaborative Prompts that describe instrument structures via collaborating category-level and part-level texts; (2) Cross-Modal Prompt Encoder that encodes text prompts jointly with visual embeddings into discriminative part-level representations; and (3) Part-to-Whole Adaptive Fusion and Hierarchical Decoding that adaptively fuse the part-level representations into a whole for accurate instrument segmentation in surgical scenarios.

Segmentation Semantic Segmentation

Paper
Code

HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding

1 code implementation • 23 Nov 2023 • Peng Xia, Xingtong Yu, Ming Hu, Lie Ju, Zhiyong Wang, Peibo Duan, ZongYuan Ge

We explore constructing the class hierarchy into a graph, with its nodes representing the textual or image features of each category.

Fine-Grained Visual Recognition Graph Representation Learning

Paper
Code

Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation

1 code implementation • 7 Sep 2023 • Zhuqiang Lu, Kun Hu, Chaoyue Wang, Lei Bai, Zhiyong Wang

A 360-degree (omni-directional) image provides an all-encompassing spherical view of a scene.

Image Generation

Paper
Code

The FruitShell French synthesis system at the Blizzard 2023 Challenge

no code implementations • 1 Sep 2023 • Xin Qi, Xiaopeng Wang, Zhiyong Wang, Wang Liu, Mingming Ding, Shuchen Shi

The evaluation results of our system showed a quality MOS score of 3. 6 for the Hub task and 3. 4 for the Spoke task, placing our system at an average level among all participating teams.

Data Augmentation Speech Synthesis +1

Paper
Add Code

Terrain Diffusion Network: Climatic-Aware Terrain Generation with Geological Sketch Guidance

no code implementations • 31 Aug 2023 • Zexin Hu, Kun Hu, Clinton Mo, Lei Pan, Zhiyong Wang

Sketch-based terrain generation seeks to create realistic landscapes for virtual environments in various applications such as computer games, animation and virtual reality.

Denoising

Paper
Add Code

Bridging the Gap: Fine-to-Coarse Sketch Interpolation Network for High-Quality Animation Sketch Inbetweening

no code implementations • 25 Aug 2023 • Jiaming Shen, Kun Hu, Wei Bao, Chang Wen Chen, Zhiyong Wang

The 2D animation workflow is typically initiated with the creation of keyframes using sketch-based drawing.

Paper
Add Code

Robust Audio Anti-Spoofing with Fusion-Reconstruction Learning on Multi-Order Spectrograms

1 code implementation • 18 Aug 2023 • Penghui Wen, Kun Hu, Wenxi Yue, Sen Zhang, Wanlei Zhou, Zhiyong Wang

Robust audio anti-spoofing has been increasingly challenging due to the recent advancements on deepfake techniques.

Face Swapping

Paper
Code

SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation

1 code implementation • 17 Aug 2023 • Wenxi Yue, Jing Zhang, Kun Hu, Yong Xia, Jiebo Luo, Zhiyong Wang

However, we observe two problems with this naive pipeline: (1) the domain gap between natural objects and surgical instruments leads to inferior generalisation of SAM; and (2) SAM relies on precise point or box locations for accurate segmentation, requiring either extensive manual guidance or a well-performing specialist detector for prompt preparation, which leads to a complex multi-stage pipeline.

Image Segmentation Segmentation +1

Paper
Code

When Deep Learning Meets Multi-Task Learning in SAR ATR: Simultaneous Target Recognition and Segmentation

no code implementations • 14 Aug 2023 • Chenwei Wang, Jifang Pei, Zhiyong Wang, Yulin Huang, Junjie Wu, Haiguang Yang, Jianyu Yang

In this paper, we propose a new multi-task learning approach for SAR ATR, which could obtain the accurate category and precise shape of the targets simultaneously.

Decoder Learning Theory +2

Paper
Add Code

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark

1 code implementation • NeurIPS 2023 • Zhenfei Yin, Jiong Wang, JianJian Cao, Zhelun Shi, Dingning Liu, Mukai Li, Lu Sheng, Lei Bai, Xiaoshui Huang, Zhiyong Wang, Jing Shao, Wanli Ouyang

To the best of our knowledge, we present one of the very first open-source endeavors in the field, LAMM, encompassing a Language-Assisted Multi-Modal instruction tuning dataset, framework, and benchmark.

267

Paper
Code

Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning

1 code implementation • 6 Jun 2023 • Peggy Tang, Junbin Gao, Lei Zhang, Zhiyong Wang

Recently, compressive text summarisation offers a balance between the conciseness issue of extractive summarisation and the factual hallucination issue of abstractive summarisation.

Hallucination reinforcement-learning

Paper
Code

Full Resolution Repetition Counting

no code implementations • 23 May 2023 • Jianing Li, Bowen Chen, Zhiyong Wang, Honghai Liu

Given an untrimmed video, repetitive actions counting aims to estimate the number of repetitions of class-agnostic actions.

Paper
Add Code

DSMNet: Deep High-precision 3D Surface Modeling from Sparse Point Cloud Frames

no code implementations • 9 Apr 2023 • Changjie Qiu, Zhiyong Wang, Xiuhong Lin, Yu Zang, Cheng Wang, Weiquan Liu

Second, we propose an modeling evaluation method based on HPMB for object-level modeling to overcome this limitation.

Point Cloud Registration Simultaneous Localization and Mapping

Paper
Add Code

Continuous Intermediate Token Learning with Implicit Motion Manifold for Keyframe Based Motion Interpolation

1 code implementation • CVPR 2023 • Clinton Ansun Mo, Kun Hu, Chengjiang Long, Zhiyong Wang

Deriving sophisticated 3D motions from sparse keyframes is a particularly challenging problem, due to continuity and exceptionally skeletal precision.

Motion Interpolation Motion Synthesis

Paper
Code

Multi-Scale Control Signal-Aware Transformer for Motion Synthesis without Phase

no code implementations • 3 Mar 2023 • Lintao Wang, Kun Hu, Lei Bai, Yu Ding, Wanli Ouyang, Zhiyong Wang

As past poses often contain useful auxiliary hints, in this paper, we propose a task-agnostic deep learning method, namely Multi-scale Control Signal-aware Transformer (MCS-T), with an attention based encoder-decoder architecture to discover the auxiliary information implicitly for synthesizing controllable motion without explicitly requiring auxiliary information such as phase.

Decoder Feature Engineering +1

Paper
Add Code

Efficient Explorative Key-term Selection Strategies for Conversational Contextual Bandits

1 code implementation • 1 Mar 2023 • Zhiyong Wang, Xutong Liu, Shuai Li, John C. S. Lui

To tackle these issues, we first propose ``ConLinUCB", a general framework for conversational bandits with better information incorporation, combining arm-level and key-term-level feedback to estimate user preference in one step at each time.

Computational Efficiency Multi-Armed Bandits +1

Paper
Code

Robust Knowledge Adaptation for Federated Unsupervised Person ReID

no code implementations • 18 Jan 2023 • Jianfeng Weng, Kun Hu, Tingting Yao, Jingya Wang, Zhiyong Wang

Thus, in this work, a federated unsupervised cluster-contrastive (FedUCC) learning method is proposed for Person ReID.

Federated Learning Person Re-Identification

Paper
Add Code

VAPCNet: Viewpoint-Aware 3D Point Cloud Completion

no code implementations • ICCV 2023 • Zhiheng Fu, Longguang Wang, Lian Xu, Zhiyong Wang, Hamid Laga, Yulan Guo, Farid Boussaid, Mohammed Bennamoun

In this paper, we thus propose an unsupervised viewpoint representation learning scheme for 3D point cloud completion without explicit viewpoint estimation.

Point Cloud Completion Representation Learning +1

Paper
Add Code

Towards Efficient Visual Simplification of Computational Graphs in Deep Neural Networks

no code implementations • 21 Dec 2022 • Rusheng Pan, Zhiyong Wang, Yating Wei, Han Gao, Gongchang Ou, Caleb Chen Cao, Jingli Xu, Tong Xu, Wei Chen

A computational graph in a deep neural network (DNN) denotes a specific data flow diagram (DFD) composed of many tensors and operators.

Paper
Add Code

TLDW: Extreme Multimodal Summarisation of News Videos

no code implementations • 16 Oct 2022 • Peggy Tang, Kun Hu, Lei Zhang, Jiebo Luo, Zhiyong Wang

Multimodal summarisation with multimodal output is drawing increasing attention due to the rapid growth of multimedia data.

Sentence

Paper
Add Code

Multi-level Adversarial Spatio-temporal Learning for Footstep Pressure based FoG Detection

no code implementations • 22 Sep 2022 • Kun Hu, Shaohui Mei, Wei Wang, Kaylena A. Ehgoetz Martens, Liang Wang, Simon J. G. Lewis, David D. Feng, Zhiyong Wang

The proposed scheme also sheds light on improving subject-level clinical studies from other scenarios as it can be integrated with many existing deep architectures.

Paper
Add Code

Skin Lesion Recognition with Class-Hierarchy Regularized Hyperbolic Embeddings

no code implementations • 13 Sep 2022 • Zhen Yu, Toan Nguyen, Yaniv Gal, Lie Ju, Shekhar S. Chandra, Lei Zhang, Paul Bonnington, Victoria Mar, Zhiyong Wang, ZongYuan Ge

Accordingly, the learned prototypes preserve the semantic class relations in the embedding space and we can predict the label of an image by assigning its feature to the nearest hyperbolic class prototype.

Paper
Add Code

Deep Laparoscopic Stereo Matching with Transformers

1 code implementation • 25 Jul 2022 • Xuelian Cheng, Yiran Zhong, Mehrtash Harandi, Tom Drummond, Zhiyong Wang, ZongYuan Ge

The self-attention mechanism, successfully employed with the transformer structure is shown promise in many computer vision tasks including image recognition, and object detection.

object-detection Object Detection +2

Paper
Code

Action Recognition With Motion Diversification and Dynamic Selection

no code implementations • TIP 2022 • Peiqin Zhuang, Yu Guo, Zhipeng Yu, Luping Zhou, Lei Bai, Ding Liang, Zhiyong Wang, Yali Wang, Wanli Ouyang

To address this issue, we introduce a Motion Diversification and Selection (MoDS) module to generate diversified spatio-temporal motion features and then select the suitable motion representation dynamically for categorizing the input video.

Ranked #18 on Action Recognition on Something-Something V1

Action Recognition Computational Efficiency

Paper
Add Code

1Cademy at Semeval-2022 Task 1: Investigating the Effectiveness of Multilingual, Multitask, and Language-Agnostic Tricks for the Reverse Dictionary Task

no code implementations • SemEval (NAACL) 2022 • Zhiyong Wang, Ge Zhang, Nineli Lashkarashvili

This paper describes our system for the SemEval2022 task of matching dictionary glosses to word embeddings.

Reverse Dictionary Word Embeddings

Paper
Add Code

OTExtSum: Extractive Text Summarisation with Optimal Transport

1 code implementation • Findings (NAACL) 2022 • Peggy Tang, Kun Hu, Rui Yan, Lei Zhang, Junbin Gao, Zhiyong Wang

Optimal sentence extraction is conceptualised as obtaining an optimal summary that minimises the transportation cost to a given document regarding their semantic distributions.

Sentence

Paper
Code

LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds

no code implementations • CVPR 2022 • Jialian Li, Jingyi Zhang, Zhiyong Wang, Siqi Shen, Chenglu Wen, Yuexin Ma, Lan Xu, Jingyi Yu, Cheng Wang

Quantitative and qualitative experiments show that our method outperforms the techniques based only on RGB images.

Ranked #3 on 3D Human Pose Estimation on SLOPER4D (using extra training data)

3D Human Pose Estimation

Paper
Add Code

Sign Language Translation with Hierarchical Spatio-TemporalGraph Neural Network

no code implementations • 14 Nov 2021 • Jichao Kan, Kun Hu, Markus Hagenbuchner, Ah Chung Tsoi, Mohammed Bennamounm, Zhiyong Wang

Therefore, in this paper, these unique characteristics of sign languages are formulated as hierarchical spatio-temporal graph representations, including high-level and fine-level graphs of which a vertex characterizes a specified body part and an edge represents their interactions.

Machine Translation NMT +2

Paper
Add Code

Deep Learning Techniques for In-Crop Weed Identification: A Review

no code implementations • 27 Mar 2021 • Kun Hu, Zhiyong Wang, Guy Coleman, Asher Bender, Tingting Yao, Shan Zeng, Dezhen Song, Arnold Schumann, Michael Walsh

Weeds are a significant threat to the agricultural productivity and the environment.

Paper
Add Code

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

4 code implementations • CVPR 2020 • Ziyu Liu, Hongwen Zhang, Zhenghao Chen, Zhiyong Wang, Wanli Ouyang

Spatial-temporal graphs have been widely used by skeleton-based action recognition algorithms to model human action dynamics.

Ranked #4 on 3D Action Recognition on Assembly101

Long-range modeling Skeleton Based Action Recognition

866

Paper
Code

Coupling Matrix Manifolds and Their Applications in Optimal Transport

no code implementations • 15 Nov 2019 • Dai Shi, Junbin Gao, Xia Hong, S. T. Boris Choy, Zhiyong Wang

These geometrical features of CMM have paved the way for developing numerical Riemannian optimization algorithms such as Riemannian gradient descent and Riemannian trust-region algorithms, forming a uniform optimization method for all types of OT problems.

Riemannian optimization

Paper
Add Code

IntersectGAN: Learning Domain Intersection for Generating Images with Multiple Attributes

no code implementations • 21 Sep 2019 • Zehui Yao, Boyan Zhang, Zhiyong Wang, Wanli Ouyang, Dong Xu, Dagan Feng

For example, given two image domains $X_1$ and $X_2$ with certain attributes, the intersection $X_1 \cap X_2$ denotes a new domain where images possess the attributes from both $X_1$ and $X_2$ domains.

Attribute

Paper
Add Code

Realtime and Accurate 3D Eye Gaze Capture with DCNN-based Iris and Pupil Segmentation

1 code implementation • IEEE Transactions on Visualization and Computer Graphics ( Early Access ) 2019 • Zhiyong Wang, Jinxiang Chai, Shihong Xia

A comparison against Wang et al.[3] shows that our method advances the state of the art in 3D eye tracking using a single RGB camera.

112

Paper
Code

Matrix Neural Networks

no code implementations • 15 Jan 2016 • Junbin Gao, Yi Guo, Zhiyong Wang

This process can be problematic.

Image Super-Resolution

Paper
Add Code

MRFalign: Protein Homology Detection through Alignment of Markov Random Fields

no code implementations • 12 Jan 2014 • Jianzhu Ma, Sheng Wang, Zhiyong Wang, Jinbo Xu

A sequence profile is usually represented as a position-specific scoring matrix (PSSM) or an HMM (Hidden Markov Model) and accordingly PSSM-PSSM or HMM-HMM comparison is used for homolog detection.

Multiple Sequence Alignment

Paper
Add Code

Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning

no code implementations • 10 Dec 2013 • Jianzhu Ma, Sheng Wang, Zhiyong Wang, Jinbo Xu

To further improve the accuracy of the estimated precision matrices, we employ a supervised learning method to predict contact probability from a variety of evolutionary and non-evolutionary information and then incorporate the predicted probability as prior into our GGL framework.

Paper
Add Code

Predicting protein contact map using evolutionary and physical constraints by integer programming (extended version)

no code implementations • 8 Aug 2013 • Zhiyong Wang, Jinbo Xu

Most existing methods predict the contact map matrix element-by-element, ignoring correlation among contacts and physical feasibility of the whole contact map.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.