no code implementations • 29 Apr 2024 • Bo Chen, Shoukang Hu, Qi Chen, Chenpeng Du, Ran Yi, Yanmin Qian, Xie Chen
We present GStalker, a 3D audio-driven talking face generation model with Gaussian Splatting that achieves both fast training (40 minutes) and real-time rendering (125 FPS) from only a 3$\sim$5 minute training video, whereas previous 2D and 3D NeRF-based modeling frameworks require hours of training and seconds of rendering per frame.
no code implementations • 24 Apr 2024 • Teng Hu, Jiangning Zhang, Ran Yi, Yating Wang, Hongrui Huang, Jieyu Weng, Yabiao Wang, Lizhuang Ma
Furthermore, we propose a few-shot camera motion disentanglement method to extract the common camera motion from multiple videos with similar camera motions, which employs a window-based clustering technique to extract the common features in temporal attention maps of multiple videos.
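The window-based clustering idea can be illustrated with a rough numpy sketch. All names, the window size, and the variance-threshold heuristic below are illustrative assumptions, not the paper's actual algorithm: per-video temporal attention maps are pooled into windows, and windows whose features agree across videos are kept as the "common" camera motion.

```python
import numpy as np

def common_motion_features(attn_maps, window=4, tol=0.5):
    """Hypothetical sketch: keep window features that agree across
    videos' temporal attention maps (not the paper's implementation)."""
    feats = []
    for m in attn_maps:                    # each map: (T, D) array
        T = (m.shape[0] // window) * window
        w = m[:T].reshape(-1, window, m.shape[1]).mean(axis=1)  # per-window mean
        feats.append(w)
    n = min(f.shape[0] for f in feats)
    feats = np.stack([f[:n] for f in feats])   # (num_videos, n, D)
    spread = feats.std(axis=0).mean(axis=1)    # disagreement across videos
    return feats.mean(axis=0)[spread < tol]    # windows consistent across videos
```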
no code implementations • 8 Apr 2024 • Yating Wang, Ran Yi, Ke Fan, Jinkun Hao, Jiangbo Lu, Lizhuang Ma
Our goal is to leverage the strengths of neural volume rendering for multi-view reconstruction of face meshes with consistent topology.
1 code implementation • 4 Apr 2024 • Sichen Chen, Yingyi Zhang, Siming Huang, Ran Yi, Ke Fan, Ruixin Zhang, Peixian Chen, Jun Wang, Shouhong Ding, Lizhuang Ma
To mitigate the problem of under-fitting, we design a transformer module named Multi-Cycled Transformer (MCT), based on multiple cycled forward passes, to more fully exploit the potential of a small set of model parameters.
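The notion of a multi-cycled forward pass, re-applying one small block several times with shared weights, can be sketched minimally. This is a hypothetical numpy sketch; the block, residual form, and cycle count are illustrative assumptions, not the MCT design:

```python
import numpy as np

rng = np.random.default_rng(0)

class CycledBlock:
    """Hypothetical sketch: one small block re-applied for several
    cycles, reusing the same parameters each pass."""
    def __init__(self, dim, cycles=3):
        self.w = rng.standard_normal((dim, dim)) * 0.1
        self.cycles = cycles
    def forward(self, x):
        for _ in range(self.cycles):       # same weights, multiple passes
            x = x + np.tanh(x @ self.w)    # residual keeps cycling stable
        return x
```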
1 code implementation • 17 Jan 2024 • Hexiang Wang, Fengqi Liu, Qianyu Zhou, Ran Yi, Xin Tan, Lizhuang Ma
To address this issue, we propose to model motion from the source image to the driving frame in highly-expressive diffeomorphism spaces.
no code implementations • 23 Dec 2023 • Changsong Lei, Mengfei Xia, Shaofeng Wang, Yaqian Liang, Ran Yi, Yuhui Wen, YongJin Liu
To address this challenge, we propose a general tooth arrangement neural network based on the diffusion probabilistic model.
no code implementations • 15 Dec 2023 • Yige Chen, Ang Chen, Siyuan Chen, Ran Yi
Firstly, our work divides the editing process into a geometry editing stage and a texture editing stage to achieve more detailed and photo-realistic results. Secondly, to perform non-rigid transformations with controllable results while maintaining fidelity to the original 3D model, we propose a multi-view-embedding (MVE) optimization strategy, which ensures that the diffusion model learns the overall features of the original object, and an embedding-fusion (EF) method, which controls the degree of editing by adjusting the fusing rate.
1 code implementation • 10 Dec 2023 • Teng Hu, Jiangning Zhang, Ran Yi, Yuzhen Du, Xu Chen, Liang Liu, Yabiao Wang, Chengjie Wang
Existing anomaly inspection methods are limited in their performance due to insufficient anomaly data.
no code implementations • 30 Nov 2023 • Mengfei Xia, Yujun Shen, Ceyuan Yang, Ran Yi, Wenping Wang, Yong-Jin Liu
In this work, we revisit the mathematical foundations of GANs, and theoretically reveal that the native adversarial loss for GAN training is insufficient to fix the problem of subsets with positive Lebesgue measure of the generated data manifold lying out of the real data manifold.
no code implementations • 9 Nov 2023 • Haokun Zhu, Juang Ian Chong, Teng Hu, Ran Yi, Yu-Kun Lai, Paul L. Rosin
Vector graphics are widely used in graphic design and have received increasing attention.
no code implementations • 14 Oct 2023 • Mengfei Xia, Yujun Shen, Changsong Lei, Yu Zhou, Ran Yi, Deli Zhao, Wenping Wang, Yong-Jin Liu
By viewing the generation of diffusion models as a discretized integrating process, we argue that the quality drop is partly caused by applying an inaccurate integral direction to a timestep interval.
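The "inaccurate integral direction" argument can be seen on a toy ODE: a one-slope Euler step applies a single direction across the whole timestep interval, while a Heun-style step corrects it with the slope at the interval's end. This is a generic numerical-integration illustration with a stand-in drift f(x, t) = -x, not the paper's sampler:

```python
def f(x, t):
    """Stand-in drift for the probability-flow ODE (illustrative only)."""
    return -x

def euler_step(x, t, dt):
    # one direction, evaluated at the start of the interval
    return x + dt * f(x, t)

def heun_step(x, t, dt):
    # average the directions at both ends of the interval
    d1 = f(x, t)
    d2 = f(x + dt * d1, t + dt)
    return x + 0.5 * dt * (d1 + d2)
```

Integrating dx/dt = -x from x(0) = 1 to t = 1, the two-direction Heun step lands much closer to the exact value e^-1 than Euler does, which mirrors the claim that a better integral direction per interval reduces the quality drop at few sampling steps.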
1 code implementation • 4 Oct 2023 • Yuze He, Yushi Bai, Matthieu Lin, Wang Zhao, Yubin Hu, Jenny Sheng, Ran Yi, Juanzi Li, Yong-Jin Liu
Recent methods in text-to-3D leverage powerful pretrained diffusion models to optimize NeRF.
no code implementations • 30 Sep 2023 • Yuze He, Peng Wang, Yubin Hu, Wang Zhao, Ran Yi, Yong-Jin Liu, Wenping Wang
In this paper, we explore the potential of MPI and show that MPI can synthesize high-quality novel views of complex scenes with diverse camera distributions and view directions, and is no longer limited to simple forward-facing scenes.
1 code implementation • ICCV 2023 • Zhimin Sun, Shen Chen, Taiping Yao, Bangjie Yin, Ran Yi, Shouhong Ding, Lizhuang Ma
The challenge in sourcing attribution for forgery faces has gained widespread attention due to the rapid development of generative techniques.
2 code implementations • 7 Sep 2023 • Teng Hu, Ran Yi, Haokun Zhu, Liang Liu, Jinlong Peng, Yabiao Wang, Chengjie Wang, Lizhuang Ma
To solve the problem, we propose Compositional Neural Painter, a novel stroke-based rendering framework which dynamically predicts the next painting region based on the current canvas, instead of dividing the image plane uniformly into painting regions.
1 code implementation • 7 Sep 2023 • Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Liang Liu, Yabiao Wang, Chengjie Wang
To improve the facial representation quality, we use the feature map of a pre-trained visual backbone as a supervision signal and use a partially pre-trained decoder for masked image modeling.
1 code implementation • ICCV 2023 • Teng Hu, Jiangning Zhang, Liang Liu, Ran Yi, Siqi Kou, Haokun Zhu, Xu Chen, Yabiao Wang, Chengjie Wang, Lizhuang Ma
To address these problems, we propose a novel phasic content fusing few-shot diffusion model with directional distribution consistency loss, which targets different learning objectives at distinct training stages of the diffusion model.
1 code implementation • ICCV 2023 • Zhiwei Zhang, Zhizhong Zhang, Qian Yu, Ran Yi, Yuan Xie, Lizhuang Ma
3D panoptic segmentation is a challenging perception task that requires both semantic segmentation and instance segmentation.
1 code implementation • 12 Jul 2023 • Ke Fan, Changan Wang, Yabiao Wang, Chengjie Wang, Ran Yi, Lizhuang Ma
Glass-like objects are widespread in daily life but remain difficult for most existing methods to segment.
no code implementations • 18 May 2023 • Bin Fang, Bo Li, Shuang Wu, Ran Yi, Shouhong Ding, Lizhuang Ma
The unauthorized use of personal data for commercial purposes and the clandestine acquisition of private data for training machine learning models continue to raise concerns.
no code implementations • 18 May 2023 • Bin Fang, Bo Li, Shuang Wu, Tianyi Zheng, Shouhong Ding, Ran Yi, Lizhuang Ma
One of the crucial factors contributing to this success has been the access to an abundance of high-quality data for constructing machine learning models.
1 code implementation • CVPR 2023 • Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Xuequan Lu, Ran Yi, Shouhong Ding, Lizhuang Ma
To address these issues, we propose a novel perspective for DG FAS that aligns features on the instance level without the need for domain labels.
1 code implementation • CVPR 2023 • Ran Yi, Haoyuan Tian, Zhihao Gu, Yu-Kun Lai, Paul L. Rosin
To fill the gap in the field of artistic image aesthetics assessment (AIAA), we first introduce a large-scale AIAA dataset: the Boldbrush Artistic Image Dataset (BAID), which consists of 60,337 artistic images covering various art forms, with more than 360,000 votes from online users.
2 code implementations • ICCV 2023 • Junshu Tang, Tengfei Wang, Bo Zhang, Ting Zhang, Ran Yi, Lizhuang Ma, Dong Chen
In this work, we investigate the problem of creating high-fidelity 3D content from only a single image.
1 code implementation • CVPR 2023 • Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Yabiao Wang, Chengjie Wang
2D-based Industrial Anomaly Detection has been widely discussed; however, multimodal industrial anomaly detection based on 3D point clouds and RGB images still has many untouched fields.
Ranked #3 on RGB+3D Anomaly Detection and Segmentation on MVTEC 3D-AD (using extra training data)
1 code implementation • ICCV 2023 • Zhihao Gu, Liang Liu, Xu Chen, Ran Yi, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Annan Shu, Guannan Jiang, Lizhuang Ma
Specifically, we first propose a normality recall memory (NR Memory) to strengthen the normality of student-generated features by recalling the stored normal information.
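A normality-recall step can be imagined as querying a bank of stored normal features and blending the nearest entries. This is a purely hypothetical sketch; the function names and the k-nearest blend are assumptions, not the NR Memory design:

```python
import numpy as np

def recall_normality(feat, memory, k=3):
    """Hypothetical sketch: replace a query feature with a blend of
    its k nearest stored normal features."""
    d = np.linalg.norm(memory - feat, axis=1)  # distance to each entry
    idx = np.argsort(d)[:k]                    # k closest normal entries
    return memory[idx].mean(axis=0)            # recalled normal estimate
```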
Ranked #11 on Anomaly Detection on MVTec AD
no code implementations • 20 Jul 2022 • Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Kekai Sheng, Shouhong Ding, Lizhuang Ma
Most existing UDA FAS methods fit the trained models to the target domain by aligning the distribution of semantic high-level features.
no code implementations • 20 Jul 2022 • Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Shouhong Ding, Lizhuang Ma
Existing DG-based FAS approaches typically capture domain-invariant features to generalize to various unseen domains.
no code implementations • 13 Apr 2022 • Zipeng Ye, Zhiyao Sun, Yu-Hui Wen, Yanan Sun, Tian Lv, Ran Yi, Yong-Jin Liu
In this paper, we propose a method to generate talking-face videos with continuously controllable expressions in real-time.
1 code implementation • CVPR 2022 • Junshu Tang, Zhijun Gong, Ran Yi, Yuan Xie, Lizhuang Ma
An asymmetric keypoint locator, including an unsupervised multi-scale keypoint detector and a complete keypoint generator, is proposed for localizing aligned keypoints from complete and partial point clouds.
no code implementations • 16 Mar 2022 • Yue Wang, Ran Yi, Luying Li, Ying Tai, Chengjie Wang, Lizhuang Ma
We propose a new encoder that embeds real faces into Z+ space, and a dual-path training strategy to better cope with the adapted decoder and eliminate artifacts.
1 code implementation • 8 Feb 2022 • Ran Yi, Yong-Jin Liu, Yu-Kun Lai, Paul L. Rosin
In this paper, we propose a novel method to automatically transform face photos into portrait drawings using unpaired training data, with two new features: our method can (1) learn to generate high-quality portrait drawings in multiple styles using a single network, and (2) generate portrait drawings in a "new style" unseen in the training data.
no code implementations • 16 Jan 2022 • Zipeng Ye, Mengfei Xia, Ran Yi, Juyong Zhang, Yu-Kun Lai, Xuwei Huang, Guoxin Zhang, Yong-Jin Liu
In this paper, we present a dynamic convolution kernel (DCK) strategy for convolutional neural networks.
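The core idea of a dynamic convolution kernel, conv weights generated on the fly from a conditioning vector rather than stored as fixed parameters, can be sketched in 1D. This is a hypothetical numpy sketch; the kernel generator and all names are illustrative assumptions, not the paper's DCK:

```python
import numpy as np

def dynamic_conv1d(x, cond, w_gen):
    """Hypothetical DCK sketch: the conv kernel is produced from a
    per-sample conditioning vector instead of being a fixed weight."""
    k = np.tanh(cond @ w_gen)          # (ksize,) kernel from the condition
    pad = len(k) // 2
    xp = np.pad(x, pad)                # same-length output via zero padding
    return np.array([xp[i:i + len(k)] @ k for i in range(len(x))])
```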
no code implementations • 13 Jan 2022 • Yifeng Chen, Wenqing Chu, Fangfang Wang, Ying Tai, Ran Yi, Zhenye Gan, Liang Yao, Chengjie Wang, Xi Li
Recently, there has been growing attention on one-stage panoptic segmentation methods, which aim to segment instances and stuff jointly and efficiently within a fully convolutional pipeline.
1 code implementation • CVPR 2022 • Shaohua Guo, Liang Liu, Zhenye Gan, Yabiao Wang, Wuhao Zhang, Chengjie Wang, Guannan Jiang, Wei zhang, Ran Yi, Lizhuang Ma, Ke Xu
The huge burdens of computation and memory are two obstacles in ultra-high resolution image segmentation.
no code implementations • 28 Dec 2021 • Qiqi Gu, Shen Chen, Taiping Yao, Yang Chen, Shouhong Ding, Ran Yi
The progressive enhancement process facilitates the learning of discriminative features with fine-grained face forgery clues.
1 code implementation • 11 Oct 2021 • Qianyu Zhou, Chuyun Zhuang, Ran Yi, Xuequan Lu, Lizhuang Ma
In this paper, we propose a novel and fully end-to-end trainable approach, called regional contrastive consistency regularization (RCCR) for domain adaptive semantic segmentation.
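A region-level contrastive consistency term can be sketched as an InfoNCE-style loss between matched region features from two views. This is a hypothetical numpy sketch; the names, temperature, and exact loss form are assumptions, not the precise RCCR objective:

```python
import numpy as np

def region_consistency_loss(feat_a, feat_b, tau=0.1):
    """Hypothetical sketch: pull matching region features from two
    views together and push mismatched regions apart."""
    a = feat_a / np.linalg.norm(feat_a, axis=1, keepdims=True)
    b = feat_b / np.linalg.norm(feat_b, axis=1, keepdims=True)
    logits = a @ b.T / tau                      # (R, R) region similarities
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(logits)
    p /= p.sum(axis=1, keepdims=True)
    return -np.log(np.diag(p)).mean()           # diagonal = matched regions
```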
no code implementations • 1 Mar 2021 • Ran Yi, Yang Zhou, Xin Wang, Zhiyuan Liu, Xiaotian Li, Bin Ran
This paper presents an infrastructure-assisted constrained trajectory optimization method for connected automated vehicles (CAVs) on curved roads.
no code implementations • 1 Sep 2020 • Paul L. Rosin, Yu-Kun Lai, David Mould, Ran Yi, Itamar Berger, Lars Doyle, Seungyong Lee, Chuan Li, Yong-Jin Liu, Amir Semmo, Ariel Shamir, Minjung Son, Holger Winnemoller
Despite the recent upsurge of activity in image-based non-photorealistic rendering (NPR), and in particular portrait image stylisation, due to the advent of neural style transfer, the state of performance evaluation in this field is limited, especially compared to the norms in the computer vision and machine learning communities.
1 code implementation • CVPR 2020 • Ran Yi, Yong-Jin Liu, Yu-Kun Lai, Paul L. Rosin
We observe that, due to the significant imbalance of information richness between photos and drawings, existing unpaired transfer methods such as CycleGAN tend to embed invisible reconstruction information indiscriminately in the whole drawing, leading to important facial features being partially missing in drawings.
1 code implementation • 15 Mar 2020 • Zipeng Ye, Mengfei Xia, Yanan Sun, Ran Yi, MinJing Yu, Juyong Zhang, Yu-Kun Lai, Yong-Jin Liu
The most challenging issue for our system is that the source domain of face photos (characterized by normal 2D faces) is significantly different from the target domain of 3D caricatures (characterized by 3D exaggerated face shapes and textures).
1 code implementation • 24 Feb 2020 • Ran Yi, Zipeng Ye, Juyong Zhang, Hujun Bao, Yong-Jin Liu
In this paper, we address this problem by proposing a deep neural network model that takes an audio signal A of a source person and a very short video V of a target person as input, and outputs a synthesized high-quality talking face video with personalized head pose (making use of the visual information in V), expression and lip synchronization (by considering both A and V).
no code implementations • 17 Nov 2019 • Yiheng Han, Wang Zhao, Jia Pan, Zipeng Ye, Ran Yi, Yong-Jin Liu
Motion planning for robots of high degrees-of-freedom (DOFs) is an important problem in robotics with sampling-based methods in configuration space C as one popular solution.
6 code implementations • CVPR 2019 • Ran Yi, Yong-Jin Liu, Yu-Kun Lai, Paul L. Rosin
Moreover, artists tend to use different strategies to draw different facial features and the lines drawn are only loosely related to obvious image features.
no code implementations • CVPR 2018 • Ran Yi, Yong-Jin Liu, Yu-Kun Lai
We propose an efficient Lloyd-like method with a splitting-merging scheme to compute a uniform tessellation on M, which induces the CSS in X. Theoretically, our method has a good competitive ratio of O(1).
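The basic relax-to-centroid loop behind any Lloyd-like method can be shown with plain Lloyd iteration in the Euclidean plane. This simplified sketch omits both the manifold M and the paper's splitting-merging scheme:

```python
import numpy as np

rng = np.random.default_rng(2)

def lloyd(points, k, iters=20):
    """Plain Lloyd iteration: assign points to their nearest site,
    then move each site to its cell's centroid, and repeat."""
    sites = points[rng.choice(len(points), k, replace=False)]
    for _ in range(iters):
        d = np.linalg.norm(points[:, None] - sites[None], axis=2)
        labels = d.argmin(axis=1)          # nearest-site assignment
        for j in range(k):
            cell = points[labels == j]
            if len(cell):
                sites[j] = cell.mean(axis=0)  # relax site to centroid
    return sites, labels
```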