no code implementations • 20 Jun 2023 • Pengzhen Ren, Kaidong Zhang, Hetao Zheng, Zixuan Li, Yuhang Wen, Fengda Zhu, Mas Ma, Xiaodan Liang
To conduct a comprehensive and systematic evaluation of the robot manipulation model in terms of language understanding and physical execution, we also created a robotic manipulation benchmark with progressive reasoning tasks, called SeaWave.
1 code implementation • 7 Jun 2023 • Kaidong Zhang, Ziyang Gan, Dong Liu, Xifu Shang
For THA, it is of clinical significance to analyze the bone structure from the CT images, especially to observe the structure of the acetabulum and femoral head, before the surgical procedure.
1 code implementation • 1 Jun 2023 • Chang Liu, Shunxin Xu, Jialun Peng, Kaidong Zhang, Dong Liu
To address this problem, we propose a two-stage image inpainting method termed SketchRefiner.
1 code implementation • 19 May 2023 • Chang Liu, Rui Li, Kaidong Zhang, Xin Luo, Dong Liu
To offer more controllability for the generation process, existing studies, termed as early-constraint methods in this paper, leverage extra conditions and incorporate them into pre-trained diffusion models.
Ranked #1 on Conditional Text-to-Image Synthesis on COCO 2017 val
Conditional Image Generation Conditional Text-to-Image Synthesis
1 code implementation • 26 Apr 2023 • Kaidong Zhang, Dong Liu
Different from the previous methods, SAMed is built upon the large-scale image segmentation model, Segment Anything Model (SAM), to explore the new research paradigm of customizing large-scale models for medical image segmentation.
2 code implementations • 24 Jan 2023 • Kaidong Zhang, Jialun Peng, Jingjing Fu, Dong Liu
Transformers have been widely used for video processing owing to the multi-head self attention (MHSA) mechanism.
Ranked #1 on Video Inpainting on DAVIS (SSIM (square) metric)
1 code implementation • 14 Aug 2022 • Kaidong Zhang, Jingjing Fu, Dong Liu
Especially in spatial transformer, we design a dual perspective spatial MHSA, which integrates the global tokens to the window-based attention.
1 code implementation • CVPR 2022 • Kaidong Zhang, Jingjing Fu, Dong Liu
We propose a flow completion network to align and aggregate flow features from the consecutive flow sequences based on the inertia prior.