no code implementations • 19 May 2024 • Ziyu Zhu, Zhuofan Zhang, Xiaojian Ma, Xuesong Niu, Yixin Chen, Baoxiong Jia, Zhidong Deng, Siyuan Huang, Qing Li
A unified model for 3D vision-language (3D-VL) understanding is expected to take various scene representations and perform a wide range of tasks in a 3D scene.
1 code implementation • ICCV 2023 • Ziyu Zhu, Xiaojian Ma, Yixin Chen, Zhidong Deng, Siyuan Huang, Qing Li
3D vision-language grounding (3D-VL) is an emerging field that aims to connect the 3D physical world with natural language, which is crucial for achieving embodied intelligence.
no code implementations • 8 Aug 2023 • Ziyu Zhu, Ximing Xu
The integration of sequencing data with clinical information is a widely accepted strategy in bioinformatics and health informatics.
no code implementations • 18 Jun 2023 • Tianjiao Luo, Ziyu Zhu, Jianfei Chen, Jun Zhu
We theoretically prove that the training process of DiracGANs-BMC is globally exponential stable and derive bounds on the rate of convergence.
no code implementations • 20 Nov 2022 • Yiqi Jin, Ziyu Zhu, Tongda Xu, Yuhuan Lin, Yan Wang
For octree-based point cloud compression, previous works show that the information of ancestor nodes and sibling nodes are equally important for predicting current node.
1 code implementation • 20 Sep 2022 • Tongda Xu, Han Gao, Chenjian Gao, Yuanyuan Wang, Dailan He, Jinyong Pi, Jixiang Luo, Ziyu Zhu, Mao Ye, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang
In this paper, we consider the problem of bit allocation in Neural Video Compression (NVC).
1 code implementation • 8 Mar 2022 • Jiajun Fei, Ziyu Zhu, Wenlei Liu, Zhidong Deng, Mingyang Li, Huanjun Deng, Shuo Zhang
We strictly prove that any permutation-invariant function implemented by DuMLP-Pin can be decomposed into two or more permutation-equivariant ones in a dot-product way as the cardinality of the given input set is greater than a threshold.
no code implementations • 22 Aug 2019 • Hao Wu, Ziyu Zhu, Jiayi Wang, Nanning Zheng, Badong Chen
The framework comprises two parts: forward encoding model that deals with visual stimuli and inner state model that captures influence from intrinsic connections in the brain.