no code implementations • 27 Apr 2024 • Like Xin, Wanqi Yang, Lei Wang, Ming Yang
We assume that the view with a good cluster structure is the reliable view, which acts as a supervisor to guide the clustering of the other views.
no code implementations • 22 Apr 2024 • Xuzheng Yu, Chen Jiang, Xingning Dong, Tian Gan, Ming Yang, Qingpei Guo
In particular, text-video retrieval, which aims to find the top matching videos given text descriptions from a vast video corpus, is an essential function, the primary challenge of which is to bridge the modality gap.
no code implementations • 1 Apr 2024 • Jing Li, Quanxue Gao, Cheng Deng, Qianqian Wang, Ming Yang
Nevertheless, existing multi-view clustering methods based on anchor graph factorization lack adequate cluster interpretability for the decomposed matrix and often overlook the inter-view information.
1 code implementation • Expert Systems with Applications 2024 • Qian Zhang, Yi Zhu, Ming Yang, Ge Jin, YingWen Zhu, Qiu Chen
Although sample selection is a mainstream method in the field of learning with noisy labels, which aims to mitigate the impact of noisy labels during model training, the testing performance of these methods exhibits significant fluctuations across different noise rates and types.
Ranked #2 on Learning with noisy labels on Clothing1M
no code implementations • 17 Mar 2024 • Kangyang Xie, BinBin Yang, Hao Chen, Meng Wang, Cheng Zou, Hui Xue, Ming Yang, Chunhua Shen
Beyond the superiority of the text-to-image diffusion model in generating high-quality images, recent studies have attempted to uncover its potential for adapting the learned semantic knowledge to visual perception tasks.
no code implementations • 3 Mar 2024 • Wenhui Zhao, Quanxue Gao, Guangfei Li, Cheng Deng, Ming Yang
Despite their successes, current methods lack interpretability in the clustering process and do not sufficiently consider the complementary information across different views.
1 code implementation • 27 Feb 2024 • ZiCheng Zhang, Ruobing Zheng, Ziwen Liu, Congying Han, Tianqi Li, Meng Wang, Tiande Guo, Jingdong Chen, Bonan Li, Ming Yang
Recent works in implicit representations, such as Neural Radiance Fields (NeRF), have advanced the generation of realistic and animatable head avatars from video sequences.
no code implementations • 24 Feb 2024 • Shikun Mei, Fangfang Li, Quanxue Gao, Ming Yang
Additionally, we evolve the concept of the membership matrix between cluster centers and samples in FKM into an anchor graph encompassing multiple anchor points and samples.
no code implementations • 2 Feb 2024 • Haoxiang Gao, Yaqian Li, Kaiwen Long, Ming Yang, Yiqing Shen
The advent of foundation models has revolutionized the fields of natural language processing and computer vision, paving the way for their application in autonomous driving (AD).
1 code implementation • 31 Jan 2024 • Xingning Dong, Zipeng Feng, Chunluan Zhou, Xuzheng Yu, Ming Yang, Qingpei Guo
We then summarize this empirical study into the M2-RAAP recipe, where our technical contributions lie in 1) the data filtering and text re-writing pipeline resulting in 1M high-quality bilingual video-text pairs, 2) the replacement of video inputs with key-frames to accelerate pre-training, and 3) the Auxiliary-Caption-Guided (ACG) strategy to enhance video features.
1 code implementation • 29 Jan 2024 • Qingpei Guo, Furong Xu, Hanxiao Zhang, Wang Ren, Ziping Ma, Lin Ju, Jian Wang, Jingdong Chen, Ming Yang
Vision-language foundation models like CLIP have revolutionized the field of artificial intelligence.
Ranked #1 on Zero-shot Image Retrieval on Flickr30k-CN (using extra training data)
Zero-Shot Cross-Modal Retrieval, Zero-shot Image Retrieval
no code implementations • 4 Jan 2024 • Ziping Ma, Furong Xu, Jian Liu, Ming Yang, Qingpei Guo
To achieve multimodal alignment from both global and local perspectives, this paper proposes Symmetrizing Contrastive Captioners (SyCoCa), which introduces bidirectional interactions on images and texts across the global and local representation levels.
no code implementations • 2 Jan 2024 • Shuang Li, Ke Li, Wei Li, Ming Yang
Constrained multi-objective optimization problems (CMOPs) pervade real-world applications in science, engineering, and design.
no code implementations • 15 Dec 2023 • Xin Guo, Jiangwei Lao, Bo Dang, Yingying Zhang, Lei Yu, Lixiang Ru, Liheng Zhong, Ziyuan Huang, Kang Wu, Dingxiang Hu, Huimei He, Jian Wang, Jingdong Chen, Ming Yang, Yongjun Zhang, Yansheng Li
Prior studies on Remote Sensing Foundation Model (RSFM) reveal immense potential towards a generic model for Earth Observation.
1 code implementation • 22 Nov 2023 • Weihao Yan, Yeqiang Qian, Xingyuan Chen, Hanyang Zhuang, Chunxiang Wang, Ming Yang
It involves Semantic-Guided Mask Labeling, which assigns semantic labels to unlabeled SAM masks using UDA pseudo-labels.
no code implementations • 18 Nov 2023 • Yueyuan Li, Wei Yuan, Songan Zhang, Weihao Yan, Qiyuan Shen, Chunxiang Wang, Ming Yang
Simulators play a crucial role in autonomous driving, offering significant time, cost, and labor savings.
2 code implementations • 18 Nov 2023 • Yueyuan Li, Songan Zhang, Mingyang Jiang, Xingyuan Chen, Ming Yang
For access to the source code and participation in discussions, visit the official GitHub page for Tactics2D at https://github.com/WoodOxen/Tactics2D.
2 code implementations • 1 Oct 2023 • Shiyu Xuan, Qingpei Guo, Ming Yang, Shiliang Zhang
Specifically, we present a new method for constructing the instruction tuning dataset at a low cost by leveraging annotations in existing datasets.
1 code implementation • 20 Sep 2023 • Chen Jiang, Hong Liu, Xuzheng Yu, Qing Wang, Yuan Cheng, Jia Xu, Zhongyi Liu, Qingpei Guo, Wei Chu, Ming Yang, Yuan Qi
We thereby present a new Triplet Partial Margin Contrastive Learning (TPM-CL) module to construct partial order triplet samples by automatically generating fine-grained hard negatives for matched text-video pairs.
Ranked #4 on Video Retrieval on MSR-VTT-1kA
1 code implementation • 21 Aug 2023 • Yutao Chen, Xingning Dong, Tian Gan, Chunluan Zhou, Ming Yang, Qingpei Guo
Compared with images, we conjecture that videos necessitate more constraints to preserve the temporal consistency during editing.
no code implementations • 7 Jul 2023 • Ming Yang, Xiyuan Wei, Tianbao Yang, Yiming Ying
Then, we establish the compositional uniform stability results for two popular stochastic compositional gradient descent algorithms, namely SCGD and SCSC.
no code implementations • 23 Mar 2023 • Yi Huang, Xiaoguang Tu, Gui Fu, Tingting Liu, Bokai Liu, Ming Yang, Ziliang Feng
Images taken under low-light conditions tend to suffer from poor visibility, which can decrease image quality and even reduce the performance of the downstream tasks.
no code implementations • 21 Mar 2023 • Xiangchen Cheng, Wei Tang, Ming Yang, Li Jin
Signal-free intersections are a representative application of smart and connected vehicle technologies.
no code implementations • 13 Mar 2023 • Zihao Lin, Jinrong Li, Fan Yang, Shuangping Huang, Xu Yang, Jianmin Lin, Ming Yang
In this paper, we propose a novel model called Spatial Attention and Syntax Rule Enhanced Tree Decoder (SS-TD), which is equipped with a spatial attention mechanism to alleviate prediction errors in the tree structure and uses syntax masks (obtained from the transformation of syntax rules) to constrain the occurrence of ungrammatical mathematical expressions.
no code implementations • 5 Jan 2023 • Lei Yu, Wanqi Yang, Shengqi Huang, Lei Wang, Ming Yang
However, the goals of FS-UDA and FSL are related yet distinct, since FS-UDA aims to classify the samples in the target domain rather than the source domain.
1 code implementation • 21 Nov 2022 • Tao Li, Weihao Yan, Zehao Lei, Yingwen Wu, Kun Fang, Ming Yang, Xiaolin Huang
To fully uncover the great potential of deep neural networks (DNNs), various learning algorithms have been developed to improve the model's generalization ability.
no code implementations • 17 Nov 2022 • Ming Yang, Yanhan Wang, Xin Wang, Zhenyong Zhang, Xiaoming Wu, Peng Cheng
Federated learning is a distributed learning paradigm that allows each client to keep the original data locally and only upload the parameters of the local model to the server.
1 code implementation • 4 Nov 2022 • Xiaoyu Geng, Qiang Guo, Shuaixiong Hui, Ming Yang, Caiming Zhang
To this end, we integrate nonlocal self-similarity into N-TRPCA, and further develop a nonconvex and nonlocal TRPCA (NN-TRPCA) model.
no code implementations • 5 Oct 2022 • Qisheng Wang, Ming Yang, Xinrui Zhu
The palindromic tree (a.k.a. eertree) is a linear-size data structure that provides access to all palindromic substrings of a string.
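As a rough illustration of the data structure named in the entry above, the following is a minimal sketch of the standard online eertree construction. It is not taken from the listed paper; the class and method names (`Eertree`, `add`, `palindromes`) are hypothetical and chosen only for this example. Each node stands for one distinct palindromic substring, which is why the structure stays linear in the length of the string.

```python
class Eertree:
    """Minimal palindromic tree: one node per distinct palindromic substring."""

    def __init__(self):
        # Node 0 is the imaginary root (length -1); node 1 is the empty palindrome.
        self.length = [-1, 0]
        self.link = [0, 0]      # suffix link: longest proper palindromic suffix
        self.edges = [{}, {}]   # edges[v][c] -> node representing c + pal(v) + c
        self.end = [-1, -1]     # index in text where the palindrome first ended
        self.text = []
        self.last = 1           # node of the longest palindromic suffix so far

    def _find(self, v, i):
        # Follow suffix links until text[i] can extend palindrome v on both sides.
        # Node 0 (length -1) always matches, so the loop terminates.
        while True:
            j = i - self.length[v] - 1
            if j >= 0 and self.text[j] == self.text[i]:
                return v
            v = self.link[v]

    def add(self, ch):
        """Append one character; return True if a new palindrome appeared."""
        self.text.append(ch)
        i = len(self.text) - 1
        parent = self._find(self.last, i)
        if ch in self.edges[parent]:
            self.last = self.edges[parent][ch]
            return False
        node = len(self.length)
        new_len = self.length[parent] + 2
        if new_len == 1:
            suffix = 1  # a single character links to the empty palindrome
        else:
            suffix = self.edges[self._find(self.link[parent], i)][ch]
        self.length.append(new_len)
        self.link.append(suffix)
        self.edges.append({})
        self.end.append(i)
        self.edges[parent][ch] = node
        self.last = node
        return True

    def palindromes(self):
        """Return every distinct non-empty palindromic substring seen so far."""
        s = "".join(self.text)
        return [s[self.end[v] - self.length[v] + 1 : self.end[v] + 1]
                for v in range(2, len(self.length))]


def distinct_palindromes(s):
    tree = Eertree()
    for ch in s:
        tree.add(ch)
    return tree.palindromes()
```

Calling `distinct_palindromes("eertree")` walks the string once and returns one entry per node; a string of length n has at most n distinct non-empty palindromic substrings, so the node count never exceeds n + 2.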
no code implementations • 7 Sep 2022 • Weihao Yan, Yeqiang Qian, Chunxiang Wang, Ming Yang
Panoptic segmentation combines the advantages of semantic and instance segmentation, which can provide both pixel-level and instance-level environmental perception information for intelligent vehicles.
1 code implementation • 23 Aug 2022 • Weihao Yan, Yeqiang Qian, Chunxiang Wang, Ming Yang
In stage one, we design a threshold-adaptative unsupervised focal loss to regularize the prediction in the target domain, which has a mild gradient neutralization mechanism and mitigates the problem that hard samples are barely optimized in entropy-based methods.
no code implementations • 4 Dec 2021 • Xiaoxiao Yang, Yeqian Qiang, Huijie Zhu, Chunxiang Wang, Ming Yang
Thermal infrared (TIR) imagery has proven effective in providing temperature cues that complement RGB features for multispectral pedestrian detection.
no code implementations • 15 Oct 2021 • Wei Xia, Quanxue Gao, Ming Yang, Xinbo Gao
Thus, for the OOS nodes, SCAGC can directly calculate their clustering labels.
1 code implementation • ICLR 2022 • Pengcheng Yang, XiaoMing Zhang, Wenpeng Zhang, Ming Yang, Hong Wei
The recent trend of using large-scale deep neural networks (DNN) to boost performance has propelled the development of the parallel pipelining technique for efficient DNN training, which has resulted in the development of several prominent pipelines such as GPipe, PipeDream, and PipeDream-2BW.
1 code implementation • Findings (EMNLP) 2021 • Shifeng Huang, Jiawei Wang, Jiao Xu, Da Cao, Ming Yang
Specifically, given a math word problem, the model first retrieves similar questions by a memory module and then encodes the unsolved problem and each retrieved question using a representation module.
Ranked #7 on Math Word Problem Solving on Math23K
no code implementations • 6 Aug 2021 • Shengqi Huang, Wanqi Yang, Lei Wang, Luping Zhou, Ming Yang
Inspired by the recent local descriptor based few-shot learning (FSL), our general UDA model is fully built upon local descriptors (LDs) for image classification and domain adaptation.
no code implementations • 2 Jul 2021 • Guanghui Wang, Ming Yang, Lijun Zhang, Tianbao Yang
In this paper, we further improve the stochastic optimization of AURPC by (i) developing novel stochastic momentum methods with a better iteration complexity of $O(1/\epsilon^4)$ for finding an $\epsilon$-stationary solution; and (ii) designing a novel family of stochastic adaptive methods with the same iteration complexity, which enjoy faster convergence in practice.
1 code implementation • CVPR 2021 • Bowen Cheng, Lu Sheng, Shaoshuai Shi, Ming Yang, Dong Xu
Inspired by the back-tracing strategy in the conventional Hough voting methods, in this work, we introduce a new 3D object detection method, named as Back-tracing Representative Points Network (BRNet), which generatively back-traces the representative points from the vote centers and also revisits complementary seed points around these generated points, so as to better capture the fine local structural features surrounding the potential objects from the raw point clouds.
Ranked #17 on 3D Object Detection on ScanNetV2
1 code implementation • CVPR 2021 • Jialian Wu, Jiale Cao, Liangchen Song, Yu Wang, Ming Yang, Junsong Yuan
Most online multi-object trackers perform object detection stand-alone in a neural net without any input from tracking.
Ranked #1 on Instance Segmentation on nuScenes
no code implementations • 11 Mar 2021 • Xingyu Jiang, Mingyang Qin, Xinjian Wei, Zhongpei Feng, Jiezun Ke, Haipeng Zhu, Fucong Chen, Liping Zhang, Li Xu, Xu Zhang, Ruozhou Zhang, Zhongxu Wei, Peiyu Xiong, Qimei Liang, Chuanying Xi, Zhaosheng Wang, Jie Yuan, Beiyi Zhu, Kun Jiang, Ming Yang, Junfeng Wang, Jiangping Hu, Tao Xiang, Brigitte Leridon, Rong Yu, Qihong Chen, Kui Jin, Zhongxian Zhao
Iron selenide (FeSe), the structurally simplest iron-based superconductor, has attracted tremendous interest in the past years.
Superconductivity
no code implementations • 21 Jan 2021 • Ming Yang, Alceste Z. Bonanos, Biwei Jiang, Man I Lam, Jian Gao, Panagiotis Gavras, Grigoris Maravelias, Shu Wang, Xiao-Dian Chen, Frank Tramper, Yi Ren, Zoi T. Spetsieri
Further separating RSG candidates from the rest of the LSG candidates is done by using semi-empirical criteria on NIR CMDs and resulted in 323 RSG candidates.
Solar and Stellar Astrophysics, Astrophysics of Galaxies
1 code implementation • 19 Jan 2021 • Zhuoman Liu, Wei Jia, Ming Yang, Peiyao Luo, Yong Guo, Mingkui Tan
To address the above issues, in this paper, we propose a novel deep generative model, called Self-Consistent Generative Network (SCGN), which synthesizes novel views from the given input views without explicitly exploiting the geometric information.
no code implementations • ICCV 2021 • Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan
This task is confronted with two challenges: how to establish the 3D correspondences from views to the BEV map and how to assemble occupancy information across views.
Ranked #7 on Multiview Detection on MultiviewX
1 code implementation • 7 Dec 2020 • Jiansheng Fang, Xiaoqing Zhang, Yan Hu, Yanwu Xu, Ming Yang, Jiang Liu
The Latent Factor Model (LFM) is one of the most successful methods for collaborative filtering (CF) in recommendation systems, in which both users and items are projected into a joint latent factor space.
no code implementations • 23 Sep 2020 • Zehan Zhang, Ming Zhang, Zhidong Liang, Xian Zhao, Ming Yang, Wenming Tan, ShiLiang Pu
Experimental results on the KITTI dataset demonstrate significant improvement in filtering false positives over the approach using only point cloud data.
no code implementations • 20 Apr 2020 • Wanqi Yang, Tong Ling, Chengmei Yang, Lei Wang, Yinghuan Shi, Luping Zhou, Ming Yang
To address this issue, we propose a novel approach called Conditional ADversarial Image Translation (CADIT) to explicitly align the class distributions given samples between the two domains.
no code implementations • 2 Apr 2020 • Xiaoliang Wang, Yeqiang Qian, Chunxiang Wang, Ming Yang
As one of the most important tasks in autonomous driving systems, ego-lane detection has been extensively studied and has achieved impressive results in many scenarios.
1 code implementation • 24 Sep 2019 • Chenchen Zhao, Yeqiang Qian, Ming Yang
The 2D and 3D dimensions of pedestrians are determined from the camera captures and further utilized through two feedforward links connected to the orientation estimator.
2 code implementations • ICCV 2019 • Naiyu Gao, Yanhu Shan, Yupei Wang, Xin Zhao, Yinan Yu, Ming Yang, Kaiqi Huang
Moreover, building on the learned affinity pyramid, a novel cascaded graph partition module is presented to sequentially generate instances from coarse to fine.
2 code implementations • 25 Jul 2019 • Shihao Zhang, Huazhu Fu, Yuguang Yan, Yubing Zhang, Qingyao Wu, Ming Yang, Mingkui Tan, Yanwu Xu
Learning structural information is critical for producing an ideal result in retinal image segmentation.
no code implementations • 29 Jun 2019 • Liuyuan Deng, Ming Yang, Tianyi Li, Yuesheng He, Chunxiang Wang
To instantiate this structure, the paper proposes a residual fusion block (RFB) to formulate the interdependences of the encoders.
Ranked #3 on Semantic Segmentation on ScanNetV2
1 code implementation • 24 Jun 2019 • Shunan Mao, Shiliang Zhang, Ming Yang
RIFE adopts two feature extraction streams weighted by a dual-attention block to learn features for low and high resolution images, respectively.
no code implementations • 27 May 2019 • Haoyan Liu, Yanming Liu, Ming Yang, Xiaoping Li
For reentry or near space communication, owing to the influence of the time-varying plasma sheath channel environment, the received IQ baseband signals are severely rotated on the constellation.
2 code implementations • CVPR 2019 • Jianzhong He, Shiliang Zhang, Ming Yang, Yanhu Shan, Tiejun Huang
Exploiting multi-scale representations is critical to improve edge detection for objects at different scales.
Ranked #2 on Edge Detection on BRIND
no code implementations • 20 Feb 2019 • Yi Ren, B. W. Jiang, Ming Yang, Jian Gao
The period-luminosity (P-L) relation is analyzed for the RSGs in the fundamental mode.
Solar and Stellar Astrophysics, Astrophysics of Galaxies
no code implementations • 14 Feb 2019 • Zhidong Liang, Ming Yang, Chunxiang Wang
As a result, our framework can output both the semantic prediction and the instance prediction.
3D Instance Segmentation, 3D Semantic Instance Segmentation
no code implementations • 31 Oct 2018 • Xiao Liang, Liyuan Chen, Dan Nguyen, Zhiguo Zhou, Xuejun Gu, Ming Yang, Jing Wang, Steve Jiang
Dose calculation accuracy using sCT images has been improved over the original CBCT images, with the average Gamma Index passing rate increased from 95.4% to 97.4% for 1 mm/1% criteria.
Medical Physics
no code implementations • ECCV 2018 • Liangliang Ren, Xin Yuan, Jiwen Lu, Ming Yang, Jie Zhou
Visual tracking is confronted by the dilemma to locate a target both accurately and efficiently, and make decisions online whether and how to adapt the appearance model or even restart tracking.
1 code implementation • ECCV 2018 • Ke Gong, Xiaodan Liang, Yicheng Li, Yimin Chen, Ming Yang, Liang Lin
Instance-level human parsing towards real-world human analysis scenarios is still under-explored due to the absence of sufficient data resources and technical difficulty in parsing multiple instances in a single pass.
Ranked #6 on Human Part Segmentation on CIHP
17 code implementations • ECCV 2018 • Tianwei Lin, Xu Zhao, Haisheng Su, Chongjing Wang, Ming Yang
Temporal action proposal generation is an important yet challenging problem, since temporal proposals with rich action content are indispensable for analysing real-world videos with long duration and a high proportion of irrelevant content.
Ranked #3 on Temporal Action Proposal Generation on THUMOS'14
no code implementations • CVPR 2018 • Jingwen Chen, Jia-Wei Chen, Hongyang Chao, Ming Yang
In this paper, we consider a typical image blind denoising problem, which is to remove unknown noise from noisy images.
no code implementations • CVPR 2018 • Weixiang Hong, Zhenzhen Wang, Ming Yang, Junsong Yuan
In recent years, deep neural nets have triumphed over many computer vision problems, including semantic segmentation, which is a critical task in emerging autonomous driving and medical image diagnostics applications.
no code implementations • 2 Jan 2018 • Liuyuan Deng, Ming Yang, Hao Li, Tianyi Li, Bing Hu, Chunxiang Wang
Finally, an RDC based semantic segmentation model is built; the model is trained for real-world surround view images through a multi-task learning architecture by combining real-world images with transformed images.
no code implementations • 9 Sep 2017 • Mingwei Cao, Ming Yang, Chunxiang Wang, Yeqiang Qian, Bing Wang
For contemporary panoramic camera-laser scanner systems, the traditional calibration method is not suitable, because the imaging model of a panoramic camera is extremely nonlinear.
no code implementations • 13 Feb 2017 • You Lin, Ming Yang, Can Wan, Jianhui Wang, Yonghua Song
Therefore, a novel multi-model combination (MMC) approach for short-term probabilistic wind generation forecasting is proposed in this paper to exploit the advantages of different forecasting models.
no code implementations • 3 Oct 2016 • Yingming Li, Ming Yang, Zhongfei Zhang
Consequently, we first review the representative methods and theories of multi-view representation learning based on the perspective of alignment, such as correlation-based alignment.
1 code implementation • 27 Sep 2016 • Zhao Kang, Chong Peng, Ming Yang, Qiang Cheng
To alleviate this problem, this paper proposes a simple recommendation algorithm that fully exploits the similarity information among users and items and intrinsic structural information of the user-item matrix.
no code implementations • 18 Dec 2014 • Yunchao Gong, Liu Liu, Ming Yang, Lubomir Bourdev
In this paper, we tackle this model storage issue by investigating information theoretical vector quantization methods for compressing the parameters of CNNs.
2 code implementations • Conference on Computer Vision and Pattern Recognition (CVPR) 2014 • Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf
In modern face recognition, the conventional pipeline consists of four stages: detect => align => represent => classify.
Ranked #1 on 3D Face Modelling on LFW
no code implementations • CVPR 2015 • Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, Lior Wolf
Scaling machine learning methods to very large datasets has attracted considerable attention in recent years, thanks to easy access to ubiquitous sensing and data from the web.