Search Results for author: Dehao Zhang

Found 2 papers, 1 paper with code

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

1 code implementation • 20 May 2024 • Jian Hu, Xibin Wu, Weixun Wang, Xianyu, Dehao Zhang, Yu Cao

However, unlike pretraining or fine-tuning a single model, scaling reinforcement learning from human feedback (RLHF) for training large language models poses coordination challenges across four models.

reinforcement-learning Scheduling

Method Towards CVPR 2021 Image Matching Challenge

no code implementations • 10 Aug 2021 • Xiaopeng Bi, Yu Chen, Xinyang Liu, Dehao Zhang, Ran Yan, Zheng Chai, Haotian Zhang, Xiao Liu

This report describes the Megvii-3D team's approach to the CVPR 2021 Image Matching Workshop.
