Search Results for author: Yufei Kuang

Found 8 papers, 0 papers with code

Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space

no code implementations • 20 May 2024 • Qianmei Liu, Yufei Kuang, Jie Wang

Many studies use adversarial learning to generate perturbations during training to model the discrepancy and improve the robustness of DRL.

Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming

no code implementations • 19 Apr 2024 • Jie Wang, Zhihai Wang, Xijun Li, Yufei Kuang, Zhihao Shi, Fangzhou Zhu, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

Moreover, we observe that (P3) the preferred order of the selected cuts also significantly impacts the efficiency of MILP solvers.

Machine Learning Insides OptVerse AI Solver: Design Principles and Applications

no code implementations • 11 Jan 2024 • Xijun Li, Fangzhou Zhu, Hui-Ling Zhen, Weilin Luo, Meng Lu, Yimin Huang, Zhenan Fan, Zirui Zhou, Yufei Kuang, Zhihai Wang, Zijie Geng, Yang Li, Haoyang Liu, Zhiwu An, Muming Yang, Jianshu Li, Jie Wang, Junchi Yan, Defeng Sun, Tao Zhong, Yong Zhang, Jia Zeng, Mingxuan Yuan, Jianye Hao, Jun Yao, Kun Mao

To this end, we present a comprehensive study on the integration of machine learning (ML) techniques into Huawei Cloud's OptVerse AI Solver, which aims to mitigate the scarcity of real-world mathematical programming instances, and to surpass the capabilities of traditional optimization techniques.

Decision Making, Management

Promoting Generalization for Exact Solvers via Adversarial Instance Augmentation

no code implementations • 22 Oct 2023 • Haoyang Liu, Yufei Kuang, Jie Wang, Xijun Li, Yongdong Zhang, Feng Wu

To tackle this problem, we propose a novel approach, called Adversarial Instance Augmentation, which does not require knowledge of the problem type for new instance generation, to promote data diversity for learning-based branching modules in branch-and-bound (B&B) solvers (AdaSolver).

Imitation Learning

Accelerate Presolve in Large-Scale Linear Programming via Reinforcement Learning

no code implementations • 18 Oct 2023 • Yufei Kuang, Xijun Li, Jie Wang, Fangzhou Zhu, Meng Lu, Zhihai Wang, Jia Zeng, Houqiang Li, Yongdong Zhang, Feng Wu

Specifically, we formulate the routine design task as a Markov decision process and propose an RL framework with adaptive action sequences to generate high-quality presolve routines efficiently.

Reinforcement Learning (RL)

Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model

no code implementations • 1 Feb 2023 • Zhihai Wang, Xijun Li, Jie Wang, Yufei Kuang, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

Cut selection -- which aims to select a proper subset of the candidate cuts to improve the efficiency of solving MILPs -- heavily depends on (P1) which cuts should be preferred, and (P2) how many cuts should be selected.

Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization

no code implementations • 20 Dec 2021 • Yufei Kuang, Miao Lu, Jie Wang, Qi Zhou, Bin Li, Houqiang Li

Many existing algorithms learn robust policies by modeling the disturbance and applying it to source environments during training, which usually requires prior knowledge about the disturbance and control of simulators.

Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method

no code implementations • NeurIPS 2020 • Qi Zhou, Yufei Kuang, Zherui Qiu, Houqiang Li, Jie Wang

However, in continuous action spaces, integrating entropy regularization with expressive policies is challenging and usually requires complex inference procedures.

Continuous Control, Reinforcement Learning
