Search Results for author: Hanhan Zhou

Found 9 papers, 4 papers with code

Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

no code implementations • 22 Mar 2024 • Zuyuan Zhang, Hanhan Zhou, Mahdi Imani, Taeyoung Lee, Tian Lan

With the advancements of artificial intelligence (AI), we're seeing more scenarios that require AI to work closely with other agents, whose goals and strategies might not be known beforehand.

Starcraft Starcraft II

Paper
Add Code

Real-time Network Intrusion Detection via Decision Transformers

no code implementations • 12 Dec 2023 • Jingdi Chen, Hanhan Zhou, Yongsheng Mei, Gina Adam, Nathaniel D. Bastian, Tian Lan

Many cybersecurity problems that require real-time decision-making based on temporal observations can be abstracted as a sequence modeling problem, e. g., network intrusion detection from a sequence of arriving packets.

Decision Making Network Intrusion Detection +1

Paper
Add Code

Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning

no code implementations • 28 Aug 2023 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Offline reinforcement learning aims to utilize datasets of previously gathered environment-action interaction records to learn a policy without access to the real environment.

D4RL Off-policy evaluation +2

Paper
Add Code

MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

1 code implementation • 21 Feb 2023 • Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

To this end, we propose MAC-PO, which formulates optimal prioritized experience replay for multi-agent problems as a regret minimization over the sampling weights of transitions.

Decision Making Multi-agent Reinforcement Learning +3

Paper
Code

ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning

no code implementations • 11 Feb 2023 • Yongsheng Mei, Hanhan Zhou, Tian Lan

Such an optimization problem can be relaxed and solved using the Lagrangian multiplier method to obtain the close-form optimal projection weights.

Decision Making reinforcement-learning +2

Paper
Add Code

PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning

1 code implementation • 22 Jun 2022 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Multi-agent reinforcement learning (MARL) has witnessed significant progress with the development of value function factorization methods.

counterfactual Multi-agent Reinforcement Learning +5

Paper
Code

On the Convergence of Heterogeneous Federated Learning with Arbitrary Adaptive Online Model Pruning

1 code implementation • 27 Jan 2022 • Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding

In this paper, we present a unifying framework for heterogeneous FL algorithms with {\em arbitrary} adaptive online model pruning and provide a general convergence analysis.

Federated Learning Open-Ended Question Answering

Paper
Code

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

1 code implementation • 4 Jan 2022 • Hanhan Zhou, Tian Lan, Vaneet Aggarwal

To this end, we present LSF-SAC, a novel framework that features a variational inference-based information-sharing mechanism as extra state information to assist individual agents in the value function factorization.

Starcraft Starcraft II +1

Paper
Code

PT-VTON: an Image-Based Virtual Try-On Network with Progressive Pose Attention Transfer

no code implementations • 23 Nov 2021 • Hanhan Zhou, Tian Lan, Guru Venkataramani

The virtual try-on system has gained great attention due to its potential to give customers a realistic, personalized product presentation in virtualized settings.

Pose Transfer Virtual Try-on

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.