Search Results for author: Yao-Xiang Ding

Found 6 papers, 1 papers with code

Reinforcement Learning from Bagged Reward

no code implementations6 Feb 2024 Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama

In Reinforcement Learning (RL), it is commonly assumed that an immediate reward signal is generated for each action taken by the agent, helping the agent maximize cumulative rewards to obtain the optimal policy.

Reinforcement Learning (RL)

Generating by Understanding: Neural Visual Generation with Logical Symbol Groundings

1 code implementation26 Oct 2023 Yifei Peng, Yu Jin, Zhexu Luo, Yao-Xiang Ding, Wang-Zhou Dai, Zhong Ren, Kun Zhou

There are two levels of symbol grounding problems among the core challenges: the first is symbol assignment, i. e. mapping latent factors of neural visual generators to semantic-meaningful symbolic factors from the reasoning systems by learning from limited labeled data.

Seeing Differently, Acting Similarly: Heterogeneously Observable Imitation Learning

no code implementations17 Jun 2021 Xin-Qiang Cai, Yao-Xiang Ding, Zi-Xuan Chen, Yuan Jiang, Masashi Sugiyama, Zhi-Hua Zhou

In many real-world imitation learning tasks, the demonstrator and the learner have to act under different observation spaces.

Imitation Learning

Imitation Learning from Pixel-Level Demonstrations by HashReward

no code implementations9 Sep 2019 Xin-Qiang Cai, Yao-Xiang Ding, Yuan Jiang, Zhi-Hua Zhou

One of the key issues for imitation learning lies in making policy learned from limited samples to generalize well in the whole state-action space.

Dimensionality Reduction Imitation Learning

Preference Based Adaptation for Learning Objectives

no code implementations NeurIPS 2018 Yao-Xiang Ding, Zhi-Hua Zhou

In many real-world learning tasks, it is hard to directly optimize the true performance measures, meanwhile choosing the right surrogate objectives is also difficult.

Multi-Label Learning

Crowdsourcing with Unsure Option

no code implementations1 Sep 2016 Yao-Xiang Ding, Zhi-Hua Zhou

One of the fundamental problems in crowdsourcing is the trade-off between the number of the workers needed for high-accuracy aggregation and the budget to pay.

Cannot find the paper you are looking for? You can Submit a new open access paper.