Search Results for author: Yunqiu Xu

Found 8 papers, 6 papers with code

Fire Burns, Sword Cuts: Commonsense Inductive Bias for Exploration in Text-based Games

1 code implementation ACL 2022 Dongwon Ryu, Ehsan Shareghi, Meng Fang, Yunqiu Xu, Shirui Pan, Reza Haf

Text-based games (TGs) are exciting testbeds for developing deep reinforcement learning techniques due to their partially observed environments and large action spaces.

Efficient Exploration, Inductive Bias +2

Manifold-based Incomplete Multi-view Clustering via Bi-Consistency Guidance

no code implementations 16 May 2024 Huibing Wang, Mingze Yao, Yawei Chen, Yunqiu Xu, Haipeng Liu, Wei Jia, Xianping Fu, Yang Wang

Moreover, to preserve the consistency information among multiple views, MIMB implements a biconsistency guidance strategy with reverse regularization of the consensus representation and proposes a manifold embedding measure for exploring the hidden structure of the recovered data.

Clustering, Incomplete multi-view clustering +1

Goal Randomization for Playing Text-based Games without a Reward Function

no code implementations 29 Sep 2021 Meng Fang, Yunqiu Xu, Yali Du, Ling Chen, Chengqi Zhang

In a variety of text-based games, we show that this simple method results in competitive performance for agents.

Decision Making, text-based games

Generalization in Text-based Games via Hierarchical Reinforcement Learning

1 code implementation Findings (EMNLP) 2021 Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Chengqi Zhang

Deep reinforcement learning provides a promising approach for text-based games in studying natural language communication between humans and artificial agents.

Hierarchical Reinforcement Learning, reinforcement-learning +2

Self-Correction for Human Parsing

2 code implementations 22 Oct 2019 Peike Li, Yunqiu Xu, Yunchao Wei, Yi Yang

To tackle the problem of learning with label noise, this work introduces a purification strategy, called Self-Correction for Human Parsing (SCHP), to progressively improve the reliability of the supervised labels as well as the learned models.

Human Parsing, Human Part Segmentation +1
