Search Results for author: Kiante Brantley

Found 2 papers, 2 papers with code

Learning to Generate Better Than Your LLM

1 code implementation • 20 Jun 2023 • Jonathan D. Chang, Kiante Brantley, Rajkumar Ramamurthy, Dipendra Misra, Wen Sun

In particular, we extend RL algorithms to allow them to interact with a dynamic black-box guide LLM and propose RL with guided feedback (RLGF), a suite of RL algorithms for LLM fine-tuning.

Conditional Text Generation reinforcement-learning +1

116

Paper
Code

Disagreement-Regularized Imitation Learning

2 code implementations • ICLR 2020 • Kiante Brantley, Wen Sun, Mikael Henaff

We present a simple and effective algorithm designed to address the covariate shift problem in imitation learning.

Continuous Control Imitation Learning

391

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.