Search Results for author: Arjun Panickssery

Found 2 papers, 1 papers with code

LLM Evaluators Recognize and Favor Their Own Generations

no code implementations15 Apr 2024 Arjun Panickssery, Samuel R. Bowman, Shi Feng

Self-evaluation using large language models (LLMs) has proven valuable not only in benchmarking but also methods like reward modeling, constitutional AI, and self-refinement.

Benchmarking

Cannot find the paper you are looking for? You can Submit a new open access paper.