openai
PublishedJune 27, 2024 at 10:00 AM
Finding GPT-4’s mistakes with GPT-4
Publisher summary· verbatim
CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF
Discussion
No replies yet. Be first.
Originally published on openai ↗