OpenAI’s new “CriticGPT” model is trained to criticize GPT-4 outputs

Ars Technica · 1y ago
OpenAI has introduced CriticGPT, an AI model designed to detect errors in code generated by ChatGPT. The model is meant to support Reinforcement Learning from Human Feedback (RLHF), the process used to align AI systems with human expectations. CriticGPT, based on the GPT-4 family of large language models (LLMs), assists human reviewers by identifying coding mistakes in ChatGPT's output. The researchers trained CriticGPT on a dataset of code containing intentionally inserted bugs to teach it to recognize and flag different kinds of errors. In the researchers' experiments, trainers preferred CriticGPT's critiques over human critiques in 63% of cases involving LLM errors.
