News on Medial

šŸ”„ 1 person is talking about this

OpenAIā€™s new ā€œCriticGPTā€ model is trained to criticize GPT-4 outputs

ArstechnicaArstechnica Ā· 4m
OpenAIā€™s new ā€œCriticGPTā€ model is trained to criticize GPT-4 outputs

OpenAI has introduced CriticGPT, an AI model designed to detect errors in code generated by ChatGPT. The model aims to improve the alignment of AI systems with human expectations by employing Reinforcement Learning from Human Feedback (RLHF). CriticGPT, based on the GPT-4 family of large language models (LLMs), assists human reviewers by identifying coding mistakes in the output. The researchers trained CriticGPT on a dataset of intentionally bugged code to teach it how to recognize and flag different errors. According to experiments, CriticGPT's critiques were preferred by trainers over human critiques in 63% of cases involving LLM errors.

Comments

Download the medial app to read full posts, comements and news.