OpenAI has released a new GPT-4-based model named CriticGPT, designed to critique the outputs of ChatGPT. CriticGPT identifies subtle inaccuracies in AI-generated content that might escape human reviewers, strengthening the reinforcement learning from human feedback (RLHF) process. The tool works alongside human trainers to produce more comprehensive critiques, helping to reduce AI hallucinations and improve the overall quality of AI interactions. In practice, human reviewers equipped with CriticGPT outperform unassisted reviewers 60% of the time when assessing ChatGPT’s code outputs. OpenAI trained CriticGPT by having AI trainers intentionally insert errors into code and write feedback on them, so that CriticGPT could learn to spot and critique such errors accurately. The model uses a technique called Force Sampling Beam Search (FSBS) to balance thoroughness against accuracy in its critiques, that is, flagging more problems without inventing ones that are not there. Separately, OpenAI has decided to block API access from mainland China and Hong Kong, a move likely influenced by ongoing geopolitical tensions.
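
To make the thoroughness-versus-accuracy balance attributed to FSBS more concrete, here is a minimal Python sketch of one plausible selection step. It is an illustration under stated assumptions, not OpenAI's implementation: the `Candidate` fields, the `select_critique` function, and the `length_bonus` parameter are all hypothetical names, and the scoring rule (a quality score plus a bonus per flagged code section) is assumed for the example.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One sampled critique (hypothetical structure, for illustration only)."""
    text: str
    rm_score: float   # assumed quality score from some reward model
    num_flagged: int  # how many code sections this critique flags


def select_critique(candidates: list[Candidate], length_bonus: float) -> Candidate:
    """Pick the critique with the best combined score.

    Assumed rule: combined = rm_score + length_bonus * num_flagged.
    A larger length_bonus favors more thorough critiques; a bonus of 0
    falls back to the quality score alone.
    """
    if not candidates:
        raise ValueError("need at least one candidate critique")
    return max(candidates, key=lambda c: c.rm_score + length_bonus * c.num_flagged)


# Two made-up candidates: one short and precise, one longer and broader.
short = Candidate("Flags one off-by-one bug.", rm_score=0.9, num_flagged=1)
broad = Candidate("Flags four possible issues.", rm_score=0.7, num_flagged=4)

precise_pick = select_critique([short, broad], length_bonus=0.0)
thorough_pick = select_critique([short, broad], length_bonus=0.1)
```

With no bonus the shorter, higher-scoring critique is selected; with a bonus of 0.1 per flagged section the broader critique wins. Turning that single knob is one way to trade precision for coverage.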