OpenAI introduces its new o3 reasoning AI models to a limited number of users
Dec 20, 2024 at 8:27 PM

OpenAI has unveiled the o3 family of AI models, the successor to its o1 "reasoning" model. The new series includes o3 and a smaller version, o3-mini, designed for specific tasks.

The o3 models show significant performance gains over o1, including a 22.8-percentage-point improvement on the SWE-Bench coding benchmark, coding-challenge results that surpass those of OpenAI’s Chief Scientist, and a near-perfect score on the AIME 2024 math competition. They also scored 87.7% on GPQA Diamond, a set of expert-level science problems, and solved 25.2% of a set of complex reasoning challenges on which earlier models scored below 2%.

OpenAI has also introduced a new safety paradigm called deliberative alignment, in which models reason through safety-related decisions step by step to improve adherence to safety policies. The models are not yet publicly available, but OpenAI is inviting researchers to begin testing o3-mini immediately, with a preview of the o3 model to follow later. The company plans to launch o3-mini by the end of January, followed by the release of the main o3 model.

Dec 20, 2024 by Mauricio B. Holguin

ChatGPT

ChatGPT is an AI chatbot developed by OpenAI, leveraging the GPT-3 architecture to generate human-like text responses. Known for its AI-powered capabilities, it excels in producing high-quality text across various styles and formats. As a web-based service, it is rated 4.4. Notable alternatives include HuggingChat, Google Gemini, and GPT4ALL.
