Strawberry AI is Here: OpenAI Introduces 'o1' Advanced Reasoning Models

Image Courtesy: OpenAI

In Short

OpenAI has introduced state-of-the-art 'o1' advanced reasoning models that excel at science, coding, and math.
New models include OpenAI o1, OpenAI o1-preview and OpenAI o1-mini. Preview and mini models are available starting today to ChatGPT Plus users.
OpenAI says o1 models have been trained using a chain-of-thought technique combined with reinforcement learning.

After months of anticipation, OpenAI has finally introduced a new series of models called ‘o1’, earlier referred to as Strawberry AI, that excel at advanced reasoning. The new models include OpenAI o1, OpenAI o1-preview, and OpenAI o1-mini. The preview and mini models are available starting today to ChatGPT Plus users. At a later date, OpenAI o1-mini will be available to free ChatGPT users as well.

OpenAI says o1 models take some time to think before generating a response, but they can “reason through complex tasks” and solve harder problems in math, science, and coding. In addition, OpenAI says that the new reasoning models perform on par with PhD students on challenging science topics.

OpenAI o1 model benchmarked against GPT-4o (Image Courtesy: OpenAI)

To give you a benchmark, the OpenAI o1 model scored 83% on a qualifying exam for the International Mathematics Olympiad (IMO), whereas GPT-4o could solve only 13% of the problems. And in Codeforces competitions, the new o1 model reached the 89th percentile, whereas GPT-4o stood at the 11th percentile.

OpenAI o1 benchmarks (Image Courtesy: OpenAI)

On the MMLU benchmark, OpenAI o1 scored 92.3, and on the MATH benchmark, it scored 94.8. OpenAI says that in tasks requiring heavy reasoning, o1 closely matches the performance of human experts, which is pretty significant.

The o1 models have been trained using a chain-of-thought technique combined with reinforcement learning. A model breaks a problem down into simpler steps and tries different strategies for each step until it reaches the correct conclusion. Note that o1 models currently support only text input. You can’t use them to browse the web or analyze files and images.

Arjun Sha

Passionate about Windows, ChromeOS, Android, security and privacy issues. Have a penchant to solve everyday computing problems.