- OpenAI has introduced state-of-the-art 'o1' advanced reasoning models that excel at science, coding, and math.
- New models include OpenAI o1, OpenAI o1-preview and OpenAI o1-mini. Preview and mini models are available starting today to ChatGPT Plus users.
- OpenAI says o1 models have been trained using a chain-of-thought technique combined with reinforcement learning.
After months of anticipation, OpenAI has finally introduced a series of new models called ‘o1’ that excel at advanced reasoning, which were earlier referred to as Strawberry AI. New models include OpenAI o1, OpenAI o1-preview and OpenAI o1-mini. Preview and mini models are available starting today to paid ChatGPT Plus users. At a later date, OpenAI o1-mini will be available to free ChatGPT users as well.
OpenAI says o1 models take some time to think before generating a response, but they can “reason through complex tasks” and solve harder problems in math, science, and coding. In addition, OpenAI says that new reasoning models perform on par with PhD students on challenging science topics.
To give you a benchmark, the OpenAI o1 model scored 83% in a rigorous exam like the International Mathematics Olympiad (IMO) whereas GPT-4o could only solve 13% of problems. And in the Codeforces competition, the new o1 model reached the 89th percentile whereas GPT-4o stood at the 11th percentile.
In the MMLU benchmark, OpenAI o1 scored 92.3 and on the MATH benchmark, it scored 94.8. OpenAI says in tasks where heavy reasoning is required, o1 closely matches the performance of human experts, which is pretty significant.
The o1 models have been trained using a chain-of-thought technique through reinforcement learning. It breaks down the steps into simpler ones and approaches each step through different strategies until it reaches the correct conclusion. By the way, currently, o1 models only support textual input. You can’t use the model to browse the web or analyze files and images.