OpenAI unveils cost-efficient GPT-4o mini model: What is it and how it works

Updated on 19-Jul-2024
HIGHLIGHTS

OpenAI has announced its most cost-efficient small AI model\ called GPT-4o mini.

GPT-4o mini is more than 60% cheaper than GPT-3.5 Turbo.

GPT-4o mini surpasses GPT-3.5 Turbo and other small models on academic benchmarks across both textual intelligence and multimodal reasoning.

OpenAI has announced its most cost-efficient small AI model called GPT-4o mini. The company expects that the GPT-4o mini will expand the range of applications built with AI by making intelligence much more affordable. GPT-4o mini is more than 60% cheaper than GPT-3.5 Turbo.

According to OpenAI, the GPT-4o mini model scores 82% on MMLU and currently outperforms GPT-4 on chat preferences in LMSYS leaderboard. GPT-4o mini supports text and vision in the API, with support for text, image, video and audio inputs and outputs coming in the future.

Also read: OpenAI launches GPT-4o AI model that’s free for all ChatGPT users: What’s new

This small model has a context window of 128K tokens, supports up to 16K output tokens per request, and has knowledge up to October 2023. With the improved tokenizer shared with GPT-4o, handling non-English text is now even more cost effective, as per the company.

GPT-4o mini surpasses GPT-3.5 Turbo and other small models on academic benchmarks across both textual intelligence and multimodal reasoning, and supports the same range of languages as GPT-4o. It also shows strong performance in function calling, and improved long-context performance.

“On MGSM, measuring math reasoning, GPT-4o mini scored 87.0%, compared to 75.5% for Gemini Flash and 71.7% for Claude Haiku. GPT-4o mini scored 87.2% on HumanEval, which measures coding performance, compared to 71.5% for Gemini Flash and 75.9% for Claude Haiku,” the company said.

Also read: OpenAI’s GPT-5 will be a major upgrade over GPT-4, says Sam Altman

Availability and pricing

GPT-4o mini is available as a text and vision model in the Assistants API, Chat Completions API, and Batch API. Developers have to pay 15 cents per 1M input tokens and 60 cents per 1M output tokens. OpenAI plans to release fine-tuning for GPT-4o mini in the coming days.

In ChatGPT, Free, Plus and Team users will be able to access GPT-4o mini starting today, in place of GPT-3.5. Furthermore, Enterprise users will also have access starting next week.

Ayushi Jain

Tech news writer by day, BGMI player by night. Combining my passion for tech and gaming to bring you the latest in both worlds.

Connect On :