OpenAI Announces GPT-4.1 - The Smartest Model for Complex Tasks

OpenAI has officially introduced three new models: GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. These models come with a massive context capacity of up to 1 million tokens and a knowledge limit updated until June 2024.

The company says these models are superior to the recently updated GPT-4o and GPT-4o mini, which were released last July. GPT-4.1 is currently only available via API, so you won't be able to use it directly in ChatGPT yet.

OpenAI notes that GPT-4.1 will only be available via API. In ChatGPT, many improvements in instruction compliance, programming, and intelligence have been gradually built into the latest version of GPT-4o, and the company will continue to add more in future releases.

OpenAI Announces GPT-4.1 - The Smartest Model for Complex Tasks

Benchmarks show the notable improvements that GPT-4.1 brings. The model scored 54.6% on SWE-bench Verified, a 21.4-point increase over GPT-4o. The model scored 38.3% on MultiChallenge, a guideline-based benchmark, and set a new record for long-video understanding with a score of 72.0% on the Video-MME benchmark, where models analyze videos up to an hour long without captions.

OpenAI has also collaborated with alpha partners to test the performance of GPT-4.1 in real-world use cases.

  • Thomson Reuters tested GPT-4.1 with its legal AI assistant CoCounsel. Compared to GPT-4o, GPT-4.1 achieved a 17% increase in accuracy in multi-document evaluation. This type of work relies heavily on the ability to track context across multiple sources and identify complex relationships such as conflicting terms or hidden dependencies, and GPT-4.1 consistently demonstrated strong performance.
  • Carlyle used GPT-4.1 to extract financial data from long, complex documents, including Excel and PDF files. According to the company’s internal benchmarks, the model performed 50 percent better than previous models at document retrieval. It was the first model to reliably handle problems such as finding a “needle in a haystack,” missing information in the middle of a document, and arguments that require connecting information across multiple files.

Performance is one thing, but speed is equally important. OpenAI says GPT-4.1 returns the first token in about 15 seconds when processing 128,000 tokens, and up to 30 seconds at a full million tokens. The GPT-4.1 mini and nano are even faster.

GPT-4.1 nano typically responds in under 5 seconds to prompts with 128,000 input tokens. Prompt caching can further reduce latency while saving costs.

Image understanding also made significant progress. In particular, GPT-4.1 mini outperformed GPT-4o on various visual benchmarks.

  • On MMMU (including charts, diagrams, and maps), GPT-4.1 mini scored 73%. This is higher than GPT-4.5 and far exceeds the 56% of GPT-4o mini.
  • On MathVista (which tests the ability to solve image problems), both GPT-4.1 and GPT-4.1 mini scored 57%, far surpassing the 37% of GPT-4o mini.
  • On CharXiv-Reasoning , where models answer questions based on scientific graphs, GPT-4.1 continues to lead.
  • On Video-MME (long videos without subtitles), GPT-4.1 achieved 72%, a significant improvement over GPT-4o's 65%.

About price:

  • GPT-4.1 costs $2 per 1 million tokens input and $8 for output.
  • GPT-4.1 mini is priced at $0.40 for input and $1.60 for output.
  • GPT-4.1 nano costs $0.10 input and $0.40 output.

Using prompt caching or the Batch API can further reduce these costs, which is great for large-scale applications. OpenAI is also preparing to deprecate support for GPT-4.5 Preview on July 14, 2025, citing GPT-4.1's better performance, lower latency, and lower costs.

Sign up and earn $1000 a day ⋙

Leave a Comment

O1-pro is OpenAIs most expensive AI model to date

O1-pro is OpenAIs most expensive AI model to date

OpenAI has released a more powerful version of its o1 reasoning AI model, o1-pro, in its developer API.

OpenAI Announces ChatGPT Pro Plan for a whopping $200 per month

OpenAI Announces ChatGPT Pro Plan for a whopping $200 per month

OpenAI currently offers four ChatGPT subscription levels to meet the needs of different customer groups.

OpenAI Introduces ChatGPT Projects: New Features to Organize Smarter Conversations

OpenAI Introduces ChatGPT Projects: New Features to Organize Smarter Conversations

By creating a project, users can keep conversations, files, and customization instructions all in one place. This allows them to easily return to the work they were doing.

OpenAI Announces Initiative to Build AI Standards for Industries

OpenAI Announces Initiative to Build AI Standards for Industries

OpenAI has just announced the Pioneers Program – an effort to promote the application of AI in real-world situations.

Softbank plans to surpass Microsoft to become OpenAIs largest investor

Softbank plans to surpass Microsoft to become OpenAIs largest investor

Japanese investment giant Softbank is planning to invest between $15 billion and $25 billion in OpenAI. If the deal goes through, Softbank will become OpenAI’s largest investor, replacing Microsoft.

Users can chat with Santa using ChatGPTs Voice Mode

Users can chat with Santa using ChatGPTs Voice Mode

ChatGPT will help you do things better, giving you the opportunity to chat directly with Santa Claus.

OpenAI to Release Orion, Its Next Big AI Model, in December

OpenAI to Release Orion, Its Next Big AI Model, in December

OpenAI plans to launch Orion, its next major AI model, in December, according to The Verge.

Alibaba Launches AI Model That Can Read Human Emotions

Alibaba Launches AI Model That Can Read Human Emotions

Chinese e-commerce giant Alibaba has continued to make headlines by launching a new AI model that it claims is capable of reading human emotions.

OpenAI Launches GPT Store and ChatGPT Team, Taking ChatGPT Ecosystem to the Next Level

OpenAI Launches GPT Store and ChatGPT Team, Taking ChatGPT Ecosystem to the Next Level

After a long wait and countless rumors, OpenAI has finally announced the long-awaited launch of the GPT Store and ChatGPT Team.

Amazon Announces Nova Sonic Sound Model, Claims Performance Surpasses OpenAI and Google

Amazon Announces Nova Sonic Sound Model, Claims Performance Surpasses OpenAI and Google

Amazon today introduced Nova Sonic, an advanced speech-to-speech model that enables developers to build apps that can converse with human-like voices in real time.

Copilot is the best way to use GPT-4 Turbo for free

Copilot is the best way to use GPT-4 Turbo for free

If you want to try GPT-4 Turbo, using Microsoft's Copilot tool is the best way to do it.

OpenAI quietly kills hero GPT-4

OpenAI quietly kills hero GPT-4

OpenAI is officially discontinuing GPT-4, one of the company's most famous AI models that went viral two years ago.

OpenAI is close to striking a deal with Samsung to use its AI features in Galaxy phones

OpenAI is close to striking a deal with Samsung to use its AI features in Galaxy phones

According to South Korean publication The Korea Herald, artificial intelligence giant OpenAI wants to position itself as a potential rival to Google.

OpenAI develops voice reconstruction technology from just 15-second recording

OpenAI develops voice reconstruction technology from just 15-second recording

OpenAI Launches Technology That Can Recreate Anyone's Voice With Just a 15-Second Recording.

How to use Circle Ks CK Club app to receive attractive offers

How to use Circle Ks CK Club app to receive attractive offers

To get the fastest promotional information from Circle K, you should install the CK Club application. The application saves your payments when shopping or paying at Circle K as well as the number of stamps collected.

Instagram Will Allow Reels Up to 3 Minutes Long

Instagram Will Allow Reels Up to 3 Minutes Long

Instagram has just announced that it will allow users to post Reels videos up to 3 minutes long, double the previous 90-second limit.

How to view Chromebook CPU information

How to view Chromebook CPU information

This article will guide you how to view CPU information, check CPU speed directly on your Chromebook.

8 Cool Things You Can Do With an Old Android Tablet

8 Cool Things You Can Do With an Old Android Tablet

If you don't want to sell or give away your old tablet, you can use it in 5 ways: as a high-quality photo frame, music player, e-book & magazine reader, housework assistant, and as a secondary screen.

How to get beautiful nails quickly

How to get beautiful nails quickly

You want to have a beautiful, shiny, healthy nail quickly. The simple tips for beautiful nails below will be useful for you.

Color inspiration secrets only designers know

Color inspiration secrets only designers know

This article will list color-inspired tips, shared by top designers from the Creative Market community, so you can get the perfect color combination every time.

Everything you need to replace your laptop with a phone

Everything you need to replace your laptop with a phone

Can you really replace your laptop with your phone? Yes, but you'll need the right accessories to turn your phone into a laptop.

ChatGPT will soon be able to see everything happening on your screen

ChatGPT will soon be able to see everything happening on your screen

One important thing in the full event video was that the upcoming ChatGPT app feature was demoed but no real details were shared. That is, ChatGPT's ability to see everything that's happening on the user's device screen.

AI is learning to fool humans despite being trained to be honest

AI is learning to fool humans despite being trained to be honest

Many top AIs, despite being trained to be honest, learn to deceive through training and systematically induce users into false beliefs, a new study finds.

How to change questions on ChatGPT

How to change questions on ChatGPT

ChatGPT now has a question change option so users can edit the question or content they are exchanging with ChatGPT.

How to spot fake QR codes and keep your data safe

How to spot fake QR codes and keep your data safe

QR codes seem pretty harmless until you scan a bad one and get something nasty thrown at you. If you want to keep your phone and data safe, there are a few ways you can spot a fake QR code.

Qualcomm Launches X85 5G Modem With a Series of Notable Improvements

Qualcomm Launches X85 5G Modem With a Series of Notable Improvements

On stage at MWC 2025, Qualcomm made a splash when it introduced its eighth generation of 5G modem called the X85, which is expected to be used in flagship smartphones launching later this year.

New technology allows phones to change color flexibly

New technology allows phones to change color flexibly

You have a trendy “Ultramarine” iPhone 16, but one fine day you suddenly feel bored with that color; what will you do?

Microsoft integrates DeepSeek into the PC Copilot+ platform

Microsoft integrates DeepSeek into the PC Copilot+ platform

In January, Microsoft announced plans to bring NPU-optimized versions of the DeepSeek-R1 model directly to Copilot+ computers running on Qualcomm Snapdragon X processors.

Difference between IF and Switch functions in Excel

Difference between IF and Switch functions in Excel

The IF statement is a common logical function in Excel. The SWITCH statement is less well known, but you can use it instead of the IF statement in some situations.