OpenAI Unveils GPT-5.4 With Major Accuracy Improvements and New Tool Search System
News Synopsis
Artificial intelligence leader OpenAI has introduced GPT-5.4, the latest addition to its advanced AI model lineup. Designed for professional and enterprise applications, the new model promises stronger reasoning abilities, improved accuracy, and better efficiency compared to previous versions. Along with enhanced performance benchmarks, the release also introduces a new Tool Search system that improves how AI models interact with external tools and APIs.
OpenAI Launches GPT-5.4 With Improved Accuracy and Advanced Tool Integration
OpenAI Expands Its GPT-5 Series
OpenAI has officially announced the launch of GPT-5.4, calling it one of the company’s most capable and efficient frontier AI models developed to date. The new release is part of the expanding GPT-5 family and is designed to support demanding professional tasks across industries such as finance, legal services, data analysis, and software development.
Unlike earlier versions, GPT-5.4 is available in multiple variants tailored for different use cases. These include the standard GPT-5.4 model, GPT-5.4 Thinking, which focuses on advanced reasoning and complex problem solving, and GPT-5.4 Pro, a higher-performance version designed for enterprise-grade workloads.
By offering multiple versions, OpenAI aims to provide developers and organizations with flexibility depending on their performance requirements and application complexity.
Massive Context Window Enables Larger Data Processing
One of the most significant upgrades in GPT-5.4 is its expanded context window. The API version of the model supports up to one million tokens, allowing developers to process far larger documents, conversations, and datasets within a single request.
This expanded capability makes the model particularly useful for tasks such as analyzing long research papers, reviewing legal documents, summarizing complex reports, and processing large codebases. In previous generations of AI models, developers often had to split long documents into smaller segments, which could affect context and accuracy.
With GPT-5.4’s larger context capacity, developers can now work with significantly more information at once, improving both efficiency and consistency in AI-generated results.
Improved Efficiency and Lower Token Usage
OpenAI also highlights improved efficiency in the new model. According to the company, GPT-5.4 can solve similar problems using fewer tokens compared with earlier versions such as GPT-5.2.
Token efficiency is important because it directly affects both processing speed and operational cost. When fewer tokens are required to generate accurate responses, developers benefit from faster performance and lower API usage costs.
This improvement makes GPT-5.4 particularly appealing for enterprise environments where large-scale AI usage can generate significant computing expenses.
Strong Benchmark Performance Across Evaluations
Benchmark testing indicates substantial performance gains in several areas. GPT-5.4 achieved top scores in computer-use evaluations such as OSWorld-Verified and WebArena Verified, which test an AI model’s ability to interact with software systems and perform digital tasks.
The model also scored 83 percent on OpenAI’s GDPval evaluation, a benchmark designed to measure performance on knowledge-based professional tasks. These tasks include document analysis, business reasoning, research summarization, and analytical problem solving.
Additionally, GPT-5.4 performed strongly in the APEX-Agents benchmark, developed by Mercor, which evaluates professional capabilities like financial modeling, legal reasoning, and strategic analysis.
According to Brendan Foody, CEO of Mercor, the model demonstrated impressive performance when generating complex deliverables such as financial models, legal reports, and presentation slide decks. He also noted that GPT-5.4 operates faster and more cost-effectively than several competing frontier AI models.
Better Accuracy and Reduced Errors
OpenAI has also focused on improving the reliability of AI-generated information. Internal testing shows that GPT-5.4 is 33 percent less likely to produce incorrect factual claims compared with GPT-5.2.
In addition, the company reports that overall responses generated by the model are 18 percent less likely to contain mistakes. This improvement reflects ongoing efforts to reduce hallucinations and increase factual consistency in AI systems.
For businesses relying on AI for research, reporting, and professional tasks, improved reliability can significantly enhance trust in automated outputs.
Introduction of the New Tool Search System
Another major innovation introduced with GPT-5.4 is a feature called Tool Search, designed to improve how AI models interact with external tools and APIs.
Previously, developers needed to include definitions for every available tool within system prompts. This process often consumed large numbers of tokens and could slow down request processing, particularly in applications that used dozens of integrated tools.
Tool Search solves this problem by allowing the model to retrieve tool definitions only when they are needed. As a result, token usage is reduced, requests become faster, and developers can integrate larger tool ecosystems into their applications without sacrificing efficiency.
This improvement is particularly valuable for complex AI workflows that rely on multiple software integrations.
New Safety Evaluation for Chain-of-Thought Reasoning
OpenAI has also introduced a new safety evaluation focusing on chain-of-thought reasoning, which refers to the internal step-by-step logic AI models generate when solving complex problems.
Researchers have raised concerns that AI systems might sometimes misrepresent their reasoning process or hide internal steps under certain circumstances. To address this issue, OpenAI conducted tests to evaluate how GPT-5.4 handles reasoning transparency.
Early results suggest that GPT-5.4 Thinking is less likely to conceal its reasoning process compared with earlier models. According to the company, this indicates that monitoring chain-of-thought behavior can remain an effective method for improving AI safety and accountability.
A Major Step Forward for Enterprise AI
With enhanced reasoning capabilities, improved accuracy, better efficiency, and new tool integration systems, GPT-5.4 represents a significant advancement in OpenAI’s AI model lineup.
The model is expected to play a key role in enterprise applications, helping businesses automate complex workflows, analyze large datasets, and improve productivity across industries.
As AI technology continues to evolve, releases like GPT-5.4 highlight how rapidly the field is advancing toward more capable and reliable intelligent systems.
You May Like


