Categories: Business

OpenAI’s agent tool may be about to be released

OpenAI may be about to release an AI tool that can take control of your PC and perform actions on your behalf.

Tibor Blaho, a software engineer known for accurately disclosing upcoming AI products, claims to have discovered evidence of OpenAI’s long-rumored Operator tool. Publications including Bloomberg have previously reported on Operator, which is considered an “agent” system capable of autonomously handling tasks such as writing code and booking travel.

According to The Information, OpenAI is targeting January as Operator’s release month. The code discovered by Blaho this weekend adds credence to this information.

OpenAI’s ChatGPT client for macOS has gained options, hidden for now, to set shortcuts to “Toggle Operator” and “Force Quit Operator,” according to Blaho. And OpenAI has added references to Operator on its website, Blaho said – although references that are not yet publicly visible.

According to Blaho, OpenAI’s site also contains not-yet-public tables comparing Operator’s performance to other computer-based AI systems. Tables could very well be placeholders. But if the numbers are accurate, they suggest that Operator is not 100% reliable, depending on the task.

On OSWorld, a benchmark that attempts to mimic a real-world computing environment, “OpenAI Computer Use Agent (CUA)” – perhaps the AI ​​model that powers Operator – scores 38.1%, ahead of the model from computer control of Anthropic but well below the 72.4% of humans. score. OpenAI CUA outperforms human performance on WebVoyager, which evaluates an AI’s ability to navigate and interact with websites. But the model falls far short of the human-level scores of another web-based benchmark, WebArena, according to leaked benchmarks.

The operator also struggles to perform tasks that a human could easily perform, if the leak is to be believed. In a test that asked Operator to register with a cloud provider and launch a virtual machine, Operator only succeeded 60% of the time. Tasked with creating a Bitcoin wallet, the operator was only successful 10% of the time.

We have reached out to OpenAI for comment and will update this article if we receive a response.

OpenAI’s imminent entry into the AI ​​agent space comes as competitors including Anthropic, Google and others play for the nascent segment. AI agents may be risky and speculative, but tech giants are already touting them as the next big thing in AI. The AI ​​agent market could be worth $47.1 billion by 2030, according to analytics firm Markets and Markets.

Today’s agents are rather primitive. But some experts have expressed concerns about their safety, should the technology improve rapidly.

One of the leaked charts shows the operator performing well on some security assessments, including tests that attempt to trick the system into performing “illicit activities” and searching for “sensitive personal data.” It appears that security testing is one of the reasons for Operator’s long development cycle. In a recent article on X, OpenAI co-founder Wojciech Zaremba criticized Anthropic for releasing an agent who he said lacked security measures.

“I can only imagine the backlash if OpenAI made a similar release,” Zaremba wrote.

It is worth noting that OpenAI has been criticized by AI researchers, including former staff members, for allegedly downplaying security work in favor of rapid production of its technology.

remon Buul

Recent Posts

The former LASD jailer does not argue any tales

A former guard assistant from the Sheriff's Department of the County of Los Angeles did…

4 minutes ago

Novak Djokovic has dropped Indian wells by the world n ° 85 Botic van de Zandschulp … while coach Andy Murray looks from the stands

By Matthew Lambwell Posted: 21:03 East, March 8, 2025 | Update: 21:03 East, March 8,…

18 minutes ago

Koalas, a dik-dik and lazy bears: these are new baby animals at the San Diego Zoo

Spring is approaching and a safe sign of the season is the new babies of…

20 minutes ago

Worse penalty of all time!? The YouTube Mrbeast phenomenon sees its dynamic kick easily saved in the Sidemen charity match in Wembley

The most subscribed creator of Youtube Mrbeast was refused by Streamer XQCThe Sidemen charity match…

34 minutes ago

Women responsible for cruelty to animals in Costa Dog-Kicking Case contra

Two women who were arrested at the end of February for their alleged roles in…

36 minutes ago